EGU23-10681, updated on 19 Apr 2023
https://doi.org/10.5194/egusphere-egu23-10681
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

EarthQA: A Question Answering Engine for Earth Observation Data Archives

Dharmen Punjani1,2, Eleni Tsalapati1, and Manolis Koubarakis1
Dharmen Punjani et al.
  • 1University of Athens, Dept. of Informatics and Telecommunication, Greece (dharmen.punjani@gmail.com)
  • 2Université Jean Monnet Saint Etienne, France

The standard way for earth observation experts or users to retrieve images from image archives (e.g., ESA's Copernicus Open Access Hub) is to use a graphical user interface, where they can select the geographical area of the image they are interested in and additionally they can specify some other metadata, such as sensing period, satellite platform and cloud cover.

In this work, we are developing the question-answering engine EarthQA that takes as input a question expressed in natural language (English) that asks for satellite images satisfying certain criteria and returns links to such datasets, which can be then downloaded from the CREODIAS cloud platform. To answer user questions, EarthQA queries two interlinked knowledge graphs: a knowledge graph encoding metadata of satellite images from the CREODIAS cloud platform (the SPARQL endpoint of CREODIAS) and the well-known knowledge graph DBpedia. Hence, the questions can refer to image metadata (e.g., satellite platform, sensing period, cloud cover), but also to more generic entities appearing in DBpedia knowledge graph (e.g., lake, Greece). In this way, the users can ask questions like “Find all Sentinel-1 GRD images taken during October 2021 that show large lakes in Greece having an area greater than 100 square kilometers”.

EarthQA follows a template-based approach to translate natural language questions into formal queries (SPARQL). Initially, it decomposes the user question by generating its dependency parse tree and then automatically disambiguates the components appearing in the question to elements of the two knowledge graphs.  In particular, it automatically identifies the spatial or temporal entities (e.g., “Greece”, “October 2021”), concepts (e.g., “lake”), spatial or temporal relations (e.g., “in”, “during”), properties (e.g., “area”) and product types (e.g., “Sentinel-1 GRD”) and other metadata (e.g., “cloud cover below 10%”) mentioned in the question and maps them to the respective elements appearing in the two knowledge graphs (dbr:Greece, dbo:Lake, dbp:area, etc). After this, the SPARQL query is automatically generated.

How to cite: Punjani, D., Tsalapati, E., and Koubarakis, M.: EarthQA: A Question Answering Engine for Earth Observation Data Archives, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-10681, https://doi.org/10.5194/egusphere-egu23-10681, 2023.