EGU23-7177
https://doi.org/10.5194/egusphere-egu23-7177
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Destination Earth Data Lake

Jordi Duatis Juarez1, Michael Schick2, Danaele Puechmaille3, Miruna Stoicescu4, and Borys Saulyak5
Jordi Duatis Juarez et al.
  • 1EUMETSAT, TSS/PRS, Germany (jordi.duatis@eumetsat.int)
  • 2EUMETSAT, TSS/DSA, Germany (Michael.Schick@eumetsat.int)
  • 3EUMETSAT, TSS/DSA, Germany (Danaele.Puechmaille@eumetsat.int)
  • 4EUMETSAT, TSS/DSA, Germany (Miruna.Stoicescu@eumetsat.int)
  • 5EUMETSAT, TSS/DSA, Germany (Borys.Saulyak@eumetsat.int)

Destination Earth is an operational service under the lead of the European Commission being implemented jointly by ESA, ECMWF and EUMETSAT.

The presentation will provide insights into the EUMETSAT Data Lake Service component of the Destination Earth undertaking.

The objective of the European Commission’s Destination Earth (DestinE) initiative is to deploy several highly accurate digital replicas of the Earth (Digital Twins) in order to monitor and simulate natural as well as human activities and their interactions, to develop and test “what-if” scenarios that would enable more sustainable developments and support European environmental policies. DestinE addresses the challenge to manage and make accessible the sheer amount of data generated by the Digital Twins and observation data located at external sites such as the ones depicted in the figure below. This data will be made available fast enough and in a format ready to support analysis scenarios proposed by the DestinE service users.

 

Figure:  DestinE Data Sources (green) and Stakeholders (orange)

 

The “DestinE Data Lake” (DEDL) is one of the three Destination Earth components interacting with:

  • the Digital Twin Engine (DTE), which runs the simulation models, under ECMWF responsibility
  • the DestinE Core Service Platform (DESP), which represents the user entry point to the DestinE services and data, under ESA responsibility

The DestinE Data Lake (DEDL) fulfils the storage and access requirements for any data that is offered to DestinE users. It provides users with a seamless access to the datasets, regardless of data type and location. Furthermore, the DEDL supports big data processing services, such as near-data processing to maximize throughput and service scalability. The data lake is built inter alia upon existing data lakes such as Copernicus DIAS, ESA, EUMETSAT, ECMWF as well as complementary data from diverse sources like federated data spaces, in-situ or socio-economic data. The DT Data Warehouse is a sub-component of the DEDL which stores relevant subsets of the output from each  digital twin (DT) execution being powered by ECMWFs Hyper-Cube service.

How to cite: Duatis Juarez, J., Schick, M., Puechmaille, D., Stoicescu, M., and Saulyak, B.: Destination Earth Data Lake, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-7177, https://doi.org/10.5194/egusphere-egu23-7177, 2023.