EGU23-7842, updated on 30 Mar 2023
https://doi.org/10.5194/egusphere-egu23-7842
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Destination Earth - Processing Near Data and Massive Data Handling 

Danaele Puechmaille1, Jordi Duatis Juarez2, Miruna Stoicescu3, Michael Schick4, and Borys Saulyak5
Danaele Puechmaille et al.
  • 1EUMETSAT, TSS/DSA, Germany (Danaele.Puechmaille@eumetsat.int)
  • 2EUMETSAT, TSS/PRS, Germany (Jordi.Duatis@eumetsat.int)
  • 3EUMETSAT, TSS/DSA, Germany (Miruna.Stoicescu@eumetsat.int)
  • 4EUMETSAT, TSS/DSA, Germany (Michael.Schick@eumetsat.int)
  • 5EUMETSAT, TSS/DSA, Germany (Borys.Saulyak@eumetsat.int)

Destination Earth is an operational service under the lead of the European Commission being implemented jointly by ESA, ECMWF and EUMETSAT.

The presentation will provide insights of how Destination Earth provides Near Data Processing and deals with Massive Data.

The objective of the European Commission’s Destination Earth (DestinE) initiative is to deploy several highly accurate digital replicas of the Earth (Digital Twins) in order to monitor and simulate natural as well as human activities and their interactions, to develop and test “what-if” scenarios that would enable more sustainable developments and support European environmental policies. DestinE addresses the challenge to manage and make accessible the sheer amount of data generated by the Digital Twins and observation data located at external sites such as the ones depicted in the figure below. This data will be made available fast enough and in a format ready to support analysis scenarios proposed by the DestinE service users.

 

Figure 1 :  DestinE Data Sources (green) and Stakeholders (orange)

 

The “DestinE Data Lake” (DEDL) is one of the three Destination Earth components interacting with:

  • the Digital Twin Engine (DTE), which runs the simulation models, under ECMWF responsibility
  • the DestinE Core Service Platform (DESP), which represents the user entry point to the DestinE services and data, under ESA responsibility

The DestinE Data Lake (DEDL) fulfils the storage and access requirements for any data that is offered to DestinE users. It provides users with a seamless access to the datasets, regardless of data type and location. Furthermore, the DEDL supports big data processing services, such as near-data processing to maximize throughput and service scalability. The data lake is built inter alia upon existing data lakes such as Copernicus DIAS, ESA, EUMETSAT, ECMWF as well as complementary data from diverse sources like federated data spaces, in-situ or socio-economic data. The DT Data Warehouse is a sub-component of the DEDL which stores relevant subsets of the output from each  digital twin (DT) execution being powered by ECMWFs Hyper-Cube service.

During the session, EUMETSAT’s representative will share to the community how the Destination Earth Data Lake component implements and takes advantage of Near Data Processing and also how the System handles massive data access and exchange. The Destination Earth Data Portfolio will be presented.

Figure 2: Destination Earth Data Portfolio

How to cite: Puechmaille, D., Duatis Juarez, J., Stoicescu, M., Schick, M., and Saulyak, B.: Destination Earth - Processing Near Data and Massive Data Handling , EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-7842, https://doi.org/10.5194/egusphere-egu23-7842, 2023.