EGU25-6201, updated on 14 Mar 2025
https://doi.org/10.5194/egusphere-egu25-6201
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Tuesday, 29 Apr, 15:10–15:20 (CEST)
 
Room -2.92
Advancing Computational Workflow Sharing in Earth Science: Insights from DT-GEO and Geo-INQUIRE
Marco Salvi1, Rossana Paciello1, Valerio Vinciarelli2, Kety Giuliacci1, Daniele Bailo1, Pablo Orviz3, Keith Jeffery4, Manuela Volpe1, Roberto Tonini1, and Alejandra Guerrero5
Marco Salvi et al.
  • 1Istituto Nazionale di Geofisica e Vulcanologia, Rome, Italy
  • 2EPOS ERIC, Rome, Italy
  • 3Institute of Physics of Cantabria, Santander, Spain
  • 4Keith G Jeffery Consultants, Faringdon, UK
  • 5Consejo Superior de Investigaciones Científicas, Madrid, Spain

The increasing complexity and volume of data in Solid Earth Science necessitate robust solutions for workflow representation, sharing, and reproducibility. Within the DT-GEO (https://dtgeo.eu/) project, we addressed the challenge of creating interoperable and discoverable representations of computational workflows to facilitate data reuse and collaboration. Leveraging the EPOS Platform (https://www.epos-eu.org/), a multidisciplinary research infrastructure focused on Solid Earth Science, we aimed to expose workflows, datasets, and software to the community while adhering to the FAIR (Findable, Accessible, Interoperable, and Reusable) principles. While the EPOS-DCAT-AP (https://github.com/epos-eu/EPOS-DCAT-AP) model, already used in EPOS, can effectively represent datasets and software, it lacks direct support for computational workflows, necessitating the adoption of alternative standards.

To overcome this limitation, we employed the Common Workflow Language (CWL, https://www.commonwl.org/) to describe workflows, capturing their structure, software, datasets, and dependencies. The developed CWL representations are "abstract" focusing on general workflow structures while omitting execution-specific details to prioritize interoperability. To package these workflows along with metadata, we utilized Workflow Run Crate, an extension of the RO-Crate (https://www.researchobject.org/ro-crate/) standard. Together, these technologies enable workflows to become self-contained entities, simplifying sharing and reuse. 

This approach not only aligns with community standards but also benefits from a mature ecosystem of tools and libraries, ensuring seamless integration and widespread applicability. Initial implementations within the DT-GEO project serve as a model for adoption in related initiatives such as Geo-INQUIRE (https://www.geo-inquire.eu/), where similar methodologies are being used to share workflows derived from the Simulation Data Lake (SDL) infrastructure. These implementations pave the way for broader integration within the EPOS Platform, enhancing access to advanced workflows across disciplines.

Our contribution highlights the value of adopting standardized tools and methodologies for workflow management in Solid Earth Science, showcasing how CWL and RO-Crate streamline interoperability and foster collaboration. These advances address challenges in data and computational management, contributing to the scalable FAIR workflows essential for tackling the complexities of Solid Earth Science. Moving forward, the integration of these standards across projects like DT-GEO and Geo-INQUIRE will further enhance the EPOS Platform's capabilities, offering a unified gateway to reproducible, secure, and trustworthy workflows that meet the evolving needs of the scientific community.

How to cite: Salvi, M., Paciello, R., Vinciarelli, V., Giuliacci, K., Bailo, D., Orviz, P., Jeffery, K., Volpe, M., Tonini, R., and Guerrero, A.: Advancing Computational Workflow Sharing in Earth Science: Insights from DT-GEO and Geo-INQUIRE, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-6201, https://doi.org/10.5194/egusphere-egu25-6201, 2025.