ESSI3.3 | Scalable and FAIR Workflow Approaches in Earth System Science: Addressing Data and Computational Challenges
EDI
Scalable and FAIR Workflow Approaches in Earth System Science: Addressing Data and Computational Challenges
Co-organized by CR6/GI2/HS13/NP4/TS9
Convener: Karsten Peters-von Gehlen | Co-conveners: Miguel Castrillo, Ivonne Anders, Donatello Elia, Manuel Giménez de Castro Marciani

Performing research in Earth System Science is increasingly challenged by the escalating volumes and complexity of data, requiring sophisticated workflow methodologies for efficient processing and data reuse. The complexity of computational systems, such as distributed and high-performance heterogeneous computing environments, further increases the need for advanced orchestration capabilities to perform and reproduce simulations effectively. On the same line, the emergence and integration of data-driven models, next to the traditional compute-driven ones, introduces additional challenges in terms of workflow management. This session delves into the latest advances in workflow concepts and techniques essential to address these challenges taking into account the different aspects linked with High-Performance Computing (HPC), Data Processing and Analytics, and Artificial Intelligence (AI).

In the session, we will explore the importance of the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles and provenance in ensuring data accessibility, transparency, and trustworthiness. We will also address the balance between reproducibility and security, addressing potential workflow vulnerabilities while preserving research integrity.

Attention will be given to workflows in federated infrastructures and their role in scalable data analysis. We will discuss cutting-edge techniques for modeling and data analysis, highlighting how these workflows can manage otherwise unmanageable data volumes and complexities, as well as best practices and progress from various initiatives and challenging use cases (e.g., Digital Twins of the Earth and the Ocean).

We will gain insights into FAIR Digital Objects, (meta)data standards, linked-data approaches, virtual research environments, and Open Science principles. The aim is to improve data management practices in a data-intensive world.
On these topics, we invite contributions from researchers illustrating their approach to scalable workflows as well as data and computational experts presenting current approaches offered and developed by IT infrastructure providers enabling cutting edge research in Earth System Science.

Performing research in Earth System Science is increasingly challenged by the escalating volumes and complexity of data, requiring sophisticated workflow methodologies for efficient processing and data reuse. The complexity of computational systems, such as distributed and high-performance heterogeneous computing environments, further increases the need for advanced orchestration capabilities to perform and reproduce simulations effectively. On the same line, the emergence and integration of data-driven models, next to the traditional compute-driven ones, introduces additional challenges in terms of workflow management. This session delves into the latest advances in workflow concepts and techniques essential to address these challenges taking into account the different aspects linked with High-Performance Computing (HPC), Data Processing and Analytics, and Artificial Intelligence (AI).

In the session, we will explore the importance of the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles and provenance in ensuring data accessibility, transparency, and trustworthiness. We will also address the balance between reproducibility and security, addressing potential workflow vulnerabilities while preserving research integrity.

Attention will be given to workflows in federated infrastructures and their role in scalable data analysis. We will discuss cutting-edge techniques for modeling and data analysis, highlighting how these workflows can manage otherwise unmanageable data volumes and complexities, as well as best practices and progress from various initiatives and challenging use cases (e.g., Digital Twins of the Earth and the Ocean).

We will gain insights into FAIR Digital Objects, (meta)data standards, linked-data approaches, virtual research environments, and Open Science principles. The aim is to improve data management practices in a data-intensive world.
On these topics, we invite contributions from researchers illustrating their approach to scalable workflows as well as data and computational experts presenting current approaches offered and developed by IT infrastructure providers enabling cutting edge research in Earth System Science.