EGU24-1042, updated on 08 Mar 2024
https://doi.org/10.5194/egusphere-egu24-1042
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

Assessing Job Wrapping as an Strategy for Workflow Optimization on Shared HPC Platforms

Manuel Giménez de Castro Marciani1,2, Gladys Utrera2, Miguel Castrillo1, and Mario C. Acosta1,2
Manuel Giménez de Castro Marciani et al.
  • 1Earth Science Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
  • 2Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona, Spain

Experimenting with modern ESM inherently requires a workflow organization to handle the multiple steps comprising of, but not limited to, execution, data governance, cleaning, and coordinating multiple machines. And for climate experiments, due to long scale of the simulations, workflows are even more critical. The community has thoroughly proposed enhancements for reducing the runtime of the models, but long has overlooked the time to response, which also takes into account the queue time. And, that is what we aim to optimize by wrapping jobs, which would otherwise be submitted individually, onto a single one. The intricate three-way interaction of the HPC system usage, scheduler policy, and user's past usage is the main challenge addressed here to analyze the impact of wrapping jobs.

How to cite: Giménez de Castro Marciani, M., Utrera, G., Castrillo, M., and Acosta, M. C.: Assessing Job Wrapping as an Strategy for Workflow Optimization on Shared HPC Platforms, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-1042, https://doi.org/10.5194/egusphere-egu24-1042, 2024.