EGU25-5694, updated on 14 Mar 2025
https://doi.org/10.5194/egusphere-egu25-5694
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Tuesday, 29 Apr, 09:25–09:35 (CEST)
 
Room 1.85/86
Development of a GPU-accelerated, Finite Element based Dynamical Core for Sea Ice
Robert Jendersie1,2, Christian Lessig3,2, and Thomas Richter1
Robert Jendersie et al.
  • 1Institute for Analysis and Numerics, Otto-von-Guericke University Magdeburg, Germany
  • 2Institute for Simulation and Graphics, Otto-von-Guericke University Magdeburg, Germany
  • 3European Centre for Medium-Range Weather Forecasts, Bonn, Germany

Sea ice is an import part of Earth's climate system. Yet, an accurate, highly resolved simulation of sea ice dynamics remains challenging. As the development of faster processors has slowed down, a turn to more specialized hardware is needed to achieve more accurate simulations at higher resolutions. Graphics processing units (GPUs) offer an order of magnitude higher floating-point performance and efficiency compared to CPUs. However, their full utilization often also requires significant engineering effort. Therefore, several frameworks have emerged in recent years which aim to simplify general-purpose GPU programming. In particular, heterogeneous compute frameworks such as SYCL and Kokkos make it possible to develop a unified code base that works accross GPUs and CPUs. Similarly, machine learning frameworks like PyTorch combine an easy to use interface with highly specialized backends that can make it possible to transparently exploit new hardware features to accelerate large-scale linear algebra workloads. Furthermore, their use provides a simple path-way to integrate machine learning components into simulations.

In this talk, we compare available options for the GPU-parallelizaton of the novel sea-ice code neXtSIM-DG. Its dynamical core is based on higher-order finite elements for the momentum equation and discontinous Galerkin elements for the advection. This makes the code highly parallezible. We discuss characteristics of our discretization and its consequences for the GPU implementation. For the full port of the dynamical core we use Kokkos as, based on our assessement, it combines usability with good performance. With moderate changes compared to the OpenMP-based CPU code, the new implementation achieves a sixfold speedup on the GPU while being as fast as the reference on the CPU.

How to cite: Jendersie, R., Lessig, C., and Richter, T.: Development of a GPU-accelerated, Finite Element based Dynamical Core for Sea Ice, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-5694, https://doi.org/10.5194/egusphere-egu25-5694, 2025.