EGU26-20630, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-20630
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Friday, 08 May, 17:20–17:30 (CEST)
 
Room -2.33
Compression and Reconstruction of High-Dimensional Weather Simulation Data Using Tensor Decompositions
Clara Hartmann1, Rafael Ballester-Ripoll2, Julian A. Croci1, Jorge Gacitua Gutierrez3,4, Juan Jose Ruiz3,4, Paola Salio3,4, Alexandra Diehl5, and Renato Pajarola1
Clara Hartmann et al.
  • 1University of Zürich, Zurich, Switzerland
  • 2School of Science and Technology, IE University, Madrid, Spain
  • 3Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Ciencias de la Atmósfera y los Océanos. Buenos Aires, Argentina
  • 4CONICET – Universidad de Buenos Aires. Centro de Investigaciones del Mar y la Atmósfera (CIMA). Buenos Aires, Argentina
  • 5University of Southern Denmark. Odense, Denmark

High-resolution numerical weather and climate simulations increasingly produce very large data with high dimensionality. Such datasets usually span three spatial dimensions, time, multiple physical variables, and ensemble members, leading to six-dimensional (6D) hypervolume datasets. Being grid-based, these datasets can be interpreted as 6D data tensors. The storage, processing, visualization, and analysis of such large data poses significant computational and memory storage challenges. Tensor decomposition and approximation methods have proven to be an efficient tool for compression and reconstruction of such large, high-dimensional scientific datasets. Using rigorous mathematical principles, tensor decompositions are exploiting multi-linear structure and redundancy inherent in scientific data, leading to an effective compression of the datasets while providing visually accurate results.

In this work, we investigate the applicability of tensor decompositions for the compression and efficient representation of 6D weather simulation data. We focus on two of the state-of-the-art low-rank tensor formats, tensor-train (TT) and Tucker decompositions. These methods generalize the singular value decomposition (SVD) to higher-order tensors, enabling compression of spatial, temporal, and physical modes through rank reduction. Therefore, the large high-dimensional tensor is factorized into multiple smaller, rank-reduced tensors with lower dimensionality, reducing the size of the original data significantly while preserving essential features. Such a reduced representation is also called a tensor approximation (TA).

We apply the tensor decompositions to a real-world weather simulation dataset from the Alpine region of Switzerland (COSMO-1E), organized along longitude, latitude, vertical level, time, physical variables (such as temperature), and 11 ensemble dimensions. We evaluate the performance of the compression in terms of storage reduction, relative reconstruction error, peak-signal-to-noise-ratio (PSNR), structural similarity index measure (SSIM), computational costs, and visual comparison to the original data. Our results demonstrate significant compression ratios while preserving high visual accuracy. For example, a TT-based compression with a compression ratio of 1 : 900 provides results with a relative error of only 0.0005. The obtained compression ratio reduces the size of 4GB of the original dataset to 4.6MB for the compressed dataset. Lower compression ratios lead to even higher accuracy.

Beyond efficient data compression, the linear structure of the tensor decompositions allows for efficient application of filters in the tensor domain. The computation of the mean, standard deviation or similar linear operations along user-defined dimensions can directly be performed on the decomposed tensors, without ever having to reconstruct the large 6D dataset. Furthermore, the structure of the tensors allows for efficient partial reconstruction and visualization of slices or subsets of the dataset without reconstructing the complete dataset.

Overall, this work highlights tensor decompositions as powerful tool for managing the growing size and complexity of high-dimensional weather simulation data. Their linear structure, which allows for efficient filter application in the compressed domain, makes them especially suitable for scientific analysis of complex datasets. Their integration into geoscientific data pipelines offers a promising pathway towards scalable and accurate data compression and analysis in numerical weather prediction and climate science. 

How to cite: Hartmann, C., Ballester-Ripoll, R., Croci, J. A., Gacitua Gutierrez, J., Ruiz, J. J., Salio, P., Diehl, A., and Pajarola, R.: Compression and Reconstruction of High-Dimensional Weather Simulation Data Using Tensor Decompositions, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-20630, https://doi.org/10.5194/egusphere-egu26-20630, 2026.