High-Efficiency Rainfall Data Compression Using Binarized Convolutional Autoencoder

Manuel Traub; Fedor Scholz; Thomas Scholten; Christiane Zarfl; Martin V. Butz

doi:https://doi.org/10.5194/egusphere-egu24-11768

[Back] [Session HS3.4]

EGU24-11768, updated on 22 Oct 2024

https://doi.org/10.5194/egusphere-egu24-11768

EGU General Assembly 2024

© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

High-Efficiency Rainfall Data Compression Using Binarized Convolutional Autoencoder

Manuel Traub¹, Fedor Scholz², Thomas Scholten³, Christiane Zarfl⁴, and Martin V. Butz⁵

Manuel Traub et al.

¹Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (manuel.traub@uni-tuebingen.de)
²Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (fedor.scholz@uni-tuebingen.de)
³Soil Science and Geomorphology, University of Tübingen, Tübingen, Germany, (thomas.scholten@uni-tuebingen.de)
⁴Environmental Systems Analysis, University of Tübingen, Tübingen, Germany, (christiane.zarfl@uni-tuebingen.de)
⁵Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (martin.butz@uni-tuebingen.de)

In the era of big data, managing and storing large-scale meteorological datasets is a critical challenge. We focus on high-resolution rainfall data, which is crucial to atmospheric sciences, climate research, and real-time weather forecasting. This study introduces a deep learning-based approach to compress the German Radar-Online-Aneichung (RADOLAN) rainfall dataset. We achieve a compression ratio of 200:1 while maintaining a minimal mean squared reconstruction error (MSE). Our method combines a convolutional autoencoder with a novel binarization mechanism, to compress data from a resolution of 900x900 pixels at 32-bit depth to 180x180 pixels at 4-bit depth. Leveraging the ConvNeXt architecture (Zhuang Liu, et al., 'A ConvNet for the 2020s'), our method learns a convolutional autoencoder for enhanced meteorological data compression. ConvNeXt introduces key architectural modifications, such as revised layer normalization and expanded receptive fields, taking inspiration from Vision Transformer to form a modern ConvNet. Our novel binarization mechanism, pivotal for achieving the high compression ratio, operates by dynamically quantizing the latent space representations using a novel magnitude specific noise injection technique. This quantization not only reduces the data size but also preserves crucial meteorological information as our low reconstruction MSE demonstrates. Beyond rainfall data, our approach shows promise for other types of high-resolution meteorological datasets, such as temperature, humidity, etc. Adapting our method to these modalities could further streamline the data management processes in meteorological deep learning scenarios and thus facilitate efficient storage and processing of diverse meteorological datasets.

How to cite: Traub, M., Scholz, F., Scholten, T., Zarfl, C., and Butz, M. V.: High-Efficiency Rainfall Data Compression Using Binarized Convolutional Autoencoder, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-11768, https://doi.org/10.5194/egusphere-egu24-11768, 2024.