EGU24-11768, updated on 22 Oct 2024
https://doi.org/10.5194/egusphere-egu24-11768
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

High-Efficiency Rainfall Data Compression Using Binarized Convolutional Autoencoder

Manuel Traub1, Fedor Scholz2, Thomas Scholten3, Christiane Zarfl4, and Martin V. Butz5
Manuel Traub et al.
  • 1Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (manuel.traub@uni-tuebingen.de)
  • 2Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (fedor.scholz@uni-tuebingen.de)
  • 3Soil Science and Geomorphology, University of Tübingen, Tübingen, Germany, (thomas.scholten@uni-tuebingen.de)
  • 4Environmental Systems Analysis, University of Tübingen, Tübingen, Germany, (christiane.zarfl@uni-tuebingen.de)
  • 5Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany, (martin.butz@uni-tuebingen.de)

In the era of big data, managing and storing large-scale meteorological datasets is a critical challenge. We focus on high-resolution rainfall data, which is crucial to atmospheric sciences, climate research, and real-time weather forecasting. This study introduces a deep learning-based approach to compress the German Radar-Online-Aneichung (RADOLAN) rainfall dataset. We achieve a compression ratio of 200:1 while maintaining a minimal mean squared reconstruction error (MSE). Our method combines a convolutional autoencoder with a novel binarization mechanism, to compress data from a resolution of 900x900 pixels at 32-bit depth to 180x180 pixels at 4-bit depth. Leveraging the ConvNeXt architecture (Zhuang Liu, et al., 'A ConvNet for the 2020s'), our method learns a convolutional autoencoder for enhanced meteorological data compression. ConvNeXt introduces key architectural modifications, such as revised layer normalization and expanded receptive fields, taking inspiration from Vision Transformer to form a modern ConvNet. Our novel binarization mechanism, pivotal for achieving the high compression ratio, operates by dynamically quantizing the latent space representations using a novel magnitude specific noise injection technique. This quantization not only reduces the data size but also preserves crucial meteorological information as our low reconstruction MSE demonstrates. Beyond rainfall data, our approach shows promise for other types of high-resolution meteorological datasets, such as temperature, humidity, etc. Adapting our method to these modalities could further streamline the data management processes in meteorological deep learning scenarios and thus facilitate efficient storage and processing of diverse meteorological datasets.

How to cite: Traub, M., Scholz, F., Scholten, T., Zarfl, C., and Butz, M. V.: High-Efficiency Rainfall Data Compression Using Binarized Convolutional Autoencoder, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-11768, https://doi.org/10.5194/egusphere-egu24-11768, 2024.