ECSS2025-218, updated on 08 Aug 2025
https://doi.org/10.5194/ecss2025-218
12th European Conference on Severe Storms
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
A novel Deep Learning framework for lightning probabilistic prediction based on ERA5 reanalysis data and lightning observations.
Adrien Burq1,2, Greta Cazzaniga1, Davide Faranda1,3,4, Mathieu Vrac1, Victor Xing2, and Jean Jouhaud2
Adrien Burq et al.
  • 1Laboratoire des Sciences du Climat et de l'Environnement, Gif-sur-Yvette, France (adrien.burq@lsce.ipsl.fr)
  • 2Descartes Underwriting, Paris, France
  • 3London Mathematical Laboratory, London, United Kingdom
  • 4Laboratoire de Météorologie dynamique, Paris, France

Traditional methods for studying thunderstorm climatology and associated hazards often rely on analyzing long-term trends of key environmental variables. These variables are derived from low-resolution reanalysis datasets like ERA5 and are aggregated over extended time periods (Taszarek et al., 2021). The methods also derive probability of occurrence of convective events, providing useful insights into large-scale climatological trends (Battaglioli et al., 2023). However, they struggle to capture the fine-scale characteristics of individual events. Their representations tend to suffer from probabilistic smoothing effects—such as overestimating hazard occurrence in regions without activity and underestimating it in areas of high activity—leading to unrealistic distributions of hazards in space, time, and intensity.

Machine learning approaches, including random forests, gradient boosting and more recently deep learning models, have improved short-term lightning nowcasting (McGovern et al., 2023). However, they rely on high-resolution inputs such as radar and satellite data which constrains their usage because of the spatio-temporal limitation of such data. To address this limitation, we develop a deep learning model tailored for reanalysis data which enables us to apply the model on a global scale and over a much larger period of time to study the evolution of thunderstorm activity.

Given a state of the atmosphere, we generate an ensemble forecast of lightnings at an hourly time resolution and 0.25° spatial resolution. Compared to previous models (Battaglioli et al., 2023, Saha et al., 2025), we use 3D inputs in our model to directly output a map of lightning with statistically coherent spatial structures. Additionally, we make ensemble predictions to capture a wide range of possible realistic scenarios for a given set of thermodynamic and dynamic variables.

Technically, we develop a guided diffusion model that learns to generate lightning maps at an hourly timestep over western Europe. Given 3 maps of environmental conditions (representing instability, humidity and wind-shear) derived from the ERA5 reanalysis and an ensemble of 2D noise data, we generate an ensemble of possible lightning maps.

Because of the large temporal availability of ERA5 data, our model will enable us to detect changes in past thunderstorm activity. 

How to cite: Burq, A., Cazzaniga, G., Faranda, D., Vrac, M., Xing, V., and Jouhaud, J.: A novel Deep Learning framework for lightning probabilistic prediction based on ERA5 reanalysis data and lightning observations., 12th European Conference on Severe Storms, Utrecht, The Netherlands, 17–21 Nov 2025, ECSS2025-218, https://doi.org/10.5194/ecss2025-218, 2025.

Supplementary materials

Supplementary material file

Comments on the supplementary material

AC: Author Comment | CC: Community Comment | Report abuse

supplementary materials version 1 – uploaded on 12 Nov 2025, no comments

Post a comment