OSA1.5 | Machine Learning in Weather and Climate
Machine Learning in Weather and Climate
Including EMS Technology Achievement Award Lecture
Conveners: Richard Müller, Bernhard Reichert, Dennis Schulze, Gert-Jan Steeneveld, Roope Tervo | Co-convener: Angela Meyer
Orals Wed1
| Wed, 10 Sep, 09:00–10:30 (CEST)
 
Room E3+E4
Orals Wed2
| Wed, 10 Sep, 11:00–13:00 (CEST)
 
Room E3+E4
Orals Wed3
| Wed, 10 Sep, 14:00–15:30 (CEST)
 
Room E3+E4
Orals Wed4
| Wed, 10 Sep, 16:00–17:15 (CEST)
 
Room E3+E4
Posters P-Thu
| Attendance Thu, 11 Sep, 16:00–17:15 (CEST) | Display Wed, 10 Sep, 08:00–Fri, 12 Sep, 13:00
 
Grand Hall, P7–16
Wed, 09:00
Wed, 11:00
Wed, 14:00
Wed, 16:00
Thu, 16:00
Artificial Intelligence (AI) is revolutionizing the weather-prediction value chain and is becoming a key technology for all climate-related sciences. This session focuses on machine learning techniques and aims at bringing together research with weather and climate-related background with relevant contributions from computer sciences using these techniques.

Contributions from all kinds of machine learning studies in weather and climate are encouraged, including but not limited to:

* Global, regional and local weather prediction, including both NWP emulators and training the model directly from observations, data driven models
* ECMWF Anemoi Framework contributions, relevant Destination Earth contributions
* Postprocessing of Numerical Weather Prediction (NWP) output
* Nowcasting studies, studies using satellite data, radar data, and observational weather data
* Seasonal forecasts
* Climate-related studies, including dimensionality reduction of weather and climate data, extraction of relevant features
* Operational frameworks (MLOps), cloud ecosystems, and data flows related to AI
* AI/ML projects in the European Weather Cloud (EWC)
* Benchmark datasets and validation of the model outputs
* Quantifying the impacts of weather and climate, connecting meteorological data with non-meteorological datasets
* Human aspect -- how AI changes our work, organisations, and culture?

Orals Wed1: Wed, 10 Sep, 09:00–10:30 | Room E3+E4

Chairpersons: Bernhard Reichert, Angela Meyer
Machine Learning-Based Weather Prediction (MLWP)
09:00–09:30
|
EMS2025-15
|
solicited
|
EMS Technology Achievement Award Lecture
|
Onsite presentation
Jasper Wijnands and Marek Jacob and the Anemoi team

The Anemoi Framework (EMS Technology Award 2025) represents a pioneering effort in the integration of machine learning (ML) with meteorological forecasting, all developed through a collaborative European initiative. Data-driven machine learning approaches are currently revolutionizing weather prediction with state-of-the-art models outperforming equation-based forecasting system across a wide range of scores. Through this, they have rapidly emerged as candidates for operational weather forecasting systems, delivering accurate forecasts for low computational cost. Designed to enhance the accuracy, efficiency, and accessibility of data-driven weather forecasting, Anemoi builds upon advanced ML techniques and a modular, open-source architecture to democratize access to cutting-edge forecasting tools.

The framework is organised into distinct Python packages covering the entire machine learning lifecycle—from the creation of customised datasets from diverse meteorological sources to the development and training of advanced deep learning graph models. Once a model is trained, Anemoi enables users to run it in operations, using the outputs of physics-based NWP analyses or ensembles as initial conditions, while maintaining comprehensive lineage tracking.

Anemoi has been instrumental in the development of ML-powered weather models including AIFS (ECMWF), Bris (MET Norway), and AICON (DWD), and supports both global and limited area domain models in both deterministic and ensemble settings. These applications demonstrate Anemoi’s potential to enhance forecasting accuracy by integrating ML techniques into existing systems.

More than just a technical framework, Anemoi represents a collaborative effort among meteorological services, researchers, and technologists, fostering knowledge exchange and innovation.

How to cite: Wijnands, J. and Jacob, M. and the Anemoi team: Anemoi: A New Collaborative Framework for Data-driven Weather Forecasting, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-15, https://doi.org/10.5194/ems2025-15, 2025.

Show EMS2025-15 recording (20min) recording
09:30–09:45
|
EMS2025-239
|
Onsite presentation
Çağlar Küçük, Pascal Gfäller, Irene Schicker, Nauman Khurshid Awan, and Alexander Kann

Accurate weather prediction in complex topographical regions remains a significant challenge for numerical weather prediction (NWP). While global NWP models have advanced considerably, limited-area models still struggle to capture the fine-scale atmospheric processes critical for regions with complex topography, such as the Alps. Recent AI-based modelling approaches are transforming this landscape, with graph-based AI models showing great potential in capturing complex dynamics and offering flexibility across diverse data modalities. Most of these models, however, are developed for global applications and trained on global datasets like the ERA5 reanalysis. Despite the extensive temporal coverage of ERA5 encompassing diverse weather conditions, its limited spatial resolution is constraining its ability to resolve finer-scale atmospheric processes critical for limited-area modelling.

Building on these developments, we aim to leverage the power of graph-based AI models and the valuable information provided by ERA5 for limited-area modelling. Our approach utilizes the Anemoi framework to train specialized models for the Greater Alpine Region, centred over Austria, using a high-resolution regional reanalysis ensemble dataset. Specifically, we employ the Austrian ReAnalysis (ARA) dataset with its superior 2.5 km spatial resolution and 3-hourly temporal resolution in reanalysis to better capture localized weather phenomena over regions with complex topography regions that global models typically fail to capture accurately.

Our methodology encompasses multiple AI modelling approaches to determine the most effective forecasting strategy. These include global models focusing on our local study domain with a stretched-grid structure and limited-area models forced with external boundary conditions. Furthermore, we conduct transfer learning experiments that harmonise information from the long temporal record of ERA5 with the high spatial resolution of the ARA dataset, creating a more robust prediction system by mitigating issues in model training from the comparatively limited temporal extent of ARA.
To evaluate their effectiveness, we assess the performance of these different modelling approaches, comparing them against traditional physics-based models and validating the results against point-based observations. Additionally, we provide detailed examinations of these findings through case studies of impactful weather events in Austria, highlighting real-world applications of our approach.

The insights from this research will guide the next steps towards harnessing both large-scale and localised information with an AI-based approach, ultimately advancing the accuracy and relevance of limited-area models in operational weather forecasting. 

How to cite: Küçük, Ç., Gfäller, P., Schicker, I., Awan, N. K., and Kann, A.: Advancing Limited-area Numerical Weather Prediction with Graph-based AI: Leveraging Global and Regional Reanalysis Data for the Greater Alpine Region, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-239, https://doi.org/10.5194/ems2025-239, 2025.

Show EMS2025-239 recording (13min) recording
09:45–10:00
|
EMS2025-494
|
Onsite presentation
Marcos Martínez-Roig, Cesar Azorin-Molina, Nuria P. Plaza, Miguel Andres.Martin, Kevin Monsalvez-Pozo, Deliang Chen, Zhengzhong Zeng, Sergio M. Vicente-Serrano, Tim R. McVicar, Jose A. Guijarro, and Amir Ali Safaei-Pirooz

The generation of accurate and reliable short-term forecasts (<12 hours) of near-surface (∼ 10m above ground level) wind speed fields, hereinafter called NSWS, are crucial for various socioeconomic and environmental applications. However, monitoring and forecasting NSWS is challenging due to its inherent space-time variability, especially in regions with complex orography such as the Iberian Peninsula in Spain.

Traditional NSWS forecasting methods relies on Numerical Weather Prediction (NWP) models, which require significant computational resources. In addition, these NWP models often yield inaccurate results, especially in regions with complex orography. As a more efficient alternative to this constraint, here we explore the ability to Artificial Intelligence (AI) methods to enhance the efficiency and accuracy of short-term NSWS predictions. We propose the use of two deep learning methods:

1) A U-Net architecture based on Partial Convolutions to generate high-resolution hourly NSWS maps from station-based observations[3].

2) An encoder-decoder architecture based on mixed convolutional and recurrent (ConvLSTM) layers to predict short-term NSWS maps using the generated infilled data as input[4].

This real-time AI-based product, designed as an early warning system, generates high-resolution (3/9 km) short-term (12 h; σ=1 h) NSWS forecasts in near real-time (seconds), achieving high correlation and low prediction errors.

Meteorological stations provide accurate, site-specific wind observations but have limited spatial coverage, especially in mountainous or remote areas. In contrast, reanalysis products offer full coverage at low resolution but fail to accurately reproduce local wind conditions. Our AI-based tool bridges these gaps by combining station and simulation data, though its inference relies solely on station data, making it a cost-effective alternative to NWP models. Observations come from Spanish Meteorological Agency (AEMET)[1], while the reanalysis used is ERA5-Land (9 km)[2].

Beyond performance evaluation, we apply well-established interpretability techniques to analyze the model’s decision-making process:

1. Feature Importance methods were used to evaluate the relevance of each input time step. Both Feature Permutation and Feature Ablation revealed an expected exponential decline in importance over time, but also highlighted that time steps around 7 hours in the past play a key role in accurate forecasting, underscoring the value of long-term information.

2. Pixel Attribution techniques were used to identify important spatial regions in the input wind speed maps. Despite differences, all methods consistently emphasized regions where unexpected wind patterns or extreme events occur, revealing the model’s primary focus. Guided Grad-CAM offered the most interpretable results by combining coarse and fine details.

These interpretability analyses enhance trust in AI-driven forecasts while guiding improvements in model development. While tested on the Iberian Peninsula, the approach is adaptable to other regions. This scalable AI-based method enhances short-term NSWS forecasting for AEMET and other meteorological services, showcasing AI’s potential to improve both forecast accuracy and operational efficiency in meteorology.

How to cite: Martínez-Roig, M., Azorin-Molina, C., P. Plaza, N., Andres.Martin, M., Monsalvez-Pozo, K., Chen, D., Zeng, Z., Vicente-Serrano, S. M., McVicar, T. R., Guijarro, J. A., and Safaei-Pirooz, A. A.: Evaluation and Interpretability of AI-Driven Short-Term Wind Speed Forecasting., EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-494, https://doi.org/10.5194/ems2025-494, 2025.

Show EMS2025-494 recording (12min) recording
10:00–10:15
|
EMS2025-541
|
Onsite presentation
Tobias Selz, Wessel Bruinsma, George Craig, Stratis Markou, Richard Turner, and Anna Vaughan

In recent years, models based on artificial intelligence (AI) have become equally good or even slightly better at predicting the weather as standard operational models, which are based on solving physical equations. Although the grid size of the AI weather models is similar to that of to global operational models, it has been widely noted that forecasts from the AI models are overly smooth. This smoothness poses a potential issue for ensemble generation and for the simulation of extreme weather events, which are often caused by a superposition of multiple spatial scales. In this study, we develop a mathematical argument to better understand the reason for this low "effective resolution" of deterministic AI weather models. We find that an ideal, perfectly trained AI model follows the mean of the predictive distribution for the lead time interval which is used in its loss function during training. We demonstrate the consequences and limitations of this result with forecast data from various AI models, including Aurora, Pangu, GraphCast and GenCast.

We further demonstrate that a low effective resolution leads to better mean-square forecast error scores by reducing the double-penalty effect, especially at longer forecast lead time. To avoid this often unwanted effect, we suggest a spectral rescaling method for a fairer comparison of two models with different effective resolution. By applying this method to the AI forecasts and the ensemble and deterministic forecasts from the European Centre for Medium Range Weather Forecasting (ECMWF) we estimate to what extent the reported advantages of the AI models are only related to their smoothing. Our results will help users of AI forecasts to interpret their output correctly and guide AI developers in the design of loss function and training protocols.

How to cite: Selz, T., Bruinsma, W., Craig, G., Markou, S., Turner, R., and Vaughan, A.: On the effective resolution of AI weather models, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-541, https://doi.org/10.5194/ems2025-541, 2025.

Show EMS2025-541 recording (13min) recording
10:15–10:30
|
EMS2025-221
|
Onsite presentation
Uroš Perkan, Žiga Zaplotnik, and Gregor Skok

In physics-based numerical weather prediction models, underlying physical laws and numerical discretisation schemes offer some interpretability of the simulated processes and point to potential sources of errors. In contrast, data-driven deep-learning (DL) models lack explicit physics-based interpretation of their predictions. However, the low computational cost and auto-differentiable implementation allow for fast and extensive diagnostics of the origins of forecast errors using well-established explainable AI methods and traditional diagnostics tools. Here, we combine saliency maps and gridpoint relaxation to perform a multivariate regional analysis of the sources of forecast errors in global DL weather forecasting.

We developed ConvCastNet, a DL global weather prediction model based on depth-wise separable convolutional neural networks. The model predicts 6 atmospheric variables at 13 pressure levels and includes 2 additional single-level prognostic and 7 constant input fields. The forecast is computed on a 3-degree lon-lat grid and uses 12-hour autoregressive time stepping to roll out the forecast. ConvCastNet achieves significant success in predicting geopotential at the 500 hPa pressure level, with 8.5 days of useful forecast (based on an anomaly correlation coefficient greater than 0.6), which makes it a suitable tool for diagnostics of the origin of the forecast error.

We use ConvCastNet to systematically nudge subdomains of the forecast fields for 1) planetary boundary layer, 2) stratosphere, and 3) tropics towards a "true" weather state (reanalysis) and monitor the forecast skill improvements beyond selected subdomains. Our results show that an 8-day mid-latitude weather forecast improves significantly with relaxation in the stratosphere, whereas relaxation in the tropics has no significant impact on mid-latitude. This highlights the importance of accurately representing the stratosphere for medium-range weather prediction and the limited impact of tropical variability on mid-latitude forecasts.

Additionally, we investigate the relationship between model error sensitivity to initial conditions and relaxation experiments. By utilising the model's auto-differentiability, we analyse saliency maps, i.e. the gradients of the forecast errors with respect to input fields, to identify overlapping regions of large error sensitivity and high impact of relaxation to the truth. We find the model sensitivity largely consistent with physics-based expectations, with local errors being sensitive to the upstream dynamics and varying sensitivity to different variables and pressure levels. We believe that this combined approach provides valuable heuristics for diagnosing neural model errors and guiding targeted model improvements.

How to cite: Perkan, U., Zaplotnik, Ž., and Skok, G.: Forecast Error Diagnostics in Neural Weather Models Using Gridpoint Relaxation, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-221, https://doi.org/10.5194/ems2025-221, 2025.

Show EMS2025-221 recording (13min) recording

Orals Wed2: Wed, 10 Sep, 11:00–13:00 | Room E3+E4

Chairpersons: Roope Tervo, Bernhard Reichert
Postprocessing
11:00–11:15
|
EMS2025-288
|
Onsite presentation
Martin Widmann, Chandni Thakur, Michael Angus, Kelvin Ng, Noemi Gonczol, Raghavendra Ashrit, Andrew Orr, Gregor Leckebusch, Ruth Geen, and Ashis Mitra

Postprocessing of ensemble forecasts with Ensemble Model Output Statistics (EMOS) can reduce systematic errors in the ensemble mean and spread. Classical EMOS finds linear transformations between the original and postprocessed ensemble mean and variance that optimise the Continuous Ranked Probability Score (CRPS). Within the Weather and Climate Science for Service Partnership - India (WCSSP-India) project HEavy Precipitation Forecast Post-processing over India (HEPPI) we have implemented this approach to postprocess daily precipitation forecasts over India for the monsoon seasons 2018-2022 from the NEPS-G forecasting system run at the National Centre for Medium Range Weather Forecasting (NCMRWF) in Noida. In Angus et al. (2024) we have shown that over most of India EMOS improves the CRPS and the prediction of the probability for heavy precipitation, and also reduces over- or underdispersion in the ensemble.

The EMOS approach is not restricted to the classical linear transformations between the original and postprocessed ensemble mean and variance, and to the optimisation of CRPS. It can be made more flexible by using Machine Learning (ML) to find non-linear transformations that optimise the CRPS or other forecast evaluation criteria. Within the WCSSP-India project HEPPI-ML we have explored different versions of ML-EMOS to postprocess NEPS-G daily precipitation forecasts. These include local and non-local Multilayer Perceptrons, and convolutional neural networks. We will present first evaluation results, including a comparison of ML and classical EMOS.

 

Angus, M., M. Widmann, A. Orr, G.C. Leckebusch, R. Ashrit, and A. Mitra, 2024: A comparison of two statistical postprocessing methods for heavy‐precipitation forecasts over India during the summer monsoon. Quarterly Journal of the Royal Meteorological Society, 150(761), 1865-1883.

 

How to cite: Widmann, M., Thakur, C., Angus, M., Ng, K., Gonczol, N., Ashrit, R., Orr, A., Leckebusch, G., Geen, R., and Mitra, A.: Classical and Machine Learning EMOS for postprocessing precipitation forecasts over India, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-288, https://doi.org/10.5194/ems2025-288, 2025.

Show EMS2025-288 recording (13min) recording
11:15–11:30
|
EMS2025-206
|
Onsite presentation
John Bjørnar Bremnes

Probabilistic post-processing methods have over the last decades successfully been applied to enhance forecasts from numerical weather prediction (NWP) models. In this work, the Bernstein Quantile Networks (BQN) method is applied to gridded precipitation data for probabilistic downscaling of deterministic NWP model forecasts to hectometric-scale resolutions. In BQN, the predictive distribution is specified by a Bernstein polynomial whose coefficients are linked to input features by a neural network, enabling flexible distributional shapes to adequately represent the underlying forecast uncertainty. Models are trained using a quantile loss function that is extended to handle the point mass at zero through a censoring approach. Few restrictions on the predictive distribution combined with quantile loss make BQN applicable for more or less any target variable without modifications.

Forecast data from the Destination Earth initiative are here used to train BQN models. High-resolution, on-demand NWP runs targeting extreme events in Europe at hectometric resolutions (500m-750m) serve as the target, while coarse-resolution global IFS forecasts at 4.4 km resolution are used as inputs. As the forecast domains vary dynamically in response to anticipated extreme events across Europe, BQN models must generalise across diverse spatial and meteorological contexts. An additional complexity is that the amount of training data is limited. To address this, a few network architectures are explored and not least quantile loss variants, including adaptive weighting schemes and tail-focused penalties, to better capture extreme precipitation events with limited training data. The method provides fully specified marginal predictive distributions on grids. Possible extensions to generating scenarios by learning dependency structures are discussed.

How to cite: Bremnes, J. B.: Bernstein Quantile Networks for probabilistic downscaling of gridded extreme precipitation forecasts at hectometric scales, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-206, https://doi.org/10.5194/ems2025-206, 2025.

11:30–11:45
|
EMS2025-563
|
Online presentation
|
Martin Vozár

With improving resolution of NWP products and observation data, efficiency and scalability of processing methods are becoming increasingly relevant. Substituting the inputs with their lower-dimensional representations in downstream computations is a common technique to address this issue.

This study examines the Encoder, Decoder, and the Encoder, Decoder, and Discriminator setup, utilizing Single-Head Vision Transformer (SHViT) as a backbone architecture. Scalability with a further increase in input size and/or resolution was considered in the selection. SHViT architecture demonstrates good scaling with larger input sizes, making it a suitable candidate.

We optimize for representation learning tasks on fields of selected near-surface variables from the CERRA dataset from 01/2010 to 12/2019. The CERRA dataset was chosen due to its availability and diversity within the spatial extent. We prepared a set of smaller regions, focusing on areas with prominent orographical features. The test set consists of regions not used during optimization, while the validation set is a temporal split from the training set regions. During the evaluation, the scaling constants are assumed to be known. Orography and land-sea mask are provided as input channels for the Encoder at inference and for the Discriminator during optimization for each region.

The Discriminator is enhanced with an explicit Haar wavelet transform and spectral power transform of each variable field as additional input channels.  The latent representation is a one-dimensional vector with 256 features. We evaluate the spatial distribution of mean absolute error (MAE) and associated standard deviation.

This study is based on work carried out at IBL Software Engineering, Innovation Department as part of research and development of NWP products post-processing framework.

How to cite: Vozár, M.: Representation learning of near-surface atmospheric fields using SHViT modules, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-563, https://doi.org/10.5194/ems2025-563, 2025.

Show EMS2025-563 recording (13min) recording
Nowcasting, Remotesensing, Observational Data
11:45–12:00
|
EMS2025-550
|
Onsite presentation
Kevin Monsalvez-Pozo, Marcos Martinez-Roig, Nuria P. Plaza-Martín, Cesar Azorin-Molina, and Étienne Plésiat

Reconstruction of near-surface wind speed (NSWS; ~10 m above ground level) from local meteorological station measurements remains an open challenge in climate research. Traditional geostatistical interpolation techniques can provide partial solutions, but their reliability is often limited—especially in regions with complex topography, which are common across Spain. These methods are also highly sensitive to the number of available observations.

This work investigates the potential of state-of-the-art Deep Learning (DL) techniques for NSWS reconstruction. In particular, we employ the Climate Reconstruction AI (CRAI) model, an encoder-decoder architecture based on U-Net with partial convolutions, which we train on the Copernicus European Regional Reanalysis (CERRA) dataset, featuring a temporal resolution of three-hourly data and a spatial resolution of 5.5km. This model learns the spatiotemporal patterns of NSWS and is capable of infilling wind fields from grids with missing values.

To apply the model to real-world conditions, we focus on reconstructing daily-averaged wind speed fields from incomplete grids of observational data provided by AEMET (the State Meteorological Agency of Spain). We evaluate several model variants to assess the influence of auxiliary variables such as 2-m air temperature, surface air pressure, 2-m relative humidity and orography. In addition to the baseline U-Net approach, a convolutional attention mechanism is employed to capture complex interdependencies among variables — for example, compound events involving relative humidity, temperature, orography and wind such as the Foehn effect — as well as an LSTM-based recurrent module to leverage temporal information in the reconstruction process.

The resulting models are benchmarked on CERRA data and subsequently applied to reconstruct NSWS fields from AEMET observations across Spain.

How to cite: Monsalvez-Pozo, K., Martinez-Roig, M., Plaza-Martín, N. P., Azorin-Molina, C., and Plésiat, É.: Deep Learning-Based Reconstruction of Near-Surface Wind Speed Fields from Meteorological Observations, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-550, https://doi.org/10.5194/ems2025-550, 2025.

Show EMS2025-550 recording (12min) recording
12:00–12:15
|
EMS2025-151
|
Onsite presentation
Richard Müller

Observations from weather radars are well established for the estimation of precipitation. However these data are unavailable over the ocean and only sparely available over remote regions. Furthermore, in mountains the retrieval of precipitation is severely compromised by the scan geometry. Finally, non-meteorological noise and the saturation of the radar signal by thunderstorm clouds  also pose seroius problems to accuracy. Satellite data can help to overcome these shortcomings. However, the infrared signal provides mainly information about the cloud top temperature and thus no information about rain droplets. However, during day there is a good correlation of the cloud optical thickness and effective radii as e.g. demonstrated in the PhD of Rob Roebeling. Nevertheless in any case the satellite based precipitation needs to be calibrated. Typically, satellite instruments measuring in the micro-wave are used for this purpose. However, in this study, we use ground based data (omrbometer) for the calibration of the satellite based precipitation rate.  We investigate various machine learning methods (e.g. SVM, transformer based GANs and transfer learning) to find the best way to perform recurrent, near-real time calibration updates based on pre-trained information.  In this way, the spatial information of the satellite precipitation can be optimally combined with the high accuracy of the ground based measurements. The presentation will provide an overview about the developed methods and the validation results and the link of the work to the WMO-AINNP project.  Further a outlook will be given about the nowcasting of satellite based precipitation and its integration into the concept of seamless prediction at DWD and the combination with radar nowcasting.

 

 

How to cite: Müller, R.: A novel approach for the estimation of precipitation from geosationary satellites., EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-151, https://doi.org/10.5194/ems2025-151, 2025.

Show EMS2025-151 recording (14min) recording
Impact Studies
12:15–12:30
|
EMS2025-156
|
Onsite presentation
|
Alice Alfonsi, Jasper M. van Nieuwenhuizen, Rosina A. Derks, Luca Trani, and Jouke H. S. de Baar

Transitioning from experimental research to operational AI/ML applications is a growing priority in weather and climate. At KNMI, a dedicated MLOps team was established, with the objective of developing the infrastructure and best practices for the implementation of AI/ML models. The resulting setup enables the scalable and reliable delivery of high-quality services to society.

This contribution demonstrates the approach taken through a concrete use case: the Collaborative Quantitative Impact Forecasting (C-QIF) for wildfires. Developed at KNMI in collaboration with the Netherlands Institute for Public Safety (NIPV), C-QIF is a data-driven framework that combines meteorological data with historical wildfire records, providing probabilistic quantitative forecasts of the daily number of wildfires across the Netherlands, up to two weeks in advance. This capability plays a crucial role in enabling decisions on resource allocation and public communication during wildfire events, as well as supporting the professional craftsmanship of operational wildfire experts and first responders.

Originally developed in Octave, C-QIF is being reimplemented in Python and adapted for practical use with cloud-based technologies to enhance scalability and reliability, in order to provide real-time operational services. Data pipelines, model execution, and output delivery are being restructured, with the entire infrastructure deployed on Amazon Web Services (AWS). A self-hosted instance of MLflow, an open-source platform for managing machine learning workflows, is integrated to track model development and ensure reproducibility.

Close collaboration between researchers, engineers, and end users is critical throughout the project. The ongoing interaction ensures alignment between scientific goals and operational needs, and enables the establishment of standard practices for testing, deployment and monitoring — key elements for future model development. This contribution highlights how MLOps capabilities are being built cooperatively, and reflects on the lessons learned in operationalising research models, managing heterogeneous data, and leveraging cloud technologies for AI/ML in practice.

How to cite: Alfonsi, A., van Nieuwenhuizen, J. M., Derks, R. A., Trani, L., and de Baar, J. H. S.: MLOps Practices at KNMI: The Collaborative Quantitative Impact Forecasting Use Case, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-156, https://doi.org/10.5194/ems2025-156, 2025.

12:30–12:45
|
EMS2025-387
|
Onsite presentation
Luca Zamagni, William Aeberhard, Yun Cheng, Evelyn Mühlhofer, and Irina Mahlstein

MeteoSwiss disseminates its extreme weather warnings through various channels, one of the most important being the MeteoSwiss smart phone app. This app is the most widely used platform for informing the Swiss population, with over 4.8 million installations and a daily user base ranging from 700,000 to 1.8 million people. It provides weather forecasts and displays any active severe weather warnings. When enabled, the app also sends push notifications to alert users about such warnings.

In addition, the MeteoSwiss app allows users to submit their own weather observations, known as "Meteo reports." These crowd-sourced reports can include general weather conditions as well as specific impacts such as strong winds, hail, or heatwaves. Users can contribute their observations in three ways: (i) by uploading situational photos, (ii) by selecting predefined keywords to describe the intensity of the event, and (iii) by submitting free-text descriptions.

As MeteoSwiss advances toward impact-based warnings, these user-generated reports play a key role in improving the understanding of how different weather events affect people and infrastructure. By analyzing the relationship between past warnings and reported impacts, MeteoSwiss can refine its algorithms to better predict the consequences of future events based on physical warning parameters. A data driven model essentially predicts a selection of most likely outcomes in terms of user report based on properties of the issued warning. These include keywords describing the impact and draws from submitted photos deemed representative. This improved understanding supports the development of more accurate extreme weather warnings, ultimately enhancing preparedness and response planning.

How to cite: Zamagni, L., Aeberhard, W., Cheng, Y., Mühlhofer, E., and Mahlstein, I.: Leveraging  crowd-sourced impact data with ML to improve severe weather warnings at MeteoSwiss, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-387, https://doi.org/10.5194/ems2025-387, 2025.

Show EMS2025-387 recording (11min) recording
12:45–13:00
|
EMS2025-12
|
Onsite presentation
Assaf Shmuel and Colin Price

Atmospheric rivers (ARs) are essential components of the Earth's hydrological cycle, transporting vast amounts of moisture from the tropics to midlatitudes. While vital for sustaining water resources, ARs are also major drivers of extreme precipitation and flooding, with profound impacts on human populations and ecosystems. In this study, we develop advanced Machine Learning models to predict flood risk associated with ARs by integrating AR data with additional meteorological variables, topographic information, and other relevant factors. Achieving an Area Under the Receiver Operating Characteristic Curve (ROC AUC) score of 0.96, our models demonstrate exceptional capability in forecasting flood events and uncovering key predictors of AR-induced flood risk. For comparison, a logistic regression model tested on this prediction task using the exact same data achieved a significantly lower score of 0.79. We further validate these results on an additional flood dataset and maintain the high predictive performance, underscoring the robustness and generalization of the Machine Learning model.

Next, we evaluate our model on a case study of the 2018 California flood, an event driven by a persistent Atmospheric River that triggered severe rainfall, flooding, and mudslides. The model predictions show a substantial increase in flood risk starting from the morning of the day before the event. The predicted danger level remained high for three consecutive days, accurately capturing the prolonged impact of the Atmospheric River and highlighting the model’s capability to forecast extended flood risk windows.

We find that Atmospheric Rivers drive a third of midlatitude floods, emphasizing their critical role in flood prediction systems. Building on these models, we analyze the primary meteorological conditions driving AR flood risk. Our findings reveal an increase of over 10% in AR-related flood risk over the last four decades, driven by the rising intensity and frequency of Atmospheric Rivers. We find that the majority of the global population is exposed to ARs to some extent, including those that pose a potential flood risk. These results underscore the potential of integrating Machine Learning and AR data to enhance early warning systems and support effective flood preparedness and response in a changing climate.

How to cite: Shmuel, A. and Price, C.: Machine Learning Reveals Increasing Flood Risk from Atmospheric Rivers, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-12, https://doi.org/10.5194/ems2025-12, 2025.

Orals Wed3: Wed, 10 Sep, 14:00–15:30 | Room E3+E4

Chairperson: Richard Müller
14:00–14:15
|
EMS2025-283
|
Onsite presentation
Eleni Briola, Kasper Stener Hintz, and Leif Denby

Trapped lee waves and the associated turbulent rotors pose significant hazards for aviation and land-based transport. While high-resolution numerical weather prediction (NWP) models can capture these phenomena, there is a lack of automated, reliable tools for detecting and characterizing lee waves in model output. Traditional spectral methods, although useful, are often computationally expensive and require domain-specific tuning. This study presents an operational pipeline based on deep learning (DL) for the real-time detection and characterisation of trapped lee waves. 

Building on previous research demonstrating the effectiveness of a DL model (LeeWaveNet) for segmenting and extracting wave characteristics from vertical velocity fields in the UK, we have deployed this model in a production environment for the Iceland-Greenland region using an operational NWP model for Iceland and Greenland, named Harmonie-IG. The model was containerized and hosted on a server at Danish Meteorological Institute, receiving Harmonie-IG output as input and producing near-real-time predictions of lee wave activity. The results, including key characteristics such as wavelength, orientation, and amplitude, are stored in an AWS S3 bucket and visualized through an automated post-processing pipeline. 

To assess the generalizability and reliability of the model beyond the training domain, we performed an extended validation over Norway using the Danish ReAnalysis dataset (DANRA). In particular, we compared the predicted orientation of the lee waves with the observed wind direction, demonstrating promising correlation and physical consistency. This validation step is a key milestone towards confirming the robustness of the model when applied to new geographies and datasets. 

This work highlights not only the potential of DL in advancing the operational use of AI in meteorology, but also showcases a complete MLOps workflow—from inference at scale to data management and visualization. Our pipeline demonstrates that with the right infrastructure, deep learning models can be effectively integrated into real-time forecasting systems, providing timely and accurate identification of hazardous atmospheric features. 

How to cite: Briola, E., Stener Hintz, K., and Denby, L.: Real-Time Detection and Characterisation of Trapped Lee Waves: From Deep Learning Model to Operational Deployment in the Iceland-Greenland Region , EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-283, https://doi.org/10.5194/ems2025-283, 2025.

Show EMS2025-283 recording (13min) recording
14:15–14:30
|
EMS2025-258
|
Onsite presentation
Marko Zoldoš and Tomislav Džoić

The climatological characteristics of fog and mist at Pula Airport (Croatia) in the northeastern Adriatic were examined, with a focus on applying machine learning techniques to identify large-scale atmospheric patterns associated with their occurrence. Although the published study includes conventional statistical analyses, the primary focus of this presentation is the application of the Growing Neural Gas (GNG) algorithm—an unsupervised neural network model—for clustering synoptic conditions linked to fog and mist formation.

For this study, high-resolution 10-meter wind and mean sea level pressure (MSLP) data from the ERA5 reanalysis (ECMWF) were used to represent synoptic-scale variability over multiple decades. The GNG algorithm was applied to these spatio-temporal fields to identify recurring circulation patterns and characterize their relationship with fog and mist events observed at the airport.

The GNG algorithm constructs a topological map of high-dimensional input data, dynamically adapting its structure by inserting new units in response to data complexity. This adaptability makes it particularly effective for detecting subtle, evolving patterns in atmospheric fields. In this study, GNG successfully identified pressure configurations—particularly quasi-non-gradient conditions—historically associated with fog and mist formation.

A notable result is the observed decline in the frequency of these favorable synoptic patterns, which correlates with a decreasing trend in fog and mist occurrence at the site. This trend appears to be linked to rising sea surface and near-surface air temperatures, reducing the potential for moisture transport from the sea. These findings demonstrate the value of interpretable machine learning techniques in climatological research and provide insight into ongoing changes in low-visibility weather phenomena in coastal regions.

 

 

How to cite: Zoldoš, M. and Džoić, T.: Machine Learning Analysis of Fog and Mist Climatology at Pula Airport, Croatia, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-258, https://doi.org/10.5194/ems2025-258, 2025.

Show EMS2025-258 recording (12min) recording
Sub-seasonal and climate studies, biometeorology, air quality
14:30–14:45
|
EMS2025-160
|
Onsite presentation
Ana-Cristina Mârza, Daniela I.V. Domeisen, Lorenzo Ramella-Pralungo, and Angela Meyer

Subseasonal weather forecasts (2 weeks to 2 months ahead) inform operational planning in many societally relevant sectors, including energy supply and demand, but their predictive skill varies widely. We propose machine learning (ML) as a computationally inexpensive tool to estimate forecast skill in advance, aiding decision-makers. In our study, an ML model learns to relate the forecast initial conditions in historical weather data to the probabilistic forecast error at subseasonal lead times. Explainability techniques further let us rank the sources of subseasonal predictability in hindcast data by their importance, a first to our knowledge.

A gradient boosted decision tree model is trained to predict the Continuous Ranked Probability Score (CRPS) of ECMWF hindcasts at lead times 0-46 days, by leveraging initial conditions (geopotential height, sea surface temperature, zonal wind speed) extracted from the Earth System Reanalysis 5 (ERA5). The ERA5 data undergo dimensionality reduction (e.g., principal component analysis) before being fed to the ML model, and are supplemented with pre-computed indices like the El Niño-Southern Oscillation Index. Forecast skill is computed for the 500 hPa geopotential height field in Europe against ERA5 ground truth.

The ML model outperforms a climatological baseline (averaged CRPS by calendar date and lead time) in predicting European forecast skill out to week 7. We find the most important predictor of skill is stratospheric polar vortex strength, in addition to lead time and calendar date. Training separate models by lead time reveals clear differences in feature importance, such that lead time contributes the most predictability in the first 2 weeks, while the seasonal cycle manifests strongly in weeks 3-4. Different teleconnections become important at different lead times, but their predictive potential also fluctuates throughout the year. We will provide an in-depth breakdown of the feature importances by lead time and season in our presentation.

In conclusion, machine learning provides a novel way to estimate a priori the forecast skill of numerical weather prediction models. The presented method enables us for the first time to rank the relative contributions of the sources of forecast skill, as deduced from hindcast data, thereby advancing our understanding of subseasonal predictability.

How to cite: Mârza, A.-C., Domeisen, D. I. V., Ramella-Pralungo, L., and Meyer, A.: Unraveling the sources of subseasonal predictability with machine learning, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-160, https://doi.org/10.5194/ems2025-160, 2025.

14:45–15:00
|
EMS2025-226
|
Onsite presentation
Steffen Tietsche, Mariana Clare, Jakob Schloer, Inna Polichtchouk, and Frederic Vitart

The stratosphere is an important source of predictive skill for sub-seasonal forecasts, yet it is poorly represented in currently published machine-learning weather prediction (MLWP) models. These models have been developed to forecast the state of the atmosphere up two weeks ahead, but they also show potential to be used for sub-seasonal forecasting. However, this requires enhancing and adapting them to be more suitable to the task. One of these enhancements is a better representation of the stratosphere. Here, we report on progress with achieving this in the AIFS-CRPS model, a probabilistic MLWP trained with a loss function based on the continuous ranked probability score which has been developed at ECMWF. We increase the model top from 50 hPa to 1 hPa, adding 6 levels in the stratosphere. This allows to model important stratospheric features such as sudden stratospheric warmings and the quasi-biennial oscillation. We discuss how improvements to the loss function scaling, a revised training data set and modified treatment of some physical variables in the stratosphere lead to improved forecast skill in the upper troposphere and lowermost stratosphere. To build up trust in the stratospheric representation in AIFS-CRPS, we investigate in more detail some recent sudden stratospheric warming events, i.e. how well they were predicted and whether AIFS-CRPS represents the expected response of surface temperatures. We conclude that a more explicit representation of the stratosphere in AIFS-CRPS is feasible and overall beneficial to the forecasts, as it provides a previously untapped source of predictive skill on sub-seasonal to seasonal time scales.

How to cite: Tietsche, S., Clare, M., Schloer, J., Polichtchouk, I., and Vitart, F.: Data-driven sub-seasonal forecasts with a stratosphere, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-226, https://doi.org/10.5194/ems2025-226, 2025.

15:00–15:15
|
EMS2025-483
|
Onsite presentation
Siyu Li and Julian Quinting

Subseasonal weather predictions for the extratropics remain a significant challenge. Although sources of extratropical subseasonal predictability are linked to slowly evolving components of the atmospheric system, such as tropical modes of variability (e.g., the Madden-Julian Oscillation and tropical waves), these sources are not yet fully leveraged due to systematic  errors in numerical weather prediction models. In particular, short-term errors in the tropics can degrade forecast skill in the extratropics on subseasonal timescales. However, the specific tropical regions where such errors most strongly influence extratropical forecast skill remain unclear. Relaxation experiments using NWP models provide a means to identify these key regions, though such experiments are computationally expensive, especially when run in ensemble mode. In this study, we utilize machine learning-based weather prediction models to perform relaxation experiments across various tropical regions on the subseasonal timescale. Probabilistic forecasts are generated using initial conditions from the Ensemble of Data Assimilations (EDA) system of the European Centre for Medium-Range Weather Forecasts (ECMWF). By evaluating reforecasts for a five year period (2020-2024), which lies outside the training period of the models, this study systematically investigates the role of nudging techniques and the influence of tropical variability, particularly the MJO, on subseasonal forecast skill. Additionally, we evaluate the experiments conditioned on the state of the MJO to assess the regional contributions of tropical predictability to forecast improvements in the extratropics. Our results underscore the potential of nudging as both a diagnostic tool and a means to enhance extended-range forecasting skill, especially when combined with emerging machine learning-based prediction approaches.

How to cite: Li, S. and Quinting, J.: Evaluating the Impact of Relaxation Experiments on Subseasonal Forecast Skill Using Machine Learning Weather Models, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-483, https://doi.org/10.5194/ems2025-483, 2025.

Show EMS2025-483 recording (13min) recording
15:15–15:30
|
EMS2025-339
|
Onsite presentation
Assaf Shmuel, Leehi Magaritz-Ronen, Shira Raveh-Rubin, and Ron Milo

The seasonality of Earth has a profound impact on almost any aspect of life on our planet. Seasonality drives vegetation cycles, influences wildlife behavior, and shapes human health, society, and culture. Seasonality, traditionally defined by the equal-length Astronomical seasons uniformly applied across the Earth, provides a simple division but overlooks key weather patterns, latitudinal variations, and Climate Change effects. In this study we propose a data-driven approach to seasons based on unsupervised machine learning models. We develop an algorithm that clusters meteorological reanalysis data—temperature, precipitation, and relative humidity—into meaningful seasonal patterns. We build on this algorithm to objectively define seasons in every region globally, and analyze the effect of Climate Change on these clusters. The results demonstrate that seasonality in different regions of Earth is driven by different meteorological factors. We find that the duration of seasons varies significantly with latitude. For example, Summer lasts ~120 days at ±30° latitude but decreases by 7±1 days for every 10° closer to the poles. Conversely, Winter lasts ~125 days at ±30° latitude and extends by 13±2 days per 10° poleward. Our analysis reveals notable changes in the onset and duration of seasons driven by Climate Change; most notably, we find that summers have extended by 7±8 days at the expense of winters which have shortened by 8±11 days over the past 40 years, while the transition seasons have shifted accordingly. The observed shifts in seasonality highlight the rapid impact of Climate Change on Earth's systems, with profound consequences for ecosystems, agriculture, and society.

How to cite: Shmuel, A., Magaritz-Ronen, L., Raveh-Rubin, S., and Milo, R.: Revisiting Earth’s Seasonality using Machine Learning Models, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-339, https://doi.org/10.5194/ems2025-339, 2025.

Orals Wed4: Wed, 10 Sep, 16:00–17:15 | Room E3+E4

Chairperson: Gert-Jan Steeneveld
16:00–16:15
|
EMS2025-471
|
Onsite presentation
Branislava Lalic, Dinh Viet Cuong, Mina Petric, Vladimir Pavlovic, Ana Firanj Sremac, and Mark Roantree

Biometeorological models have traditionally been categorized as mechanistic (deterministic) or stochastic, with recent expansions to include machine learning (ML) models. Mechanistic models represent system processes based on a cause-effect concept and domain knowledge, while a subset—physics-based models—explicitly incorporate physical laws. Despite their interpretability, such models often rely on empirically fixed parameters and may overlook complex environmental interactions. In this study, we investigate a hybrid modeling framework that combines physics-based modeling with Physics-Informed Neural Networks (PINNs) to enhance the simulation of biosphere-atmosphere interactions.

Focusing on mosquito population dynamics as a climate-sensitive system, we couple a physics-based dynamic model with a PINN to improve representation of environmental drivers affecting larval and pupal development rates. Traditionally, air temperature is used as the primary forcing variable in such models. However, our results show that the PINN, trained on historical meteorological and entomological data, identifies precipitation and humidity as significant additional predictors of mosquito development dynamics. This enriched modeling captures population peaks more accurately and improves predictive performance during critical seasonal transitions.

By integrating physics-based structure with data-driven learning, the hybrid model maintains explainability while revealing hidden nonlinear dependencies among meteorological variables. The findings demonstrate how advanced ML techniques like PINNs can uncover meteorological sensitivities that traditional models may not capture—highlighting the importance of meteorological data in biosphere modeling.

This approach not only enhances disease vector modeling under varying climate conditions but also offers a transferable framework for other environmental applications such as crop phenology, urban microclimate analysis, and infectious diseases transmission in the human population. The study underscores the value of combining physics-based models with machine learning to extract deeper insight from complex meteorological data. 

Acknowledgements: This research is supported by the Ministry of Science, Technological Development and Innovation of the Republic of Serbia (Grants No. ‪451-03-137/2025-03/ 200125 & 451-03-136/2025-03/ 200125) and COST Action CA20108 FAIR Network of micrometeorological measurements (FAIRNESS).

How to cite: Lalic, B., Cuong, D. V., Petric, M., Pavlovic, V., Firanj Sremac, A., and Roantree, M.: Modelling Biometeorological Processes using Physics-Informed Neural Networks, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-471, https://doi.org/10.5194/ems2025-471, 2025.

16:15–16:30
|
EMS2025-372
|
Onsite presentation
Seong-il Lee, Hyo-Jong Song, Hye-Ryun Oh, and Young-Chae Kwon

This study aims to enhance the prediction performance of PM2.5 concentrations by applying post-processing bias correction to the forecast outputs of the Community Multiscale Air Quality (CMAQ) model, which utilizes meteorological data. The CMAQ model uses meteorological data and various environmental information to generate air quality predictions, but due to the complexity of atmospheric processes, the model often contains systematic errors. To evaluate and improve the accuracy of these predictions, PM2.5 forecasts from CMAQ were compared with ground-based observations from air quality monitoring stations across South Korea.

Bias correction was performed using Integrated Process Rate (IPR) data, which represents the physical and chemical processes that influence air quality within the CMAQ model. This correction was conducted using linear regression and Deep Neural Network (DNN) methods, both of which have shown promise in improving model predictions in other atmospheric studies.

The training data used for the correction covered air quality data from December 2020 to February 2021, a period representing typical wintertime conditions in South Korea. The bias correction was then applied to data from March 2021. This period is particularly significant as it coincides with South Korea's ‘Seasonal Fine Dust Management System,’ which is designed to address high levels of PM2.5 pollution during the winter months. The analysis used these three months of wintertime data to assess how well the bias correction techniques can improve the CMAQ model's accuracy during this critical pollution season.

The results demonstrate that combining the CMAQ model's predictions with IPR data and machine learning-based bias correction techniques significantly improves the prediction accuracy of PM2.5 concentrations. This study illustrates the potential of post-processing the CMAQ model's forecasts with bias correction methods to refine air quality predictions, especially during periods of elevated pollution. The use of advanced AI techniques such as DNN in this context offers a promising tool for improving the reliability and precision of air quality predictions. These improvements are essential for informing public health strategies, air quality management policies, and pollution control measures, particularly during times of high pollution. By improving the accuracy of PM2.5 predictions, this research contributes to more reliable forecasting systems, supporting better decision-making, policy development, and pollution management, which ultimately improves public health outcomes during critical periods of air pollution.

 

key word : CMAQ, Bias Correction, Machine Learning, PM2.5

How to cite: Lee, S., Song, H.-J., Oh, H.-R., and Kwon, Y.-C.: Machine Learning-Based Bias Correction for Improving PM2.5 Prediction Performance Using the Community Multiscale Air Quality (CMAQ) Model, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-372, https://doi.org/10.5194/ems2025-372, 2025.

Show EMS2025-372 recording (13min) recording
General AI contributions
16:30–16:45
|
EMS2025-526
|
Onsite presentation
Marek Jacob and Roland Potthast

EUMETNET's E-AI programme aims to leverage Artificial Intelligence (AI) and Machine Learning (ML) to enhance weather, climate, and environmental applications. It is a strategic initiative and was setup by the EUMETNET Assembly as a five-year Optional Programme, which started January 2024. The programme combines the forces of European National Meteorological and Hydrological Services (NMHSs) and external partners, including ECMWF and EUMETSAT, to advance in these areas. To achieve its objectives, a strategic reallocation of development resources towards AI/ML-based techniques and capacity building are required.

E-AI is structured around three primary pillars: (a) Data Curation, (b) Analysis, Modelling, and Post-processing, and (c) Products and Services. These pillars are accompanied by Communication and Training activities, which support the general transition towards AI-based technologies. The programme is guided by its Strategic Expert Group, which has conducted comprehensive assessments of the AI/ML landscape to inform the strategic direction of the NMHSs. By promoting collaborative development under a permissive open licence, E-AI fosters widespread adoption, a culture of openness, and synergistic innovation. In line with its guiding principles, the programme welcomes further collaboration with international partners, academia, and industry.

To pursue its targets, the E-AI programme has organised a series of workshops, online tutorials, and established working groups. The workshops were structured around the three primary pillars, featuring both in-person and online events. These included joint workshops with EUMETSAT on data curation in pillar (a), ECMWF Machine Learning Pilot Project workshops in pillar (b), and workshops on Products and Services in pillar (c). The workshops have engaged approximately 200 scientists, while the online tutorials reached an audience of over 400 individuals. The workshops have also identified interest in establishing about a dozen working groups, focusing on specific aspects of AI and ML, particularly in the areas of products and services. We will present updates on the various activities, including the development of E-AI ML-ready datasets, the exploration of multimodal applications combining large language models with meteorological fields, and approaches to Machine Learning Operations (MLOps).

How to cite: Jacob, M. and Potthast, R.: EUMETNET's E-AI Programme: Advancing Weather, Climate, and Environmental Applications through Artificial Intelligence (AI) and Machine Learning (ML), EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-526, https://doi.org/10.5194/ems2025-526, 2025.

Show EMS2025-526 recording (12min) recording
16:45–17:00
|
EMS2025-195
|
Onsite presentation
Marija Zlata Božnar, Primož Mlakar, and Boštjan Grašič

In recent years, artificial intelligence has been widely used in the field of meteorology. However, it is perhaps less known that AI-based tools have been used for over 30 years in the modeling of pollutant dispersion in the atmosphere.

In the early 1990s, Slovenia and other parts of Europe faced significant air pollution problems caused by SO₂ emissions from coal-fired power plants without desulfurization systems. Due to the highly complex terrain, simple physical Gaussian dispersion models were not suitable for reconstructing the dispersion of SO₂ in the atmosphere, while complex numerical Lagrangian particle dispersion models were still in development.

Therefore, as early as 1992, we modeled air pollution using a Multilayer Perceptron Artificial Neural Network (MLPANN). We developed the world's first comprehensive method for selecting training samples, features, and the topology of the neural network, enabling the model to learn from measured meteorological variables and pollutant concentrations around the power plant to predict SO₂ concentrations at a selected location in advance.

This type of artificial neural network remains the foundation for various derived structures used for machine learning from data even today.

In the following decades, we expanded the use of MLPANN to ozone prediction. Researchers from other Slovenian research groups also worked on this topic. Meanwhile, the use of these and similar tools in pollutant dispersion modelling began to grow on a global scale.

Together with Brazilian researchers, we successfully applied MLPANN for predicting diffuse solar radiation. In this field, we also managed to develop a spatially transferable model, which is not a common capability for artificial neural network-based models.

For the classification of wind fields based on measurements from ground-based meteorological stations, we used a Kohonen neural network.

Later, we added another tool called "Gaussian processes" (which, like MLPANN, is a universal approximator) and the tool "decision trees." We expanded the use to point-based forecasting of basic meteorological variables. In recent years, with high computational capabilities available even without supercomputers, we have been using these tools for surrogate models that can represent or predict pollutant concentration fields in the atmosphere, not only for individual points but for entire areas around pollution sources.

Other Slovenian groups from Slovenian Meteorological Agency, the Department of Meteorology, and the Faculty of Computer Science have, in recent years, used machine learning for global medium-range forecasts of daily averages of meteorological variables and for post-processing weather forecasts from physical models.

In the presentation, we will explain the basic principles of building models based on artificial neural networks in a way that is understandable to laypeople. For experts in the field, we will showcase numerous examples of their application. For the latter group, we hope these examples will inspire others to expand the use of these modern tools even further.

How to cite: Božnar, M. Z., Mlakar, P., and Grašič, B.: Over 30 Years of Artificial Intelligence Use in Meteorology in Slovenia, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-195, https://doi.org/10.5194/ems2025-195, 2025.

Show EMS2025-195 recording (14min) recording
17:00–17:15
|
EMS2025-722
|
Onsite presentation
Gabriela Aznar Siguan, Daniele Nerini, Néstor Tarin Burriel, Pascal Tay, and Hugues Lascombes de Laroussilhe

As machine learning (ML) matures from experimentation to operational deployment, institutions face the challenge of integrating ML into their production environments in a sustainable, scalable, and collaborative manner. This contribution shares the ongoing approach taken by MeteoSwiss to develop a cross-cutting ML capability that supports domain-specific innovation while laying the groundwork for robust operational practices. 

To this end, we are building an MLOps foundation that integrates with our existing DevOps culture and tools, enabling agile development, cross-institutional collaboration, and long-term operational reliability. Central to this effort is the design of an evolving ML platform that aligns with open standards and prioritizes reproducibility, monitoring, and modularity. The development workflows we support are inherently complex, involving multi-source data pipelines, extended model training sessions, and inference jobs for regular predictions using new data. 

This complexity is compounded by a fragmented infrastructure landscape spanning high-performance computing (HPC) and cloud environments—national (CSCS), international (European Weather Cloud operated by ECMWF and EUMETSAT), and private (AWS) providers. We present the architectural principles guiding both platform and process development, including CI/CD for ML pipelines, infrastructure-as-code, testing of data and models, metadata management, and model deployment strategies. Design decisions—such as the adoption of open-source tools for orchestration and lifecycle management—are discussed in the context of public sector constraints, such as limited resources and the need for transparency and auditability. While the platform is under active development, we illustrate its current capabilities through a concrete use case, and reflect on the processes that support the sustainable evolution of the ML operations and collaboration at MeteoSwiss. 

This contribution aims to foster discussion around best practices for implementing MLOps in public institutions, the role of cloud ecosystems in operational ML, and how to architect systems that are technically robust and open to collaboration across the weather and climate community.

How to cite: Aznar Siguan, G., Nerini, D., Tarin Burriel, N., Tay, P., and de Laroussilhe, H. L.: Enabling Efficient MLOps in Weather and Climate Services at MeteoSwiss, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-722, https://doi.org/10.5194/ems2025-722, 2025.

Show EMS2025-722 recording (20min) recording

Posters: Thu, 11 Sep, 16:00–17:15 | Grand Hall

Display time: Wed, 10 Sep, 08:00–Fri, 12 Sep, 13:00
Chairperson: Angela Meyer
P7
|
EMS2025-538
Aurelio Diaz de Arcaya, Jon Ander Arrillaga, Ivan R. Gelpi, and Santiago Gaztelumendi

Accurate forecasting of surface meteorological variables, such as precipitation and air temperature, is essential for operational meteorology, particularly in regions with complex topography like the Basque Country. This study presents the implementation and preliminar validation of a forecasting system developed by the Basque Meteorological Service (Euskalmet), based on Random Forest (RF) machine learning algorithms. The system is designed to support operational forecasting tasks by providing high-resolution, site-specific predictions of precipitation and temperature.

The previous system, relied on a statistical framework, was based on multiple linear and logistic regression, using adjusted R² and pseudo-R² as performance metrics. While effective, it produced forecasts at daily resolution and had limitations in capturing non-linear relationships often present in mesoscale meteorological processes. The Random Forest approach overcomes these constraints by naturally modelling complex interactions and providing hourly forecasts, leading to improved accuracy, particularly in regions with complex terrain.

The RF system is trained using a combination of predictors from a synoptic-scale model and a mesoscale model, along with historical observational data from Basque Country automatic weather stations network. The predictors include variables such as surface pressure, humidity, wind and temperature at multiple levels. Model training and validation were carried out using data from a sufficiently long time period to ensure robustness across different weather regimes. Feature selection and model tuning were performed to optimize accuracy and computational efficiency.

Performance metrics such as bias, root mean square error, and correlation score and graphics such as scatterplots and Taylor diagrams were used to compare forecasted variables with observed values

This preliminary implementation demonstrates the potential of machine learning-based systems to complement traditional numerical weather prediction outputs in an operational setting. Finally, are presented some general conclusions and new features to take into consideration in order to improve the system and possible operational implementation.

How to cite: Diaz de Arcaya, A., Arrillaga, J. A., R. Gelpi, I., and Gaztelumendi, S.: A Random Forest-Based Forecasting System for temperature and precipitation in the Basque Meteorology Agency, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-538, https://doi.org/10.5194/ems2025-538, 2025.

P8
|
EMS2025-396
Se-young Kim, Myeong-gyun Kim, and Hyo-jong Song

The transportation sector has a high proportion of greenhouse gas emissions following the energy and industrial sectors, and greenhouse gas reduction in the transportation sector is a key task that must be addressed to achieve the national carbon neutrality goal. For this, it is essential to calculate detailed CO₂ emissions by region and by road, but most studies currently rely on standardized traffic measurements or speed sensor-based data, so there is a limit to accurately reflecting complex traffic conditions and vehicle characteristics on the actual road.

 

Therefore, this study aims to improve the accuracy of traffic estimation by utilizing artificial intelligence (AI) technology based on real-time road images. In particular, the vehicle type is one of the most important factors in calculating co2 emissions in automatically identifying not only traffic volume but also vehicle types (passenger vehicles, lorries, electric vehicles, etc.) through vehicle detection and tracking technology using CCTV images on the road. Therefore, this study aims to advance a model that is directly used to calculate CO₂ emissions by using vision artificial intelligence models such as yoLO models.

 

By introducing YOLO-based object detection and tracking technology as its core, this study aims to overcome the limitations of existing structured data-based models. It promoted the development of an advanced model that can operate stably in various environmental conditions such as detection and tracking errors, bad weather conditions such as rain, snow, fog, and daytime and nighttime changes, such as urban boulevard with a wide road width and a large number of lanes. Detection and tracking performance were improved by applying the clustering technique based on the model's own parameter optimization and vehicle distribution characteristics, and in the case of failure to detect due to deterioration of image quality, the reliability was increased by directly performing hand-based vehicle coefficients for quantitative accuracy verification. Through this, the co2 emission calculation and comparison verification were conducted through the existing model, and it is expected that comparison with the existing statistical traffic estimation model will be possible.

 

Although the current research remains in the early stage of model development, it is expected that it will be able to develop into a real-time traffic calculation technology optimized for the actual road operation environment through continuous technological advancement in the future. Furthermore, this study is expected to dramatically improve the precision of predicting greenhouse gas emissions through video-based transportation information collection technology and provide a practical and scalable technical foundation in various fields such as smart city construction and eco-friendly transportation policy establishment.

How to cite: Kim, S., Kim, M., and Song, H.: A Study on the advancement of road traffic estimation and greenhouse gas emission estimation using Vision AI technology, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-396, https://doi.org/10.5194/ems2025-396, 2025.

P9
|
EMS2025-93
Linna Zhao, Linna Zhao, and Lan Gao

    Precipitation forecasting is one of the key points and difficulties in weather forecasting. Numerical forecasting is the core means of precipitation forecasting, but its output results are often affected by model uncertainty, uncertainty of initial conditions, errors in parameterization schemes, spatial resolution, topography and geomorphology, and other complex factors, which lead to systematic bias of numerically forecasted precipitation, difficulty in accurately predicting the magnitude of heavy rainfall or sudden heavy precipitation, etc., and therefore, post-processing optimisation of numerical precipitation forecasting is needed. In recent years, studies have shown that compared with the traditional statistical post-processing technology, the artificial intelligence-based post-processing technology in medium-term numerical prediction model has own advantage in that it is a data-driven method. This technology can implicitly extract the spatio-temporal variations of nonlinear and multi-scale physical relationships from multi-source data, thus significantly enhancing the level of medium- and short-term weather forecasts.
    In this paper, a convolutional dendritic neural network (CDNN) model incorporating dendritic (DD) network is constructed on the basis of CNN model, in order to deal with the sample imbalance problem of deep learning precipitation forecast, in which the features are labelled with K-means clustering labels and tag labels.
    A 24-h precipitation forecasting study was carried out using the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System (ECMWF-IFS) forecast product and the  CMA(China Meteorological Administration)Multi-source Precipitation Analysis System(CMPAS)hour-by-hour precipitation analysis product of China. 
    The results show that the CDNN model reduces the average RMSE of each class of precipitation, and has better overall performance in precipitation TS scores, bias, misses, and false alarms, which are closer to the observation. The K-means clustering labels constrain the samples, which allows the CDNN model after K-means clustering labels(K-CDNN) to learn the class information of precipitation, and significantly reduces the RMSE of precipitation forecast, significantly improves the medium and heavy precipitation forecasts. The labelled labels further constrain the samples of the model training, so that the labelled-labels(LK-CDNN) model based on K-means clustering labels takes more into account the small-sample events, and has better prediction ability for heavy precipitation.

How to cite: Zhao, L., Zhao, L., and Gao, L.: Improving on the precipitation forecasting of NWP based on convolutional neural network, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-93, https://doi.org/10.5194/ems2025-93, 2025.

P10
|
EMS2025-285
Tim Radke, Johanna Baehr, and Marc Rautenhaus

Atmospheric features, including tropical cyclones, atmospheric rivers, extratropical cyclones, and atmospheric fronts (AFs), are important for understanding and predicting the weather. Hence, automated detection of atmospheric features in gridded datasets is used for weather forecasting, statistical and climatological studies, and visual data analysis. Typically, rule-based detection systems are used; however, in recent years, numerous studies have demonstrated the use of machine learning, especially convolutional neural networks (CNNs), to detect atmospheric features. While it was shown that CNNs can detect atmospheric features similarly well to human experts, they are “black box” systems. Therefore, whether the features are detected based on physically plausible patterns is unknown. In a recent study (Radke et al. 2025, Geosci. Model. Dev.) we showed how the explainable artificial intelligence technique “Layer-wise Relevance Propagation” (LRP) can be used to understand the patterns used by a CNN detecting tropical cyclones and atmospheric rivers. In this presentation, we build upon this study as well as recent work in AF detection with CNNs and use LRP to understand the detection patterns learned by a CNN detecting AFs. As hand-labeled data of AFs is only sparsely available, CNNs for AF detection are only trained on local regions.

We find that the patterns used by a CNN detecting AFs are not entirely plausible when compared to rule-based detection systems. This leads to erroneously detected AFs, especially outside the regions used for training the CNN. Taking a closer look at regions outside the training regions, we find that using CNNs to detect AFs outside their training region, for example, in the tropics, leads to poor results. To increase the detection quality, we explore an extension of the data to the tropics and show that additional data improves detection results; however, as a dataset with globally labeled AFs does not exist, further research is required before CNNs can be used to reliably detect AFs globally.

How to cite: Radke, T., Baehr, J., and Rautenhaus, M.: Explaining neural networks for detection of atmospheric surface fronts in gridded atmospheric simulation data, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-285, https://doi.org/10.5194/ems2025-285, 2025.

P11
|
EMS2025-379
Dong-Hun Lee, Hyo-Jong Song, Jong-Yeon Park, and Chae-Eun Im

Since the onset of industrialization, accelerated global warming has significantly impacted global ecosystems, particularly affecting the survival and distribution of plant species. Rising temperatures and increasing climate variability pose substantial threats to the structural and functional stability of ecosystems, interspecies interactions, and the geographical distribution of plant taxa. These changes may ultimately lead to reduced biodiversity and potential species extinction. In response, predicting future vegetation dynamics has become essential for biodiversity conservation and human sustainability. This study aims to analyze the occurrence trends of plant species using machine learning techniques, based on past and present climate data, to project future distributions under climate change.

Temperature and precipitation data from the Gwangyang region in South Korea were used as input variables. The target variable, representing species presence or absence, was derived from pollen and environmental DNA (eDNA) analyses of soil samples collected from the Topyeongcheon area. Separate logistic regression models were constructed for three representative plant species. The statistical significance of regression coefficients was examined, and model performance was evaluated using accuracy, precision, recall, and F1-score metrics.

Future projections were based on climate scenarios from the CMIP6 dataset, incorporating a range of socio-economic factors including demographics, economic development, welfare levels, ecosystem conditions, resource availability, technological advancements, social dynamics, and policy interventions. These scenarios were applied to the models to estimate future plant species occurrence and to conduct comparative analyses across different pathways. The findings of this study provide a quantitative assessment of species distribution under climate uncertainty and offer a foundational basis for long-term biodiversity conservation planning.

How to cite: Lee, D.-H., Song, H.-J., Park, J.-Y., and Im, C.-E.: Predicting Future Plant Species Occurrence Using Paleo- and Modern Climate Data with Machine Learning, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-379, https://doi.org/10.5194/ems2025-379, 2025.

P12
|
EMS2025-391
Myeong-Gyun Kim, Se-Young Kim, and Hyo-Jong Song

In 2021, the global greenhouse gas (GHG) emissions totaled 49.55 gigatons of carbon dioxide equivalent (GtCO2eq), with the transport sector accounting for 15.8% of the total. Among these, road transport was the dominant source. Therefore, reducing carbon dioxide (CO2) emissions from the road transport sector is critical for achieving carbon neutrality in transportation, which requires a high-resolution emissions inventory. However, current CO₂ emission calculations are typically conducted at the national level, limiting spatial accuracy. This study aims to develop a road-level CO₂ emissions inventory across the Republic of Korea. To achieve this, high-resolution traffic volume data is essential. Given the lack of observed traffic data on many roads, we first developed traffic volume estimation models using machine learning algorithms, including Random Forest (RF), LightGBM (LGBM), XGBoost (XGB), and Deep Neural Networks (DNN). The models demonstrated strong performance, with R² values of 0.9404 (MSE: 94,331) in Seoul, 0.9490 (MSE: 26,929) in Daejeon, and 0.8619 (MSE: 40,293) in Incheon. Furthermore, we applied clustering techniques and model diagnostics to construct optimal region-specific and variable-specific models, allowing us to quantify the uncertainty of the estimated traffic volumes. Using emission factors, we estimated road-level CO₂ emissions from the predicted traffic volumes and indirectly derived the uncertainty in CO₂ emissions based on traffic volume uncertainties. Additionally, CO₂ observations from select roadside monitoring sites were used for further validation. In future work, we plan to enhance the traffic volume estimation models and develop a direct CO₂ emissions prediction model. These efforts are expected to support evidence-based policymaking and enable more effective CO₂ emission reduction strategies in the road transport sector.

How to cite: Kim, M.-G., Kim, S.-Y., and Song, H.-J.: Estimation of High-Resolution CO₂ Emissions by Road Segment in South Korea Using Machine Learning Model Analysis and Applications, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-391, https://doi.org/10.5194/ems2025-391, 2025.

P13
|
EMS2025-439
|
Markus Rosenberger, Manfred Dorninger, and Martin Weissmann

Clouds of any kind play a substantial role in a wide variety of atmospheric processes. They are directly linked to the formation of precipitation, and significantly affect the atmospheric energy budget via radiative effects and latent heat. Moreover, both the amount and type of clouds is supposed to alter in a changing climate. Hence, the currently decreasing number of operational cloud observations limits not only the possibility and accuracy of short-term weather forecasts but also the availability of long-term cloud type records.

To show that automatized methods can close this emerging gap, we trained an ensemble of 10 identically initialized residual neural network architectures from scratch on ground-based RGB sky pictures to classify clouds into 30 different classes. Four different pictures taken in the main cardinal directions are used as input at each instance, so that the whole visible sky is covered. Operational manual cloud classification reports at the nearby station Vienna Hohe Warte are used as ground truth. For each instance up to 3 out of 30 categories are reported according to the state-of-the-art WMO cloud classification scheme for operational synoptic observations, making this a multi-label classification task. To the best of our knowledge we are the first to automatically classify clouds based on this elaborate classification scheme. We utilize class specific resampling to reduce prediction biases because of highly imbalanced observation frequencies among categories. Results show that precision and recall scores are high and that every member of our ensemble outperforms both random and climatological predictions in each class. A substantial ratio of wrongly assigned pictures is made up by false negative predictions, where the model recognized the correct class in the input but the assigned probability was too small. Although the WMO classification scheme is well-defined, cloud classification is subjective to some extent because of e.g. the occurrence of clouds in transitional stages. Therefore, we also investigate the reliability of ground-truth observations.

Autonomy and output consistency are the main advantages of such a trained classifier, hence we consider operational cloud monitoring as main application. Either for consistent cloud class observations or to observe the current state of the weather and its short time evolution with high temporal resolution, e.g. in proximity of solar power plants. There, upcoming clouds can substantially change the possible energy output, which leads to the necessity of taking precautions. 

How to cite: Rosenberger, M., Dorninger, M., and Weissmann, M.: Multi-label cloud type classification from ground-based RGB pictures with a residual neural network ensemble, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-439, https://doi.org/10.5194/ems2025-439, 2025.

P14
|
EMS2025-523
Santiago Gaztelumendi and José Antonio Aranda

Artificial intelligence (AI) and machine learning (ML) represents an important evolution in computer science and data processing that is quickly transforming a vast array of industries. Driven by the increasing need for digitalization over the last decade, AI/ML are being used in businesses of all types to automate processes, analyze large datasets for insights, improve decision-making, enhance customer experiences, and optimize operations.

The Basque Meteorological Agency (Euskalmet) is the official weather service of the Basque Country, responsible for weather monitoring, forecasting, and civil protection support. It provides weather data, develops climate studies, and offers public information, contributing to emergency preparedness and climate change research. Established in 1990 and reorganized in 2024 as a public entity under the Basque Government, Euskalmet also plays a key role in managing alerts, collecting meteorological data, and communicating weather-related risks to the public.

AI and ML are powerful tools that enhance observation, modeling, forecasting, and analysis capabilities in the geosciences, making meteorological and climate tools and systems more efficient, intelligent, and useful for society. In the case of Euskalmet, AI/ML are primarily used or planned to be used, in relation to their utility in improving forecast accuracy, analyzing large volumes of meteorological and climatological data, detecting extreme weather events and anomalies, downscaling models to generate high-resolution local projections, automating operational tasks such as quality control and satellite image classification, supporting hybrid physical-data modeling approaches, and enabling data-driven decision-making for weather-sensitive sectors. These technologies enhance Euskalmet’s capacity to monitor, predict, and communicate weather and climate information more effectively and efficiently, particularly during severe weather and impact events.

In this work, we will present the strategy we are pursuing at Euskalmet regarding the exploration of opportunities offered by AI/ML. We will provide an overview of the steps we are taking, including the activities already undertaken and those planned, with a focus on our experience as a small-scale meteorological center.

We will also share details of Research, Development and innovation projects and operational implementations we are currently doing in this field, which may serve as inspiration for others following a similar path. Particular attention will be given to key aspects dealing with surveillance, forecasting, modeling, climate, and observation. Finally, we will present future plans and conclusions based not only on our technical experience but also on other relevant aspects such as management, funding, infrastructure, partnerships, and more.

How to cite: Gaztelumendi, S. and Aranda, J. A.: Artificial Intelligence and Machine Learning at Basque Meteorology Agency, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-523, https://doi.org/10.5194/ems2025-523, 2025.

P15
|
EMS2025-275
Luiz dos Santos Neto, Vanderlei Maniesi, and Carlos Querino

Flooding is a type of natural disaster in which the rising river level during the high-water season comes into contact with society, causing damage. In the Amazon, floods are among the most frequent natural disasters in the region. Aiming to minimize the impacts caused by flooding, this research sought to apply different statistical hydroclimatological modeling methods and analyze their effectiveness in forecasting the monthly maximum river levels of an Amazonian river at a given control point, four months in advance. To this end, a case study was conducted on the 2012 flooding of the Acre River in the city of Rio Branco, capital of the Brazilian state of Acre, where the river reached a level of 17.64 meters—its third highest recorded level since measurements began. Four statistical modeling methods were used: one based on Multiple Linear Regression (MLR) and three using different Artificial Neural Network (ANN) algorithms. For model simulations, the input data consisted of 40 years (1971 to 2010) of monthly maximum levels of the Acre River in Rio Branco, monthly average Sea Surface Temperature (SST) data from the tropical regions of the Pacific and Atlantic Oceans, and monthly average atmospheric pressure data from Darwin, Australia, and Tahiti, French Polynesia, over the same period. The simulation results from each model were compared with observed data at the monitoring station, and model accuracy was evaluated using performance indices. In the analysis of the models tested to simulate monthly maximum levels in a continuous historical time series, all showed acceptable predictions with satisfactory performance indices, with the MLR-based model standing out. However, when comparing only the maximum level observed in 2012, the ANN-based models were more accurate, missing the observed value by only 27 cm four months in advance, proving more efficient at capturing the climatic patterns that cause the Acre River to reach exceptional levels—unlike the MLR method, which underestimated the peak by 323 cm. The performance results found in this study endorse these statistical hydroclimatic models as operational tools for environmental agencies, serving as indispensable instruments for water management and disaster prevention months in advance, thereby helping to mitigate the frequent flooding of the Acre River.

How to cite: dos Santos Neto, L., Maniesi, V., and Querino, C.: Use of Machine Learning in flood forecasting in the Amazon: a case of study of the Acre River flooding in Rio Branco, Acre, Brazil, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-275, https://doi.org/10.5194/ems2025-275, 2025.

P16
|
EMS2025-92
Chung-Chieh Wang and Shin-Hau Chen

A long-standing problem of all numerical weather predictions, regardless deterministic or ensemble, is the more accurate assessment in probability (or likelihood) for the predicted scenario to occur, especially at longer lead times due to typically larger errors. The rapid development of artificial intelligence today may offer an effective method to tackle this issue. In this study, a neural-network machine-learning model is developed to, after training, project the expected value of the similarity skill score (SSS) of predicted total rainfall distribution in Taiwan for westward-moving typhoons during their influence period, thus serving as an objective guidance for the quality of the prediction. Ten typhoons are included, and a total of 105 parameters linked to rainfall are used from time-lagged forecasts (out to 8 days) every 6 h by a cloud-resolving model, when they cover the entire influence period (inside 300 km from Taiwan) with enough lead time. For each typhoon, only data from the other nine cases are used to train the model.

The results indicate that machine learning can capture the tendency of the actual SSS (calculated against observed rainfall) for most cases (eight out of ten), thereby informing the forecasters which quantitative precipitation forecasts (QPFs) are more trustworthy and which other ones are less so beforehand. Such guidance is particularly valuable at longer lead times, when the forecast uncertainty is relatively high. Thus, our results are highly encouraging. Nevertheless, if a typhoon behaves differently in forecasts from those that serve as the training data, the outcome would be less useful. Possible directions to remedy this issue and make further improvement are also offered. 

How to cite: Wang, C.-C. and Chen, S.-H.: Projecting Quality of Quantitative Precipitation Forecasts before Events through Machine Learning: Preliminary Results for Westbound Typhoons in Taiwan from A Cloud Model, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-92, https://doi.org/10.5194/ems2025-92, 2025.