HS4.10 | Recent advances in (hybrid) hydrological forecasting using physically-based and machine learning models
EDI
Recent advances in (hybrid) hydrological forecasting using physically-based and machine learning models
Convener: Sandra Margrit Hauswirth | Co-conveners: Hamid Moradkhani, Ilias Pechlivanidis, Louise Slater
Orals
| Thu, 01 May, 16:15–18:00 (CEST)
 
Room 3.29/30, Fri, 02 May, 08:30–10:15 (CEST)
 
Room 3.29/30
Posters on site
| Attendance Fri, 02 May, 14:00–15:45 (CEST) | Display Fri, 02 May, 14:00–18:00
 
Hall A
Posters virtual
| Attendance Tue, 29 Apr, 14:00–15:45 (CEST) | Display Tue, 29 Apr, 08:30–18:00
 
vPoster spot A
Orals |
Thu, 16:15
Fri, 14:00
Tue, 14:00
In recent years, there has been a strong increase in the use of machine learning techniques to enhance hydrological simulation and forecasting. These methods are receiving growing attention due to their ability to handle large datasets, combine different sources of predictability, increase forecasting skill and minimize the effect of biases, as well as enhance computational efficiency. Furthermore, the range of implementations is broad, from purely data-driven forecasting systems to hybrid setups, combining both physically-based models and machine learning techniques, from large to local scales as well as different time horizons. These all allow forecasters to address and cover various aspects and processes of the hydrological cycle, including extreme conditions (floods and droughts), which are important for water resources and emergency management.

This session aims to highlight and bring together recent efforts in hydrological forecasting, using machine learning based techniques and/or hybrid approaches. Contributions are welcome showcasing examples of model developments (ranging from implementations to operational setups), studies ranging from local to global scales and across different time horizons (short-, medium- and long-term), as well as studies showcasing the efforts data-driven/hybrid approaches to tackle challenges in hydrological forecasting. We particularly welcome talks that reach beyond the description of machine learning architectures to uncover physical and human-induced processes, account for uncertainties, generate novel insights about hydrological forecasting, or support efforts in reducing common forecasting difficulties.

Other topics related to the subdivision of Hydrological Forecasting and the corresponding sessions can be found here: https://www.egu.eu/hs/about/subdivisions/hydrological-forecasting/

Orals: Thu, 1 May, 16:15–18:00 | Room 3.29/30

Chairpersons: Sandra Margrit Hauswirth, Hamid Moradkhani, Louise Slater
16:15–16:20
16:20–16:40
|
EGU25-11925
|
ECS
|
solicited
|
On-site presentation
Gwyneth Matthews, Calum Baugh, Matthew Chantry, Cinzia Mazzetti, Hamidreza Mosaffa, Ewan Pinnington, Nina Raoult, Karan Ruparell, Maria-Luisa Taccari, Florian Pappenberger, and Christel Prudhomme

Hydrological modeling has entered a new era in recent years, largely driven by the curation of extensive datasets and availability of open-source machine learning libraries. While traditional physically based models have been key to improving our understanding of hydrological systems and establishing early warning systems, they often face challenges such as high computational costs and requiring simplifications of complex processes. Conversely, machine learning methods, despite potential pitfalls such as generating unphysical outputs and requiring large volumes of training data, are computationally quick and capable of capturing highly non-linear relationships. Hybrid hydrological modeling bridges these approaches, combining the efficiency and flexibility of machine learning with the proven capabilities and interpretability of traditional physical models.

This talk will provide an overview of the hybrid hydrological modeling research being conducted at the European Centre for Medium-range Weather Forecasts (ECMWF). Using case studies, we will show how machine learning methods could be incorporated into the pre-established physical modeling chain, addressing both scientific and operational challenges. Examples will cover the full modelling chain including the coupling of machine learning and physically based models, the use of emulators of sub-models to reduce computational overhead, and the integration of data-driven techniques to correct model biases in real time. The development of a machine learning forecasting model will also be discussed as a component of a hybrid multi-model system. Attention will be given to the practical aspects of implementation, including computational efficiency both for an operational system and for sensitivity and calibration experiments, scalability to large operational systems, and the potential to incorporate new datasets, such as remote sensing data, into hybrid frameworks. We will also discuss how artificial intelligence can be used to support auxiliary services such as simulation verification.

Finally, we will reflect on the implications of hybrid hydrological modeling for advancing hydrological science and operational forecasting. By combining the strengths of physical and machine learning models, this approach has the potential to improve flood prediction, water resource management, and climate impact assessments. This hybrid approach marks an important step forward in the development of hydrological modeling, enabling more accurate, efficient, and actionable understanding of water systems in a rapidly changing world.

How to cite: Matthews, G., Baugh, C., Chantry, M., Mazzetti, C., Mosaffa, H., Pinnington, E., Raoult, N., Ruparell, K., Taccari, M.-L., Pappenberger, F., and Prudhomme, C.: A new era of hybrid hydrological forecasting at ECMWF , EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-11925, https://doi.org/10.5194/egusphere-egu25-11925, 2025.

16:40–16:50
|
EGU25-12113
|
ECS
|
On-site presentation
Ritesh Moon, Shasha Han, Laura Graham, and David Hannah

Accurate streamflow forecasting in both managed and natural catchments is critical for sustainable water resource management in the UK. Simulating hydrological processes such as flashy flood responses and reservoir-influenced flow dynamics remains a significant challenge, particularly in non-natural catchments where human interventions alter natural flow regimes. This study evaluated the effectiveness of three modelling frameworks across 341 selected UK catchments from the CAMELS-GB database: the HBV (a conceptual hydrological model), Long Short-Term Memory (LSTM, a data-driven model), and a hybrid Physics-Informed Machine Learning (PIML) model supplemented with hydrological signatures. The regional and spatial patterns of their performance was investigated using evaluation metrics such as Nash-Sutcliffe Efficiency (NSE) and Kling-Gupta Efficiency (KGE) during both the calibration and validation phases. The results show that LSTM outperforms HBV in 65% of the catchments, particularly in northern Scotland and the western UK, which have steep terrain with rapid runoff; but it lacks the physical interpretability that HBV provides. Despite its advantages in natural catchments, HBV tends to produce unsatisfactory simulations for   processes such as snowmelt and rapid storm response, as well as for regulated flows in reservoir-affected catchments, which are common in central and southern England. The incorporation of hydrological signatures (e.g., baseflow index, rainfall-runoff ratio) into the PIML framework addresses this limitation by encapsulating key reservoir processes (e.g., flow smoothing, seasonal redistribution) and anthropogenic influences, allowing for improved streamflow predictions and enhanced interpretability. Our study emphasises the need for hybrid modelling approaches that combine the physical coherence of conceptual models with the adaptability of data-driven procedures.  The findings highlight the necessity of adapting models to local conditions and accounting for the effects of human activity, providing a reliable way to enhance streamflow predictions in complicated and regulated catchments.

How to cite: Moon, R., Han, S., Graham, L., and Hannah, D.: Bridging Physics and Machine Learning: A Signature-enhanced Hybrid Framework for Streamflow Prediction in Complex Catchments, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-12113, https://doi.org/10.5194/egusphere-egu25-12113, 2025.

16:50–17:00
|
EGU25-17296
|
ECS
|
Virtual presentation
Majid Niazkar, Omar Cenobio-Cruz, Gloria Mozzi, Giuliano Di Baldassarre, and Jeremy Pal

Accurate streamflow prediction is crucial for water resources management, particularly in the regions facing challenges such as water scarcity and hydrological unpredictability. Physical-based hydrological models have long been used for rainfall-runoff simulations by solving equations governing hydrological processes in a typical watershed. In addition, Machine Learning (ML) models emerged as versatile data-driven approaches capable of capturing intricate patterns of hydroclimatic variables, which can be used for streamflow prediction.

The aim of this study is to compare performances of two distinct approaches: (i) the process-based and semi-distributed Hydrological Predictions for the Environment (HYPE) model and Extreme Gradient Boosting (XGBoost), a tree-based ML algorithm. The case study is upper Reno River Basin, situated in northern Italy. Precipitation across the basin varies considerably due to orographic influences, while this spatial variability drives diverse seasonal and regional streamflow patterns. For this purpose, a 5-km gridded meteorological data (the ERG5 dataset) was used as input for both models from 2001 to 2022. The database was developed by ARPAe-SIMC for the Emilia-Romagna region in Italy. Furthermore, the streamflow was considered as output results. For the sake of comparison, both models were calibrated using the same time series, partitioning the data into 75% for calibration/training and 25% for testing.

The simulation performance for river discharge showed high values of the Kling-Gupta Efficiency (KGE) for the training phase as XGBoost showed slightly better values of KGE (0.86) than that of HYPE (0.82). For the test period, KGE around 0.8 was obtained by both models. Thus, the KGE values were comparable for both models, with HYPE slightly outperforming XGBoost (0.82 vs. 0.78). The flow-duration curves revealed that both models performed well for estimating peak discharges (below 30% occurrence). However, for drier conditions, HYPE shows a better agreement with the observed data, while ML tended to overestimate it.

The results indicate that traditional hydrological models performed slightly better than XGBoost for streamflow estimation in the region under investigation. The performance of XGBoost may be improved if seasonality was taken into account, which can be explored in future works. Based on the comparative analysis, ML techniques can provide a suitable alternative in cases where little is known about the region’s hydrological characteristics, leveraging data patterns without requiring detailed process knowledge. Nonetheless, the application of ML requires caution, as its black-box nature may obscure the underlying physical and hydrological processes, potentially leading to misinterpretation of results. Finally, this comparison provides valuable guidance for researchers and practitioners in selecting appropriate tools for streamflow prediction tasks.

Acknowledgements: This research work was carried out as part of the TRANSCEND project with funding received from the European Union Horizon Europe Research and Innovation Programme under Grant Agreement No. 10108411.

How to cite: Niazkar, M., Cenobio-Cruz, O., Mozzi, G., Di Baldassarre, G., and Pal, J.: Hydrological Modelling vs. Machine Learning for Water Availability: Case Study from the Reno Basin (Italy), EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-17296, https://doi.org/10.5194/egusphere-egu25-17296, 2025.

17:00–17:10
|
EGU25-7887
|
On-site presentation
Hybrid Integration of Land Surface Models and Deep Learning for Enhanced Streamflow Prediction Across India
(withdrawn after no-show)
Bhanu Magotra and Manabendra Saharia
17:10–17:20
|
EGU25-10567
|
ECS
|
On-site presentation
Claudia Bertini, Yiheng Du, Schalk Jan van Andel, and Ilias Pechlivanidis

Despite the advances in hydro-meteorological forecasting systems, challenges to accurately forecast streamflow at seasonal time horizons still remain, especially when models are applied across a strong hydro-climatic gradient. In this work, we explore the potential of AI-based approaches combined with the output of process-based hydrological models and meteorological forecasts from Numerical Weather Prediction models to enhance seasonal streamflow forecasts, with lead-times up to 30 weeks. We employ the multi-time scale Long Short-Term Memory (MTS-LSTM) model trained with a combination of reanalysis data from the process-based pan-European E-HYPE hydrological model, in-situ observations from GRDC, and bias-adjusted seasonal meteorological forecasts from the ECMWF SEAS5 prediction system. The MTS-LSTM is developed at the pan-European scale, using more than 500 catchments over Europe, which lie in 11 different clusters according to their hydrological regime. We then compare the AI-based forecast performance against streamflow climatology and the E-HYPE streamflow forecasts. Our results show that the streamflow forecasts based on MTS-LSTM outperform the E-HYPE ones in catchments characterised by highly variable and flashy hydrological response and snow-dominated catchments with high seasonality. However, the MTS-LSTM underperform compared to E-HYPE results in catchments with highly variable streamflow regimes and long recessions. These preliminary findings highlight the potential of AI approaches to enhance streamflow predictability at seasonal lead-times across Europe’s strong hydro-climatic gradient, having both scientific and operational added value.

How to cite: Bertini, C., Du, Y., van Andel, S. J., and Pechlivanidis, I.: AI-based seasonal streamflow forecasts across Europe’s hydro-climatic gradient, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-10567, https://doi.org/10.5194/egusphere-egu25-10567, 2025.

17:20–17:30
|
EGU25-10870
|
ECS
|
Virtual presentation
Lucas Dalgaard Jensen, Grith Martinsen, Henrik Madsen, Phillip Aarestrup, and Raphael Payet-Burin

Hydrological modeling provides a quantitative foundation for effective water resource management. Simulating discharge from meteorological forecasts is essential for flood prediction and risk assessment.

Traditional hydrological models, such as the Hydrological Predictions for the Environment (HYPE) model, leverage explicit equations to represent well-known catchment characteristics and provide process-based discharge forecasts. However, these models often struggle to capture unknown or poorly understood spatio-temporal dependencies and nonlinear dynamics.

In contrast, machine learning approaches, such as Long Short-Term Memory (LSTM) networks, are able to learn complex patterns without requiring pre-defined relationships. However, these networks introduce variabilities into hydrological model simulations, which in turn complicates the development of well-supported arguments based on their findings.

A promising solution is a hybrid model in which the physical model's output serves as dynamic input to the LSTM. This approach preserves the strengths of physics-based models in representing well-understood hydrological processes while allowing the LSTM to capture implicit dependencies.

This study investigated the application of hybrid hydrological modeling for simulating discharge in Danish catchments by combining simulated discharge from the Danish HYPE model (DK-HYPE) with an LSTM model. The analysis encompassed 570 catchments, characterized by static attributes and dynamic variables. Dynamic variables were derived from a high-resolution CAMELS dataset (DK-CAMELS) with a spatial resolution of 1x1 km, from Danish weather stations covering the time period from 2001 to 2022.

Fifteen LSTM models were trained under various configurations: different sequence lengths (30, 90, and 365 days), inclusion of static attributes, and utilization of DK-HYPE outputs. Model training used two loss functions—Mean Squared Error (MSE) and the Nash-Sutcliffe model efficiency coefficient (NSE)—while performance was evaluated using the Kling-Gupta Efficiency (KGE), Flow Balance (FBAL), and Critical Success Index (CSI).

Incorporating static attributes enhanced model accuracy, while longer sequence lengths captured hydrological dependencies. Across all configurations, the LSTM models outperformed DK-HYPE. The best-performing hybrid model achieved a KGE of 0.7 and a CSI of 0.36, a significant improvement over DK-HYPE's baseline values of 0.01 for KGE and 0.18 for CSI. Similarly, the standalone LSTM model, which excluded DK-HYPE outputs during training, achieved a KGE of 0.71 and a CSI of 0.35.

While the hybrid model did not demonstrate a clear advantage over the pure LSTM model with longer sequence lengths, it outperformed the pure LSTM model with shorter sequence lengths. Specifically, comparing models using sequence length of 30 days, the hybrid model achieved a KGE of 0.65 and a CSI of 0.36, compared to the pure LSTM model's KGE of 0.59 and CSI of 0.31, which is most likely because it utilized information from DK-HYPE.

This project is a step towards combining physics-based models with data-driven models for the national flood warning system. Further work should focus on fine-tuning the hybrid models and integrate them into an ensemble towards building robust systems for flood forecasting.

How to cite: Jensen, L. D., Martinsen, G., Madsen, H., Aarestrup, P., and Payet-Burin, R.: Utilizing Physics-Based Predictions as Inputs to LSTM Models for Robust Data-Driven Discharge Simulations of Gauged Catchments Across Denmark, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-10870, https://doi.org/10.5194/egusphere-egu25-10870, 2025.

17:30–17:40
|
EGU25-3263
|
Virtual presentation
Anna Malago', Timo Schaffhauser, Fayçal Bouraoui, Paolo Lazzarini, Andrea Marziali, Alberto Ravasi, and Valerio Bertelli

Accurate streamflow predictions in glacial basins are critical for effective water resource management and flood risk mitigation, especially under changing climatic conditions. This study presents a hybrid modeling framework that integrates the SWAT-GL model, an extension of the Soil and Water Assessment Tool (SWAT) designed to include glacial hydrological processes, with advanced machine learning techniques (Random Forest, Support Vector Regression, and Multilayer Perceptron).

SWAT-GL was selected for its proven ability to simulate glacial hydrological processes at a daily scale. However, its application at an hourly scale is limited due to the reliance on the Green-Ampt infiltration method, which is less suitable for representing the unique soil characteristics typically observed in glacier-fed basins. To overcome this limitation, machine learning models were employed to refine the daily SWAT-GL outputs into hourly predictions. An ensemble model was developed to enhance accuracy, combining the complementary strengths of the individual machine learning approaches.

The model was calibrated using data from 2021 to 2023, with a one-year warmup period (2020), and validated with observed data from January 2024 to September 2024. Meteorological forecasts from ECMWF-IFS and MOLOCH models were incorporated, providing hourly data on precipitation, temperature, solar radiation, and wind speed. This approach enabled day-ahead operational forecasting, aligning model outputs with real-time management needs.

The ensemble model showed strong performance during training and testing, highlighting its robustness in refining daily SWAT-GL outputs into accurate hourly predictions.

The hybrid framework was applied to the Forni Basin, a glacier-fed system in the Italian Alps characterized by high variability in meltwater contributions and limited hydrological data. By addressing key challenges such as input uncertainties, limitations of process-based modeling at sub-daily scales, and scaling from daily to hourly forecasts, the model offers a robust tool for predicting streamflow in data-scarce, glacierized regions. This study highlights the potential of hybrid approaches to improve hydrological forecasting accuracy and scalability, contributing to the sustainable management of water resources in sensitive alpine environments.

How to cite: Malago', A., Schaffhauser, T., Bouraoui, F., Lazzarini, P., Marziali, A., Ravasi, A., and Bertelli, V.: Hybrid Hydrological Modeling in the Forni Basin: Combining SWAT-GL and Machine Learning Techniques, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-3263, https://doi.org/10.5194/egusphere-egu25-3263, 2025.

17:40–17:50
|
EGU25-13016
|
On-site presentation
Hamidreza Mosaffa, Christel Prudhomme, Matthew Chantry, Liz Stephens, Christoph Rüdiger, Florian Pappenberger, and Hannah Cloke

Recent advances in Earth observation and data collection technologies have made high-resolution hydrological datasets increasingly accessible, enhancing our capabilities for monitoring and predicting hydrological processes. While a variety of artificial intelligence (AI) models can be employed to leverage these datasets, the challenges and opportunities of different AI approaches in the context of high-resolution data availability remain only partially explored. For instance, although Long Short-Term Memory (LSTM) networks are widely used for discharge prediction, the potential of Graph Neural Networks (GNNs)—which naturally represent river networks as graphs and capture spatial dependencies—has yet to be fully investigated.

In this study, we conduct a comprehensive analysis of GNN-based models for river discharge prediction in the Danube River Basin. Leveraging the LamaH-CE (Large-Sample Data for Hydrology and Environmental Sciences for Central Europe) dataset, we incorporate both dynamic features (e.g., daily precipitation, temperature, soil moisture) and static variables (e.g., digital elevation model, river density, basin area). Three architectures—GNN, a hybrid LSTM-GNN, and a standalone LSTM—are trained, validated, and tested at daily time steps from 2000 to 2017.

We further investigate the impact of network density and high-resolution (1km) soil moisture and precipitation data on discharge prediction accuracy. Our analysis reveals the potential advantages and limitations of these architectural approaches in river discharge prediction under high-resolution data availability and underscores the growing importance of harnessing graph-based deep learning methods for hydrological applications.

How to cite: Mosaffa, H., Prudhomme, C., Chantry, M., Stephens, L., Rüdiger, C., Pappenberger, F., and Cloke, H.: A Comprehensive Analysis of Graph Neural Networks for River Discharge Prediction: High-Resolution Applications in the Danube Basin, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-13016, https://doi.org/10.5194/egusphere-egu25-13016, 2025.

17:50–18:00

Orals: Fri, 2 May, 08:30–10:15 | Room 3.29/30

Chairpersons: Sandra Margrit Hauswirth, Hamid Moradkhani, Louise Slater
08:30–08:35
08:35–08:45
|
EGU25-608
|
ECS
|
Virtual presentation
Humaira Hamid and Sandeep Samantaray

The Jhelum River, which is the major river of the signal, contributes to the Indus River system as one of the notable tributaries and is bestowed with crucial importance in adhering its water for various uses, including irrigation, hydropower supply and domestic purposes. However, it is very vulnerable to serious floods that cause many losses of life and property. As a result, precise flood forecasting in the Jhelum River is essential to facilitate proper disaster response and prevention strategies. Flood forecasting is critical to disaster preparedness, particularly in countries vulnerable to repeated hydrologic disasters. This research aims to improve the flood forecasting technique applicable to the Kupwara district of Jammu and Kashmir, as the area is frequently ravaged by floods, mostly occasioned by its geographical and climatic attributes. We employ hydrometeorological data from January 1975 to December 2023 to investigate the interaction of the factors that determine flood occurrence, and we evaluate the capability of data-based models in providing reliable monthly flood predictions. This study proposes Support Vector Machines (SVMs), Artificial Neural Networks (ANNs), and finally, the hybrid model SVM-PSO (Particle Swarm Optimization) to predict flood events in the Jhelum River. The results of the hybrid SVM-PSO model show maximum goodness of fit with an R² value of 0.9562, minimum MSE value of 9.2237 and Nash-Sutcliffe Model Efficiency of 0.9483. These outcomes illustrate the model's strengths for flood forecasting for Kupwara; its application to disaster risk reduction is valuable. The study’s findings highlight the possibility of extending the applications of progressive AI tools to reduce the effects of flooding and preserve the areas’ susceptible populations and assets in the Kupwara district.

This research was supported by the Empowerment and Equity Opportunities for Excellence (EEQ) in Science (Dr SS) under SERB, Govt. of India, under grant no. EEQ/2023/000585

How to cite: Hamid, H. and Samantaray, S.: Hybrid SVM-PSO Approach for Flood Prediction in the Jhelum River Basin: A Case Study of Kupwara District, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-608, https://doi.org/10.5194/egusphere-egu25-608, 2025.

08:45–08:55
|
EGU25-921
|
ECS
|
On-site presentation
Priyam Deka and Manabendra Saharia

We propose a robust framework for flood forecasting that integrates advanced hydrological modeling with high-performance computing to deliver accurate and timely predictions. At its core, the India Land Data Assimilation System (ILDAS) leverages multiple Land Surface Models (LSMs) and routing models to enhance flood prediction capabilities. The framework utilizes the Noah-MP LSM for simulating land surface processes, with runoff routed using mizuRoute, a vector-based hydrodynamic model, to estimate critical flood metrics. Calibration is optimized through an HPC-enabled scheme, ensuring precise model tuning. The system ingests meteorological forecasts from the National Center for Medium-Range Weather Forecasting (NCMRWF), ISRO, and GEFS, offering lead times of 1 to 10 days. Operating at a spatial resolution of 12 meters, it delivers nationwide flood forecasts within 15 minutes, with outputs available at daily and sub-daily temporal resolutions. The flood forecast is improved with an ML based hydrological post-processor to enhance the forecast information. A case study focusing on the flood-prone Brahmaputra basin demonstrates the system's effectiveness in accurately predicting flood events, underscoring its potential as a valuable operational tool for flood preparedness and risk mitigation. 

How to cite: Deka, P. and Saharia, M.: A Medium-Range Ensemble Flood Forecasting System for India., EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-921, https://doi.org/10.5194/egusphere-egu25-921, 2025.

08:55–09:05
|
EGU25-13060
|
ECS
|
On-site presentation
Everett Snieder and Usman Khan

Flood early warning systems rely on accurate streamflow forecasts. Deep learning based approaches have been widely shown to outperform traditional, process-based approaches. While literature is rich with comparisons between these opposing modelling paradigms, most comparisons have been conducted at daily temporal resolutions and feature spatially coarse (i.e., lumped) process-based models. Flood forecasting applications, especially those in flashy urban catchments, rely on sub-daily forecasts. In this work, we compare the performance of a state-of-the-art regionally trained LSTM models with semi-distributed StormWater Management Models (SWMM) at temporal frequencies ranging from 15-minutes to 1-day, for roughly 40 highly urbanised catchments in Toronto, Canada. Results show that the LSTM approaches struggle at fine temporal resolution and when limited observed data is available. In contrast, SWMM models can be automatically parameterized and calibrated using comparatively much less data. While the amount of available historical data would be enough to train deep learning models at a daily resolution, it is insufficient to train hourly models, which we attribute to the comparatively more complex urban rainfall-runoff system. Potential solutions to this problem include model transfer between space and different temporal frequencies. Finally, another contribution of this work is LSTM hyperparameter optimization, which is not widely documented at a sub-hourly resolution. Results from this research reaffirm the need for multi-model approaches for flood forecasting, particularly in urbanised catchments.

How to cite: Snieder, E. and Khan, U.: Comparing deep-learning and semi-distributed models for flow forecasting at fine spatial and temporal resolutions: a case study of 40 urbanized catchments in Toronto, Canada., EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-13060, https://doi.org/10.5194/egusphere-egu25-13060, 2025.

09:05–09:25
|
EGU25-13539
|
ECS
|
solicited
|
On-site presentation
Andrew Bennett, Amanda Triplett, Peter Melchior, Reed Maxwell, and Laura Condon

Use of complex, high-resolution integrated hydrologic models offer the most comprehensive and detailed representations of groundwater, surface water, and land surface processes, but are challenging to use for forecasting tasks due to high computational costs and parameter uncertainty. On the flipside, machine learning approaches are highly accurate and can be computationally frugal for targeted tasks, but are difficult to audit and must be retrained to adapt to new tasks or domains.

In this work we present several case studies of using deep learning surrogate modeling approaches for integrated hydrologic modeling that alleviates many of the weaknesses of taking a purely physically based or purely data driven approach. We first show how deep learning surrogates can readily achieve orders of magnitude speedup over the original physically based models with high degree of accuracy, which allows for on demand forecasting. While this approach is great for generating forecasts from the original model configuration, it is still challenging to adapt to new scenarios such as use in parameter calibration or running long simulations such as climate change scenarios. We close the presentation by discussing recent work to address these challenges using model inversion techniques and by developing hybrid model emulation strategies.

How to cite: Bennett, A., Triplett, A., Melchior, P., Maxwell, R., and Condon, L.: Surrogate modeling for large-scale integrated hydrologic modeling: A case study in deep learning, model inversion, and hybrid methods across the Continental United States, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-13539, https://doi.org/10.5194/egusphere-egu25-13539, 2025.

09:25–09:35
|
EGU25-1365
|
ECS
|
Virtual presentation
Amit Kantode, Thallam Prashanth, and Sayantan Ganguly

Groundwater depletion in the Bist-Doab region of the Punjab State of India is a significant threat to sustainable agricultural practices, underscoring the need for effective management strategies. Modelling groundwater heads is essential for understanding groundwater flow dynamics, trends, and their interaction with surface water. It helps assess the aquifer's health, prevent over-extraction and contamination, and predict ambient groundwater responses to extreme events such as droughts or floods. Inaccurate groundwater models, which overestimate or underestimate groundwater levels and fail to capture temporal fluctuations, hinder proper water management. These errors lead to suboptimal decisions regarding water allocation and resource sustainability and ultimately impact crop yields and water availability.

This study aims to integrate physically-based models, such as those developed by MODFLOW, with machine-learning algorithms to improve prediction accuracy and support more informed decision-making. MODFLOW was used to simulate groundwater flow under both steady-state and transient conditions, utilizing field hydrogeological data from existing literature. Machine learning (ML) models, including Support Vector Regression (SVR), Random Forest (RF), and Artificial Neural Networks (ANN), were trained and tested on historical groundwater levels and meteorological data to enhance prediction accuracy.

The methodology employs data-driven models (DDMs) as error-correcting tools for the physically-based models. Historical residuals, calculated as the difference between observed and simulated groundwater heads, were used as inputs alongside features such as well location coordinates, simulated groundwater heads, and time of measurements. ML techniques such as SVR, RF and ANN were used to train the DDMs, which learn systematic errors in the physically-based model by analysing these historical residuals. Outputs include predicted systematic errors and updated groundwater heads, where corrections are applied to the initial simulated values. The effectiveness of the DDMs relies on the structure and patterns of the residuals in the physically-based model, with strong correlations between the groundwater heads, leading to better error correction and improved predictive accuracy.

Results show that integrating MODFLOW with ML, significantly reduces model error compared to traditional simulation approaches. The combined model effectively captures both seasonal fluctuations and long-term trends in groundwater levels, leading to more accurate predictions. The developed framework provides a reliable tool for improving groundwater resource management and optimizing water allocation strategies, ultimately supporting the sustainable management of groundwater in agriculturally stressed regions like Bist-Doab.

How to cite: Kantode, A., Prashanth, T., and Ganguly, S.: Development of a precise regional-scale groundwater model by coupling MODFLOW & Machine Learning algorithms: A case study in Bist-Doab region, Punjab, India, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-1365, https://doi.org/10.5194/egusphere-egu25-1365, 2025.

09:35–09:45
|
EGU25-16839
|
ECS
|
On-site presentation
Ali Shakil, Charlotte Sakarovitch, Najim Bouhafa, and Cyril Leclerc

Reliable forecasting of resource availability and demand evolution is paramount for effective community planning. The forecast horizon, which can range from a few days to several years, plays a crucial role in this process. Short-term forecasts allow for the optimization of withdrawals according to demand, while medium-term forecasts enable anticipation of resource scarcity risks, planning operations on sensitive structures, or anticipating demand peaks. Long-term forecasts, on the other hand, help anticipate and analyze development strategies and usage scenarios.
 
This study, a component of the Water Resources Forecast (WRF) SUEZ’s project, partially funded by the French Ecological Transition Agency (ADEME’s innov’eau initiative), introduces an innovative approach to predicting river flow rates. We assess the performance of three distinct model types: traditional lumped rainfall–runoff conceptual models with two reservoirs, classic AI models (Random Forest, LSTM, etc.), and hybrid models that synergize AI and conceptual models for enhanced predictive accuracy.
 
Preliminary findings suggest LSTM and that hybrid models utilizing LSTM demonstrate superior short-term performance, reducing error rates by 41% compared to standalone conceptual models. These results indicate the potential of AI and hybrid models in improving the accuracy of resource availability forecasts. The analysis of the medium and long-term performances of the forecast models is currently underway and the findings will be presented at the conference.
 
This ongoing research contributes to the development of Aquadvanced® Water Resources, a comprehensive platform aimed at monitoring and forecasting various resource types, both underground and surface. By aligning resource availability with water demand forecasts, this tool will facilitate resource management strategies.

How to cite: Shakil, A., Sakarovitch, C., Bouhafa, N., and Leclerc, C.: Advancing Resource Forecasting: An Evaluation of Hybrid Predictive Models on River Flow Rates, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-16839, https://doi.org/10.5194/egusphere-egu25-16839, 2025.

09:45–09:55
|
EGU25-18959
|
ECS
|
Virtual presentation
Enhancing Hydrological Forecasting in the Sabarmati Basin through Hybrid Approaches Integrating Physically-Based and Machine Learning Models
(withdrawn)
Samnan Kadri and Mohdzuned M. Shaikh
09:55–10:05
|
EGU25-19297
|
On-site presentation
Javier Senent-Aparicio, Patricia Jimeno-Sáez, Sara Asadi, Nerea Bilbao-Barrenetxea, Gerardo Castellanos-Osorio, Adrián López-Ballesteros, and Francisco Cabezas

The Segura River Basin, which supplies water for agriculture, receives water from the Upper Tagus River Basin (UTRB) through the Tagus-Segura water transfer, involving two reservoirs: Entrepeñas and Buendía. Accurate reservoir inflow forecasts, particularly seasonal ones, are crucial for making better and more reliable water transfer decisions. This study introduces a methodology for seasonal forecasting using ensemble weather forecasts from climate models, with a focus on the SEAS5 model from the European Centre for Medium-Range Weather Forecasts (ECMWF). Initially, by combining the global climate model with machine learning algorithms, bias correction of daily precipitation and temperature forecasts is achieved. The Soil and Water Assessment Tool (SWAT+) hydrological model is employed to simulate inflows to the Entrepeñas and Buendía reservoirs, calibrated against observed inflows. The first five years from 1995 to 1999 are used for warm-up, the period from 2000 to 2009 for calibration, and from 2010 to 2019 for validation. The calibrated SWAT+ model is then forced with bias-corrected meteorological data forecasts to predict reservoir inflows for the upcoming months. The SWAT+ model's performance during calibration and validation was very good, with monthly NSE values exceeding 0.7 and PBIAS values below 14% for both reservoirs. When the model was forced with bias-corrected hydrological forecasts, it performed well, demonstrating the effectiveness of bias-corrected forecasted meteorological data in predicting reservoir inflows. This work was supported by the Spanish Ministry of Science and Innovation, under grants PID2021-128126OA-I00.

Keywords: SWAT+, machine learning, coupled modelling, streamflow simulation, seasonal hydrological forecasting

How to cite: Senent-Aparicio, J., Jimeno-Sáez, P., Asadi, S., Bilbao-Barrenetxea, N., Castellanos-Osorio, G., López-Ballesteros, A., and Cabezas, F.: Coupling SWAT+ and machine learning-Enhanced global climate models for seasonal hydrological prediction, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-19297, https://doi.org/10.5194/egusphere-egu25-19297, 2025.

10:05–10:15

Posters on site: Fri, 2 May, 14:00–15:45 | Hall A

Display time: Fri, 2 May, 14:00–18:00
Chairpersons: Sandra Margrit Hauswirth, Hamid Moradkhani, Louise Slater
A.44
|
EGU25-9650
Global-Scale Evaluation of Drivers Influencing Seasonal Forecast Skills
(withdrawn)
Jude Lubega Musuuza, Yiheng Du, Louise Crochemore, and Ilias Pechlivanidis
A.45
|
EGU25-8061
Igor Leščešen, Pavla Pekárová, Pavol Miklánek, and Zbyňek Bajtek

The Carpathian Basin, is a climatically sensitive region influenced by Atlantic, continental and Mediterranean climates. Understanding the river dynamics in this region is crucial for sustainable water management given the diverse climatic and hydrological conditions. Despite extensive research, few studies have thoroughly compared the performance of advanced machine learning models for predicting river discharge in this region.

In this study we show that Random Forest (RF), LightGBM (LGBM), Support Vector Regression (SVR), Temporal Convolutional Networks (TCN) and XGBoost can improve streamflow prediction by utilizing their ability to capture nonlinear and temporal relationships in hydrological data. Using daily discharge data for 1961-2020 period from Danube, Sava, Tisa and Drava Rivers, we tested these models at six stations and analyzed their effectiveness using metrics such as RMSE, MAE and R². The Augmented Dickey-Fuller test confirmed the stationarity across all stations and thus confirmed the robustness of our prediction framework.

The RF model performed consistently better than the other models, achieving the lowest RMSE (e.g. 31.739 m³/s at Bezdan station and 19.582 m³/s in Donji Miholjac station) and the highest R² values (e.g. 0.999 at Szolnok and Bezdan station). In contrast, the SVR showed the weakest performance with significantly higher RMSE values and lower R² values at all stations. XGBoost and LGBM also performed strongly, but fell slightly short of the prediction accuracy of RF. These results emphasize the robustness of RF in capturing complex, nonlinear hydrological dynamics and its resistance to overfitting.

Our results suggest that Random Forest is the most reliable model for predicting discharge in the Carpathian Basin, providing high accuracy and robustness for different rivers. These results have significant implications for improving predictive hydrological models that enable more effective water resource management and adaptive strategies in climatically sensitive regions.

How to cite: Leščešen, I., Pekárová, P., Miklánek, P., and Bajtek, Z.: Streamflow Variability and Predictive Modeling in the Carpathian Basin: Assessing the Performance of Machine Learning Algorithms, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-8061, https://doi.org/10.5194/egusphere-egu25-8061, 2025.

A.46
|
EGU25-13252
|
ECS
Ciara Wall and Tiernan Henry

Understanding the relationships between rainfall and river flows, especially as climate shifts and rainfall patterns are modified, and in the context of human intervention, requires studying rivers within their catchments. This is especially important for ungauged, remote catchments, which are often understudied due to the challenges in data collection. In line with the EU Water Framework Directive, studying such catchments can offer valuable insights into broader hydrological processes. This study focuses on two instrumented, small, upland rivers near Newport, County Mayo, in the west of Ireland. The lack of data from most small, upland catchments highlights the importance of using innovative approaches like Machine Learning (ML) for hydrological forecasting. ML, a branch of artificial intelligence, enables the development of predictive models that identify patterns in meteorological and hydrological data, even in the absence of direct measurements. Using at least 48 months of meteorological and hydrological data, this research aims to model catchment behaviour and improve the understanding of hydrological responses to climate variability and change in remote areas. This work seeks to enhance our ability to predict the physical drivers of hydrological change and contribute to more accurate forecasting in ungauged catchments. 

How to cite: Wall, C. and Henry, T.: Machine Learning for Hydrological Forecasting in Ungauged Upland Catchments: A Case Study from County Mayo, Ireland, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-13252, https://doi.org/10.5194/egusphere-egu25-13252, 2025.

A.47
|
EGU25-3948
|
ECS
Seung Cheol Lee and Daeha Kim

This study explores the potential of a hybrid streamflow model that addresses the interpretability limitations commonly associated with the ‘black-box’ nature of machine learning models. The hybrid model simulates the rainfall-runoff process through its conceptual structure, integrating a deep-learning neural network to estimate the associated parameters. Using the Catchment Attributes and Meteorology for Large-sample Studies (CAMELS) dataset, we evaluated the model’s performance across 671 catchments in the United States, and compared it with the widely used Hydrologiska Byråns Vattenbalansavdelning (HBV) and the Long Short-Term Memory (LSTM) neural network. In gauged basins, where the three models were directly trained using runoff observations, the LSTM showed superior performance, achieving a median Kling-Gupta Efficiency (KGE) of 0.79. In comparison, the HBV model and the hybrid model attained median KGE values of 0.66 and 0.68, respectively. However, when the same catchments were treated as ungauged and runoff was predicted using regionalization approaches, the performance of all three models declined: the LSTM experienced a 17% reduction in KGE (0.79 → 0.66), while the hybrid and HBV models showed reductions of 13% (0.68 → 0.59) and 11% (0.66 → 0.59), respectively. The largest performance degradation observed in the LSTM underscores the advantage of the physical constraints inherent in the HBV and hybrid models, which help mitigate potential information loss. However, the hybrid model exhibited a ‘lower boundary problem,’ where it failed to generate hydrographs below a certain threshold. Although the hybrid model did not surpass the regionalized LSTM in performance, this study emphasizes the interpretability benefits offered by its conceptual structure. Furthermore, it highlights the hybrid model’s potential as an effective regionalization approach, combining the learnability of machine learning with the physical consistency of conceptual models.

 

Acknowledgements: this work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (RS-2024-00416443).

How to cite: Lee, S. C. and Kim, D.: Performance evaluation of coupled conceptual and machine-learning frameworks for streamflow prediction in ungauged basins, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-3948, https://doi.org/10.5194/egusphere-egu25-3948, 2025.

A.48
|
EGU25-883
|
ECS
|
Virtual presentation
|
Abdelrahman Habash, Onur Yuzugullu, and Emre Alp

Modeling rivers discharge is an essential tool for the sustainable management of freshwater systems, as it facilitates more efficient allocation and distribution strategies of water resources through accurate forecasting. Moreover, with the proper datasets and features engineering, it can also provide an accurate backcasting, thereby enhancing the understanding of the long-term effects of climate change on natural water bodies and providing valuable insights into the historical behavior of freshwater systems. With these objectives in mind, this study evaluates the capability of selected parameters of the ERA5-Land dataset in modeling rivers discharge using machine learning techniques. ERA5-Land is a widely acknowledged global reanalysis climate dataset known for its high temporal and spatial resolution, and made available by the Copernicus Climate Change Service (C3S). The research considers six diverse gauging stations across Switzerland, representing a variety of watershed characteristics and catchment sizes.

Two different data extraction schemes were employed through Google Earth Engine to process the ERA5-Land data: Station-based Climate Data (SCD) scheme, which extracts the climate data directly from the location of the station, and Catchment-based Climate Data (CCD) scheme, which aggregates the climate data over the entire catchment area of each station. Additionally, two feature engineering approaches were investigated. The (Raw features) approach, which used only the base climate parameters as model features, while the (Windowed features) approach utilized sliding windows with various temporal intervals for each parameter, and dynamically added the ones that have relative high Gini importance to the model features. Moreover, six machine learning methods were analyzed: Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Hybrid Convolutional Neural Network-LSTM (CNN-LSTM), Random Forests (RF), eXtreme Gradient Boosting (XGBoost), and Support Vector Machines (SVM). Making a total of 24 models for each station.

The models were evaluated on their ability to capture the monthly average discharge (m3/s) at each gauging station. Promising results were achieved, with testing R2 values ranging from at least 0.80 to as high as 0.92, and MAPE values of 10-17%, demonstrating the strong predictive potential of the ERA5-Land dataset for modeling rivers discharge.

Key findings include the superior performance of the (CCD) over the (SCD) in terms of ERA5-Land climate data extraction scheme. Additionally, (Windowed features) approach improved the model’s accuracy in general, though the degree of improvement varied across stations. Among the tested machine learning methods, (CNN-LSTM) was the most consistent and robust method, performing the best mostly, and providing a very close performance to the best model in cases where it was not. Nevertheless, (ANN), (LSTM), and (XGBoost) methods are also worth considering, as they achieved the best performance in some stations, depending on the discharge patterns.

This study underscores the applicability of the ERA5-Land dataset for rivers discharge modeling and offers insights into the climate data processing, feature engineering strategies, and machine learning techniques for hydrological modeling. These findings contribute to advancing predictive hydrology and inform future applications in water resource management and climate impact assessment.

 

 

 

 

 

*For Figures/Tables with good quality: (https://drive.google.com/drive/folders/1j8iJR7MsHnGkyD4F5y1hFZskTdwvRJGn)

How to cite: Habash, A., Yuzugullu, O., and Alp, E.: Evaluation of ERA5-Land Dataset for Modeling Rivers Discharge using Machine Learning: A Comparative Analysis, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-883, https://doi.org/10.5194/egusphere-egu25-883, 2025.

A.49
|
EGU25-14726
|
ECS
|
Mizuki Funato and Yohei Sawada

Despite the critical need for accurate flood prediction, water resource management, and climate impact planning, many regions—particularly in Asia, Africa, and South America—face a significant lack of river discharge observation. Although numerous hydrological and machine learning models have been proposed, it is still a grand challenge to achieve rainfall-runoff modeling which is accurate, interpretable, and computationally cheap even under conditions with limited river discharge observation data. We address this challenge by proposing a novel method that leverages multi-model ensemble and reservoir computing (RC). First, we applied Bayesian model averaging (BMA) to 43 “uncalibrated” catchment-based conceptual hydrological models. Second, we trained RC to correct errors in the BMA predictions of river discharge. Since training RC is intrinsically a linear regression to determine the weights of its output layer, there are no iterative computations in the whole process of our proposed method, which significantly enhances computational efficiency. Third, based on both the weights of BMA and RC obtained in gauged river basins, we inferred the corresponding weights for ungauged river basins by linking catchment attributes to these weights. We evaluated this method in 87 ungauged river basins in Japan and found that it achieved a median Kling-Gupta Efficiency (KGE) of 0.55 and a median Nash-Sutcliffe Efficiency (NSE) of 0.52. These results reveal that individual conceptual hydrological models do not necessarily need to be calibrated when an effectively large ensemble is assembled and combined with machine-learning-based bias correction. Furthermore, by leveraging the relationship between observed data and catchment attributes, our method enables river discharge prediction in ungauged basins, making it applicable to a wide range of regions.

How to cite: Funato, M. and Sawada, Y.: Multi-Model Ensemble and Reservoir Computing for Efficient River Discharge Prediction in Ungauged Basins, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-14726, https://doi.org/10.5194/egusphere-egu25-14726, 2025.

A.50
|
EGU25-7194
|
ECS
|
Léo Soucy, Richard Arsenault, and Jean-Luc Martel

Hydrological forecasting is essential across multiple sectors, including hydroelectric power generation, flood prediction and mitigation, and water resource management. In this field of research, Machine Learning (ML) models have shown promising results and are increasingly used to replace traditional hydrological models.

This work presents a novel framework for forecasting 14-day inflow volumes to a hydropower reservoir using deep-learning models and atmospheric reforecasts in a Canadian catchment. The forecasting framework investigates whether Long Short-Term Memory (LSTM) models can directly forecast inflow volumes without relying on intermediate daily streamflow predictions, and whether integrating meteorological reforecast data during training can enhance model performance and forecast quality.

Three LSTM models were trained using various combinations of meteorological data from the European Centre for Medium-Range Weather Forecasts (ECMWF), including ERA5 reanalysis data and probabilistic reforecast datasets. The target hydrological forecast is the 14-day cumulative inflow volume to the reservoir. The first model is trained exclusively with ERA5 data, the second using a combination of ERA5 data and reforecasts, and the third combining the training datasets of the first two models. The models are then used to generate hydrological forecasts using ECMWF ensemble meteorological forecasts and assessed with quantitative metrics such as the Kling-Gupta Efficiency (KGE), Continuous Ranked Probability Score (CRPS), and Average Bin Distance to Uniformity (ABDU).

Results indicate that the three LSTM models can directly predict 14-day cumulative inflow volumes with reasonable accuracy and reliability, yielding strong performance metrics. However, no single model consistently outperforms the others. The model trained solely on reanalysis data exhibits greater variability in its predictions, resulting in lower accuracy but higher reliability. Results also vary seasonally. These findings suggest that incorporating meteorological reforecast data during training offers valuable potential for improving inflow volume forecasts within specific seasons and depends on the desired trade-off between accuracy and reliability.

Overall, it can be stated that LSTM models are a promising alternative to current operational models for inflow volume forecasting, although further research is necessary to understand how to fully exploit their potential and ensure their applicability and transferability into an operational context.

How to cite: Soucy, L., Arsenault, R., and Martel, J.-L.: Assessing the value of meteorological reforecast data to predict inflow volumes over a Canadian snow-dominated catchment using a deep learning model, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-7194, https://doi.org/10.5194/egusphere-egu25-7194, 2025.

A.51
|
EGU25-11474
|
ECS
Dylane Martel, Jean-Luc Martel, and Richard Arsenault

Hydropower reservoirs typically require inflow forecasts to allow water resources managers to optimize drawdown rates and improve infrastructure efficiency. Usually, operators use physically-based or conceptual hydrological models to forecast streamflow for a desired lead-time, and then evaluate the total inflows for the period of interest. Recently, deep-learning models have been shown to provide better streamflow forecasts than classical hydrological models in certain cases. They have also shown better performance when trained on a multitude of donor catchments to increase the number of available data for training.

This work presents a novel method to provide inflow forecasts volumes directly, i.e. without first generating day-to-day streamflow, by using a deep-learning model trained on ensemble meteorological forecasts and observed inflow volumes for given lead-times. Furthermore, the model makes use of large-scale datasets during its training, by including data from 200 catchments in Canada. The model is then applied to a hydropower system reservoir to estimate 14-day inflow volume forecasts. The model shows promising results in terms of accuracy and reliability, and it is demonstrated that the addition of extra donor catchments during training helps increase the forecast performance. Furthermore, training the model using forecasted meteorological data as the inputs helps further increase model performance. This work demonstrates the potential residing in training regional models using meteorological forecasts for ensemble inflow volumes forecasts.

How to cite: Martel, D., Martel, J.-L., and Arsenault, R.: Inflow volume forecasting using regional deep learning models trained on operational meteorological ensemble forecasts in Canada, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-11474, https://doi.org/10.5194/egusphere-egu25-11474, 2025.

A.52
|
EGU25-14747
|
ECS
Urmin Vegad and Vimal Mishra

Floods are the most frequent natural disasters in India, causing widespread disruption to agriculture, infrastructure, and lives during the Indian summer monsoon. Dams play a critical role in mitigating downstream flooding by regulating reservoir storage and release. As the ability of dams to control floods strongly depend on antecedent reservoir storage, reservoir inflow forecasts are crucial for effective decision-making. Despite an extensive network of large dams, India currently lacks a reservoir inflow forecasting system incorporating all its major dams. Using the H08 land surface hydrological model with the CaMa-Flood hydrodynamic model, we developed a reservoir inflow forecast system for the major reservoirs in India. Using the meteorological forecasts from the Global Ensemble Forecast System (GEFS), the framework provides short-range inflow forecasts to support decision-making. This forecast system offers potential to optimize reservoir storage levels, attenuate projected inflows, and mitigate downstream flooding.

How to cite: Vegad, U. and Mishra, V.: Reservoir inflow forecast system for major Indian dams, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-14747, https://doi.org/10.5194/egusphere-egu25-14747, 2025.

A.53
|
EGU25-7726
|
ECS
Jiyeon Park, Seoyoung Kim, Gayoung Lee, Juyoung Shin, and Sangbeom Jang

For long-term time series forecasting of hydro-meteorological variables, physical models and artificial intelligence (AI)-based models have been used. Long-term forecast using physical models may have a limited predictive performance due to assumptions and conditions used in the physical model. Although AI-based models for long-term forecast of hydro-meteorological variables have a restricted capacity to explain phenomena, they have practical strengths such as high precision and short computation time. Zeng et al. (2022) proposed Long-Term Time Series Forecasting (LTSF) models and showed that they outperformed transformer-based AI models for long-term forecast In this study, a long-term forecasting model was developed using the LTSF model applied to dam inflow data and assess the feasibility of LTSF in the long-term forecast of inflow data. For comparison, The Long Short Term Memory (LSTM) algorithm was employed. The results show that the LTSF showed a comparable performance of long-term forecast to the LSTM although the structures of LTSF models are much simpler than LSTM. The LSTF models can be considered as a good alternative of LSTM when the forecast task need prompt computation.

Zeng, A., Chen, M., Zhang, L., & Xu, Q. (2022). Are Transformers Effective for Time Series Forecasting? The Chinese University of Hong Kong and International Digital Economy Academy (IDEA).

How to cite: Park, J., Kim, S., Lee, G., Shin, J., and Jang, S.: Development of a Long Term Forecasting Model for Dam Inflow in South Korea Using the LTSF Algorithm, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-7726, https://doi.org/10.5194/egusphere-egu25-7726, 2025.

A.54
|
EGU25-8432
|
ECS
|
Sarah Gautier, Guillaume Artigue, Yves Tramblay, and Anne Johannet

Flash floods are an important hazard that particularly affects the Mediterranean region. Flood forecasting using simulation tools adapted to this context is therefore a crucial issue. In exposed regions, the difficulty of measuring and forecasting the spatial variability and intensity of rainfall, as well as the difficulty of identifying processes at the necessary time and space scales, has often led to the use of highly conceptual - or even statistical - models that make few assumptions about hydrological processes. Among these, neural networks have proven their relevance for flash flood forecasting. However, without hydrometeorological coupling, flow forecasting is often limited to the response time of the basin, i.e. a few hours in general. The purpose is to find a way of increasing this lead time, which is often too short for crisis management.

A flood forecasting model for the Gardon de Mialet basin (Southern France) is being developed as part of the HydIA joint laboratory funded by the ANR (French National Research Agency) and the Synapse company, with the aim of developing a range of hydrometeorological forecasting services based on artificial intelligence approaches. The use of gridded observed data, like in a meteorological model, has enabled the neural network model implemented (Multilayer Perceptron) to reduce its sensitivity to support change.

In the absence of rainfall forecasts, performance decreases with the lead time. With perfect forecasts (observed data used as future data), performance remains high for lead times up to 24h. The model has been coupled with two high resolution weather models, AROME and ARPEGE (2.5km and 10km respectively), implemented by Météo-France for short-range numerical weather prediction. The use of forecasts from these meteorological models for the 49 events in the database enables us to identify the error generated by the hydrological model and that generated by the meteorological model, in comparison with perfect forecasts. Analysis of these errors opens operational perspectives for crisis management. It also makes it possible to improve model training based on perfectible forecast data, and to correct rainfall forecasting biases to achieve higher performance.

How to cite: Gautier, S., Artigue, G., Tramblay, Y., and Johannet, A.: Coupling high resolution meteorological models with neural networks for flash flood forecasting: implementation on a Southern France basin, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-8432, https://doi.org/10.5194/egusphere-egu25-8432, 2025.

A.55
|
EGU25-17758
|
ECS
Jinjie Zhao and Carlo De Michele

Compound flood events result from multiple physical processes across spatial and temporal scales. Analyzing floods from a single or univariate perspective can underestimate their complexity and dynamics. Conversely, understanding flood mechanisms from a compound perspective facilitates the development of effective adaptation strategies. Here, we made a process-based analysis using a hybrid model, that integrates a hydrological model (namely HBV) with a machine learning model for hydrodynamics, to assess flood events from a compound perspective. This hybrid approach balances model interpretability with computational efficiency. This study aims to quantify the contribution of compound factors to flood events, selected from the Global Flood Database. The findings may serve as a reference for analyzing other types of compound events.

How to cite: Zhao, J. and De Michele, C.: Understanding flood events from the compound perspective, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-17758, https://doi.org/10.5194/egusphere-egu25-17758, 2025.

A.56
|
EGU25-8202
|
ECS
Hyuna Woo, Bomi Kim, Hyeonjin Choi, Minyoung Kim, and Seong Jin Noh

As climate change intensifies hydrologic extremes, the need for near real-time urban flood prediction becomes critical. Pluvial flooding occurs when intense rainfall overwhelms urban drainage systems, involving complex hydrodynamic interactions between surface runoff and subsurface sewer flow—known as dual drainage. Capturing the spatiotemporal evolution of these processes requires detailed representations of flow patterns, inundation propagation, and runoff accumulation. However, physics-based hydrodynamic models, while effective at resolving the fine-scale dynamics of flood events, face significant computational limitations, particularly for large urban areas or high-resolution domains. To address this challenge, we propose a deep learning-based urban flood prediction model that integrates surface runoff dynamics with sewer network interactions. The model is developed using training data generated from physics-based 1D-2D hydrodynamic simulations that capture interactions between 2D surface flow and 1D sewer network flow. The Oncheoncheon River catchment in Busan, South Korea—a region frequently impacted by urban flooding—serves as the study area. Various synthetic rainfall scenarios are used to train the model, ensuring its ability to generalize across different extreme rainfall events. Model validation against historical flood events shows that the deep learning model accurately predicts flood evolution patterns while significantly reducing computational time compared to traditional hydrodynamic models. This study demonstrates the potential of deep learning-based approaches to enhance real-time urban flood prediction and provides valuable insights for developing efficient, data-driven disaster management strategies.

How to cite: Woo, H., Kim, B., Choi, H., Kim, M., and Noh, S. J.: Deep Learning-Based Urban Pluvial Flood Modeling using High-resolution Physical Information, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-8202, https://doi.org/10.5194/egusphere-egu25-8202, 2025.

A.57
|
EGU25-12201
Yao Li, Alfred Stein, and Frank Osei

Urban flooding, driven by rapid urbanization and climate change poses critical challenges globally. This research develops an innovative framework, combining diverse models, data and methods to address flood susceptibility, intensity prediction, and inundation simulation across multiple scales. The framework includes: (1) A machine learning based method to assess flood susceptibility, using social media data and environmental factors. It provides low-cost and real-time insights into flood-prone areas. (2) The Log-Gaussian Cox Process (LGCP) model as a spatial statistical model, for predicting flood intensity while capturing unexplained spatial variability; (3) A coupled 1D-2D hydrodynamic model that integrates a 1-dimensional flooding model with a 2D spatial model to simulate inundation. The framework was applied in the rapidly urbanizing regions of Chengdu and Haining, China. Key flood drivers were identified, vulnerable areas were highlighted, and actionable insights for urban flood mitigation were provided. By integrating data-driven models, spatial analysis, and physical simulations into a single framework, this research offers a scalable and comprehensive approach for urban flood management, with potential applications to other natural hazards globally.

How to cite: Li, Y., Stein, A., and Osei, F.: Integrating data-driven and physical models for urban flood prediction in a single framework , EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-12201, https://doi.org/10.5194/egusphere-egu25-12201, 2025.

A.58
|
EGU25-5612
|
ECS
Wei Yang Hong, Shao Wei Ho, and Wen Ping Tsai

Hydrological models, such as rainfall-runoff and groundwater models, require the accurate calibration of multiple unobserved parameters to function effectively. While various methods, including genetic and evolutionary algorithms, have been developed for this purpose, traditional calibration techniques often fall short. They tend to focus on individual locations, leading to suboptimal, local solutions and results in discontinuous parameter estimates, even in geographically similar adjacent regions. To address these challenges, we propose a differentiable Parameter Learning (dPL) framework that harnesses the power of deep learning for the comprehensive calibration of hydrological model parameters across both temporal and spatial domains. This innovative approach moves beyond the constraints of traditional methods by integrating the extensive learning capabilities of deep learning to achieve more consistent and accurate parameter estimation. In this study, we apply the dPL framework to the HBV (Hydrologiska Byråns Vattenbalansavdelning) rainfall-runoff model, a conceptual lumped model that represents an entire watershed as a system comprising a soil layer, an upper tank, and a lower tank. The study area encompasses the upstream regions of five government-managed rivers of Taiwan, covering six distinct watersheds, each with unique geographical characteristics. The results demonstrate that the dPL framework not only outperforms traditional calibration methods but also enhances physical coherence and generalizability. These findings highlight the potential of the dPL framework as a robust tool for hydrological model calibration.

Keyword:differentiable Parameter Learning Framework,HBV Rainfall-Runoff Model,Surrogate Model,Long Short-Term Memory (LSTM)

How to cite: Hong, W. Y., Ho, S. W., and Tsai, W. P.: Differentiable Parameter Learning Framework for Calibration of Hydrological Model Parameters, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-5612, https://doi.org/10.5194/egusphere-egu25-5612, 2025.

A.59
|
EGU25-750
|
ECS
Omid Mohammadiigder, Ricardo Mantilla, and Chandra Rajulapati

Quantitative precipitation estimates (QPE), the main input driving hydrological model simulations, are known to have different levels of uncertainty across spatial and temporal scales. These uncertainties propagate through model simulations and thus lead to erroneous estimations of hydrological variables and extreme events. The role of equifinality—where different model structures or parameter sets produce similarly acceptable hydrological results—needs further research in the context of precipitation error propagation. Additionally, while data assimilation (DA) is a well-established method to improve model predictive performance by addressing various sources of uncertainty, its application to precipitation error propagation under the influence of model equifinality has received limited attention. This study investigates these gaps by leveraging the Raven hydrological modelling framework in combination with the dynamically dimensioned search (DDS) algorithm to calibrate streamflow at the outlets of multiple catchments across Southern Manitoba. Hence, different sets of optimized parameters are identified for each catchment, reflecting equifinality in the model structure and calibration. Subsequently, the calibrated model is driven by precipitation estimates from various satellite-based and reanalysis precipitation products to examine the propagation of precipitation errors through hydrological simulations. Finally, the study evaluates the effectiveness of streamflow data assimilation in correcting precipitation-induced errors in streamflow and improving the accuracy and robustness of the hydrological model. By systematically addressing the interplay between precipitation uncertainty, model equifinality, and data assimilation, this work provides novel insights into improving hydrological simulations.

How to cite: Mohammadiigder, O., Mantilla, R., and Rajulapati, C.: Simultaneous Evaluation of Streamflow Data Assimilation for Addressing Precipitation Error Propagation and Hydrological Model Equifinality , EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-750, https://doi.org/10.5194/egusphere-egu25-750, 2025.

A.60
|
EGU25-14838
|
ECS
Machine Learning-Based Assessment of High-Impact Low Likelihood Precipitation Events in North India
(withdrawn after no-show)
Aayushi Tandon, Amit Awasthi, and Kanhu Charan Pattnayak
A.61
|
EGU25-1452
A Hybrid CNN-LSTM Approach for Precipitation Forecasting under Climate Change Scenarios
(withdrawn after no-show)
Tiantian Tang and Guan Gui
A.62
|
EGU25-2685
Precipitation Forecasting Using Hybrid Data-Driven Modeling
(withdrawn after no-show)
Sanaz Moghim and Kianoush Kadkhodaei
A.63
|
EGU25-4743
|
ECS
Vivek Kumar Bind, Hiren Solanki, Vikrant Jain, and Vimal Mishra

Suspended Sediment Load (SSL) plays a crucial role in water resources management, agriculture, infrastructure development, river morphology, and ecological balance. SSL also affects the estuary and marine ecosystem as sediment is a habitat for invertebrates. Furthermore, excessive SSL poses significant challenges upstream of dams by reducing their water storage capacity. A warming climate is expected to influence the streamflow and, subsequently, the SSL of Indian river basins. While extensive research has been conducted to estimate streamflow under historical and future climate projection scenarios, further studies addressing streamflow and SSL dynamics need to be investigated. Recently, Physics Informed Machine Learning (PIML) has shown better performance over individual Physics-based hydrological (PBH) and Machine Learning (ML) models. We employed PBH, ML, and PIML models to predict streamflow and SSL in the Tapi River basin. Our study focused on a ~56,000 km² area to evaluate the impact of SSL on the Ukai dam, the largest dam located approximately 600 km downstream from the river's origin. The Ukai dam features an area of ~612 million m² and a total storage capacity of ~7,414 million m³. We used the Soil Water Assessment Tool (SWAT) as PBH, Long-Short-Term Memory (LSTM) as ML, and SWAT-informed LSTM as the PIML model. Our results show that the PIML model performs best for the historical streamflow and SSL simulation. We then used the generated PIML model to predict streamflow and SSL under future climate scenarios for SSP126 and SSP585. Bias-corrected climate data for future scenarios were derived from the four General Circulation Models (BCC-CSM2-MR, CMCC-ESM2, INM-CM5-0, and NorESM2-MM) included in the Coupled Model Intercomparison Project-6 (CMIP6). These datasets provided projections for precipitation, maximum and minimum temperatures, and wind speed. The models were applied to simulate historical (1951–2014) and future (2015–2100) streamflow and SSL under SSP126 and SSP585 scenarios. Our analysis indicates that SSL and streamflow will increase under the SSP126 and SSP585 scenarios. This increase in SSL will reduce the water storage capacity of the Ukai dam to 54% and 56% under the SSP126 and SSP585 scenarios, respectively. Such reductions in dam capacity and increased streamflow by 39% and 51% for SSP126 and SSP585, respectively, will pose significant challenges in managing extreme flood events in the future. Our findings hold critical implications for water resource management, flood risk mitigation, and the sustainability of river ecosystems.

How to cite: Bind, V. K., Solanki, H., Jain, V., and Mishra, V.: Sediment dynamics under historical and future climate projection scenarios in the Tapi River basin, India, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-4743, https://doi.org/10.5194/egusphere-egu25-4743, 2025.

A.64
|
EGU25-11162
|
Hitoshi Miyamoto and Naoya Maeda

This study developed an ML (machine learning) model that predicts the vegetation distribution of the following year from the current year's conditions by applying the ML model in multiple stages. The target rivers examined in this study were five Japanese large rivers, i.e., Kinugawa, Edogawa, Yahagigawa, Shonaigawa, and Ibogawa. The multi-stage ML model's explanatory and target variables were created for each river segment using DEMs (Digital Elevation Models) and river environment base maps. The multi-stage ML model consisted of three ML stages to predict the vegetation distribution of the following year from the current river vegetation distribution and topographical information. The advantage of the multi-stage ML model was that a third-stage vegetation distribution prediction model could be constructed according to the difficulty of prediction using a second-stage classification result. XGB (eXtreme Gradient Boosting) was used as the machine learning model. SHAP (SHapley Additive exPlanations) was used for factor analysis. F1 score with five-fold cross-validation was used to evaluate the model's accuracy. The result of the multi-stage ML model for the five target rivers showed that the F1 score was 0.8 or higher for all rivers except the Kinugawa River. The multi-stage ML model had an accuracy of 10% higher F1 score than a conventional single ML model. The vegetation distribution probability map indicated that the prediction had a high general accuracy but dropped near the boundary between the river's low water channel and the floodplain. SHAP analysis revealed the three prominent factors for vegetation existence: (i) the relative height near the levee and in the center of the floodplain, (ii) the distance from the river water's edge near the low water channel, and (iii) the vegetation existence history at the boundary between the low water channel and the floodplain. These results suggest that combining the prediction map with factor analysis could identify the factors that significantly influence where vegetation recruits in a river course.

How to cite: Miyamoto, H. and Maeda, N.: A multi-stage machine learning application for predicting vegetation distribution and its factors in river channels, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-11162, https://doi.org/10.5194/egusphere-egu25-11162, 2025.

A.65
|
EGU25-16009
Amit Kumar Srivastava, Krishnagopal Halder, Yue Shi, Liangxiu Han, Radwa EI Shawi, Jan Timko, Wenzhi Zheng, Gang Zhao, Karam Alsafadi, Manmeet Singh, Dominik Behrend, Thomas Gaiser, and Frank Ewert

The dual challenges of climate change and a growing population exceeding 9 billion by 2030 necessitate precise regional crop yield prediction models to optimize management, ensure food security, and guide agricultural decisions. Machine learning (ML), leveraging big data and high-performance computing, provides powerful tools for addressing these complexities but faces challenges such as inconsistent data quality and variable algorithm performance. While ML algorithms like Convolutional Neural Networks (CNNs), Random Forests (RF), and Long Short-Term Memory (LSTM) networks show promise in crop yield prediction, their performance can be hindered by data noise and incompleteness. Diffusion (a probabilistic generative model), with its iterative denoising capabilities, offers resilience to these issues and holds significant potential to improve accuracy and reliability in crop forecasting, though their use in this domain remains largely untapped.

This study compared XGBoost (XGB), a state-of-the-art tree-based ML model, with our proposed Diffusion-reg (DR) model. The input data for the models was compiled from multiple sources, including crop calendar data from MIRCA2000, net primary production (NPP) data from WAPOR, soil data from the Soil-Grids database, and maize crop yield data from the FAO database. Climate variables such as precipitation, air temperature, and solar radiation were obtained from ERA5, with all data aggregated into decadal periods. Additionally, Leaf Area Index (LAI) and Normalized Difference Vegetation Index (NDVI) data from MODIS were collected at 16-day intervals. In the subsequent step, maize yield data at the country level from the FAO was spatially disaggregated to produce pixel-scale estimates (250 m resolution, aligned with the soil input data resolution). This process focused exclusively on cropland areas within the five major maize-producing countries in Sub-Saharan Africa.

The evaluation of model performance metrics highlights the consistent superiority of the DR model over XGB across all analyzed countries. The R2 values, which measure the proportion of variance explained by the models, indicate higher predictive accuracy for Diffusion-reg in every instance. For example, in Ethiopia, the DR achieves an almost perfect R2 of 0.98 compared to XGB’s 0.95, while the largest gap is observed in South Africa, with R2 values of 0.86 for DR and 0.76 for XGB. These results highlight the DR model’s ability to effectively capture complex data patterns, even in regions with higher predictive challenges.
Further, the RMSE (Root Mean Squared Error) and MAE (Mean Absolute Error) metrics reinforce the DR model’s superior predictive precision. Across all countries, DR consistently exhibits lower error values, with Ethiopia showing the best performance (RMSE: 0.02, MAE: 0.01). Although South Africa records the highest RMSE (0.25) and MAE (0.13) for the DR model, these metrics still significantly outperform those of XGB. Similar trends in Uganda and Mozambique, where the DR model achieves substantial reductions in error, further validate its robustness and reliability.
In summary, the DR model consistently outperforms XGBoost in diverse regional contexts, highlighting its potential for broader application in predictive tasks requiring high accuracy and resilience.

How to cite: Srivastava, A. K., Halder, K., Shi, Y., Han, L., EI Shawi, R., Timko, J., Zheng, W., Zhao, G., Alsafadi, K., Singh, M., Behrend, D., Gaiser, T., and Ewert, F.: Advancing Crop Yield Predictions: The Potential of Diffusion Models in Machine Learning for Agriculture, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-16009, https://doi.org/10.5194/egusphere-egu25-16009, 2025.

Posters virtual: Tue, 29 Apr, 14:00–15:45 | vPoster spot A

Display time: Tue, 29 Apr, 08:30–18:00
Chairpersons: Yonca Cavus, Boen Zhang

EGU25-19530 | Posters virtual | VPS9

Applicability of deep learning based detection of surface weather fronts on large scale climate models
(withdrawn after no-show)

Yiwen Mao and Tomohito Yamada
Tue, 29 Apr, 14:00–15:45 (CEST)   vPoster spot A | vPA.23