Session HS3.1

[Programme]

HS3.1 | Hydroinformatics: data analytics, machine learning, hybrid modelling, optimisation

Hydroinformatics: data analytics, machine learning, hybrid modelling, optimisation

Convener: Claudia BertiniECSECS | Co-conveners: Amin Elshorbagy, Alessandro AmarantoECSECS, Niels Schuetze

Orals

| Mon, 24 Apr, 08:30–12:25 (CEST), 14:00–15:45 (CEST)

Room 3.29/30

Posters on site

| Attendance Tue, 25 Apr, 08:30–10:15 (CEST)

Hall A

Posters virtual

| Attendance Tue, 25 Apr, 08:30–10:15 (CEST)

vHall HS

Hydroinformatics has emerged over the last decades to become a recognised and established field of independent research within the hydrological sciences. Hydroinformatics is concerned with the development and hydrological application of mathematical modelling, information technology, systems science and computational intelligence tools. We also have to face the challenges of Big Data: large data sets, both in size and complexity. Methods and technologies for data handling, visualization and knowledge acquisition are more and more often referred to as Data Science.

The aim of this session is to provide an active forum in which to demonstrate and discuss the integration and appropriate application of emergent computational technologies in a hydrological modelling context. Topics of interest are expected to cover a broad spectrum of theoretical and practical activities that would be of interest to hydro-scientists and water-engineers. The main topics will address the following classes of methods and technologies:

* Predictive and analytical models based on the methods of statistics, computational intelligence, machine learning and data science: neural networks, fuzzy systems, genetic programming, cellular automata, chaos theory, etc.
* Methods for the analysis of complex data sets, including remote sensing data: principal and independent component analysis, time series analysis, information theory, etc.
* Specific concepts and methods of Big Data and Data Science
* Optimisation methods associated with heuristic search procedures: various types of genetic and evolutionary algorithms, randomised and adaptive search, etc.
* Applications of systems analysis and optimisation in water resources
* Hybrid modelling involving different types of models both process-based and data-driven, combination of models (multi-models), etc.
* Data assimilation and model reduction in integrated modelling
* Novel methods of analysing model uncertainty and sensitivity
* Software architectures for linking different types of models and data sources

Applications could belong to any area of hydrology or water resources: rainfall-runoff modelling, flow forecasting, sedimentation modelling, analysis of meteorological and hydrologic data sets, linkages between numerical weather prediction and hydrologic models, model calibration, model uncertainty, optimisation of water resources, etc.

Orals: Mon, 24 Apr, 08:30–15:45 | Room 3.29/30

Chairpersons: Claudia Bertini, Niels Schuetze

08:30–08:35

5-minute convener introduction

Machine learning models

08:35–08:45

EGU23-1160

ECS

On-site presentation

Global streamflow modelling using process-informed machine learning

Michele Magni, Edwin H. Sutanudjaja, Youchen Shen, and Derek Karssenberg

Hydrological models include errors when reproducing real-world observations, due to uncertainties in their components that inevitably propagate to the simulated variable. A large body of research in streamflow prediction blends statistical learning into the hydrological sciences, modelling river discharge using meteorological variables and catchment attributes as predictors of observed streamflow.

We developed a novel hybrid framework that integrates information from the process-based global hydrological model PCR-GLOBWB to reduce prediction errors in streamflow simulations. Our statistical methodology employs simulated streamflow and state variables from PCR-GLOBWB as additional predictors of observed river discharge. These model outputs provide supplemental information that is effectively used in a random forest, trained on a global database of streamflow measurements, to improve estimates of simulated river discharge across the globe. PCR-GLOBWB was run for the years 1979-2019 at 30arcmin and daily resolution, and the simulated state variables were then aggregated to monthly time steps. A single random forest model was trained with these state variables, meteorological data and catchment attributes, as predictors of observed streamflow from 2286 stations worldwide.

Results based on cross-validation show that the model is capable of discerning between a variety of hydro-climatic conditions and river flow dynamics, improving KGE of PCR-GLOBWB simulations at more than 80% of testing locations and increasing median KGE from -0.02 in uncalibrated runs to 0.52 after post-processing. Performance boosts are usually independent of availability of streamflow data at a particular station, thus making our method a potential candidate in addressing prediction in poorly gauged and ungauged basins.

Further research is still needed to test the potential influence of additional predictors describing catchment and time-series behaviour. Cluster analysis is required to understand why the post-processing framework still performs poorly at some stations. For prediction purposes, future efforts should also be directed at testing the model at higher spatial resolutions globally, and at finer temporal resolutions.

How to cite: Magni, M., Sutanudjaja, E. H., Shen, Y., and Karssenberg, D.: Global streamflow modelling using process-informed machine learning, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-1160, https://doi.org/10.5194/egusphere-egu23-1160, 2023.

08:45–08:55

EGU23-1672

ECS

On-site presentation

An open source library for environmental isotopic modelling using machine learning techniques

Ashkan Hassanzadeh, Sonia Valdivielso, Enric Vázquez-Suñé, Rotman Criollo, and Mercè Corbella

Stable isotopic composition modelling of water is an important part of resource management studies. We present a tool that estimates water stable isotope compositions using discontinuous inputs in time and space through machine learning algorithms. This tool has a multi-stage coupled algorithm that firstly calculates the parameters defined by the user that potentially affect the isotopic composition such as meteorological parameters, then, integrates the results of different parameters and generates the isotopic composition models for each time window. Isocompy time windows can be defined flexibly based on the amount of spatial-temporal properties of the available data. A variety of decision-making algorithms are implemented in this tool as an optional support to the user in different stages: from dataset preprocessing, outlier detection, statistical analysis, feature selection, model validation and calibration to postprocessing. Reports, figures, datasheets and maps could be generated in each step to clarify the underlying processes.

All in all, this tool aims (1) to offer an integrated, open-source Python library that is dedicated to the water isotopic composition statistical-regression modelling (2) to potentially improve our understanding of the precipitation stable isotopes by implementing novel machine-learning tools; and (3) to ensure reproducible research in environmental studies.

How to cite: Hassanzadeh, A., Valdivielso, S., Vázquez-Suñé, E., Criollo, R., and Corbella, M.: An open source library for environmental isotopic modelling using machine learning techniques, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-1672, https://doi.org/10.5194/egusphere-egu23-1672, 2023.

08:55–09:05

EGU23-2343

ECS

On-site presentation

Revealing how precipitation extremes impact river floods in a warming climate with interpretable machine learning

Shijie Jiang and Jakob Zscheischler

An intensified hydrological cycle due to climate change is expected to increase precipitation extremes, but how river flood magnitudes will respond to this change remains disputed. Historically, there is only limited observational evidence that increasing precipitation extremes directly translate into systematically increased flood magnitudes. The incongruence between extreme precipitation and flooding is likely related to the compounding nature of various flooding drivers such as snowmelt and antecedent soil moisture. This complex interplay between flooding drivers makes it challenging to predict flood risks under warming. In order to better understand how precipitation extremes affect river floods in a warming climate, it is essential to disentangle the impacts of different drivers and conditions in flood generation. In this study, we employ an interpretable machine learning approach together with a large-sample hydrological dataset to identify the impact of various drivers in flood generation across a myriad of globally distributed catchments. We analyze how these impacts change with warmer temperatures and how – in response – their relationships with flood occurrence and magnitude change. The results indicate that increases in precipitation extremes have indeed contributed increasingly to flood generation in many regions over the historical period. The fact that flood magnitudes did not necessarily increase is likely a result of decreasing contributions of other drivers. We further investigate how future floods may change given the continuously rising trend of precipitation extremes. Overall, the study emphasizes the value of interpretable machine learning in helping understand how flood risks are likely to change in a warming climate.

How to cite: Jiang, S. and Zscheischler, J.: Revealing how precipitation extremes impact river floods in a warming climate with interpretable machine learning, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2343, https://doi.org/10.5194/egusphere-egu23-2343, 2023.

09:05–09:15

EGU23-4367

On-site presentation

Fluvial land cover classification by using CSC deep learning method with UAV airborne images

Hitoshi Miyamoto and Ryusei Ishii

This study investigated a deep learning method for fluvial land cover classification using aerial imagery of the UAV (Unmanned Aerial Vehicles). The deep learning used in this study was the CNN-Supervised Classification (CSC) developed by Carbonneau et al. (Remote Sensing of Environment, 2020). The analysis was based on 51 river sections of RGB orthorectified images taken aerially in 2015-2019 in several river channels of the Kinu River in Japan. They were obtained by applying the SfM (Structure from Motion) processing to the UAV aerial images acquired during field observations. The spatial resolution of the images was approximately 4 cm per pixel. The seven land cover types classified by CSC were water surface, gravel, sand, grass, tree, farmland, and artificial land. The deep learning algorithm CSC in this study was a classification model combining two-stage Convolution Neural Networks (CNNs). The first stage of the CSC classified the input image into 200 x 200-pixel image tiles and created a training dataset to be used in the second stage. Then, the training dataset was used to train a second-stage small-scale CNN (hereafter called mini-CNN) to optimise the model hyper-parameters. Finally, the trained CSC performed pixel-based land cover classification of the RGB orthoimages. In the first stage, this study used an existing CNN architecture, VGG16. The fine-tuning dataset had more than 2,500 images for each land cover class, resulting in a total of 85,800 through data augmentation. The hyper-parameters examined were the learning rate, patch size and the number of frozen layers. The F-measures for the CSC first stage with the optimised parameters were 99.1, 96.9, 92.6, 91.4, 93.6, 95.7 and 96.1% for water surface, gravel, sand, grass, tree, farmland, and artificial land, respectively. Then, the architecture of the mini-CNN, learning rate, patch size, patch number and filter size were optimised for the CSC second stage. The weighted average F-measure for the optimised CSC model was 90.4%. This confirmed that the optimised CSC could reproduce the land cover classes with enough accuracy. The CSC application to the RGB orthorectified images of the Kinu River in Japan showed that the CSC deep learning method could accurately classify temporal changes in fluvial geomorphologies such as gravel beds and sandbars as well as riparian vegetation, including the significant differences before and after the severe floods in 2015 and 2019. Future work would be needed to verify the applicability of the proposed CSC deep learning method to other rivers with different fluvial characteristics.

How to cite: Miyamoto, H. and Ishii, R.: Fluvial land cover classification by using CSC deep learning method with UAV airborne images, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-4367, https://doi.org/10.5194/egusphere-egu23-4367, 2023.

09:15–09:25

EGU23-14581

ECS

On-site presentation

Application of machine learning techniques in groundwater potential mapping in the Korba coastal aquifer Cap Bon Peninsula

Ichrak Khammessi and hamdi omar

Groundwater potential mapping (GWPM) in the coastal areas is central for the decion making and development of the society and the environment. The current study endeavours the delineation of the groundwater potential zones of Korba coastal aquifer in the Cap-Bon Peninsula in the North East of Tunisia, using three different machine Learning techniques : random forest (RF), boosted regression tree (BRT), and the ensemble of RF and support vector machine (SVM). In order to achieve the objective, 17 groundwater influencing factors including elevation, slope, aspect, slope length (LS), profile curvature, plan curvature, topographical wetness index (TWI), distance from streams, distance from lineaments, lithology, geomorphology, soil, land use, normalized difference vegetation index (NDVI),Stream Power Index,Drainage Density and rainfall were considered for inter-thematic correlations and overlaid with wells locations and Transmissivity data in a spatial database. A total of 225 wells locations were identified, which had been divided into two classes: training and validation, at the ratio of 70:30, respectively. The RF, BRT, and RF-SVM ensemble models have been applied to delineate the groundwater potential zones. These models were validated with area under the receiver operating characteristics (AUROC) curve. The accuracy of RF (92%) and hybrid model (89.2%) was more efficient than BRT (85.6%) model. The results of the study will help the decision-makers, government agencies, and private sectors for sustainable planification of groundwater resources in the study area.

How to cite: Khammessi, I. and omar, H.: Application of machine learning techniques in groundwater potential mapping in the Korba coastal aquifer Cap Bon Peninsula, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-14581, https://doi.org/10.5194/egusphere-egu23-14581, 2023.

09:25–09:35

EGU23-11950

ECS

On-site presentation

Comparison of Neural network based and Kernel based Machine Learning approaches for daily forecasting of reference evapotranspiration in data scarce regions

Nehar Mandal and Kironmala Chanda

Efficient estimation and forecast of reference evapotranspiration (ET_O) is crucial for water resources management and for developing an efficient irrigation practice that will help better utilization of scanty water resources. This is a more challenging task in data scarce regions. This study aimed at multi-step ahead prediction of ET_O across different cropping seasons and agro-climatic regions in India with six Machine learning (ML) based techniques using globally available fifth-generation European Centre for Medium-Range Weather Forecasts (ECMWF) gridded reanalysis products (ERA5). For real-time, one-day, two-day, and seven-day ahead prediction of ET_O, this study evaluates and compares the capability and prediction accuracy of deep learning algorithms, i.e., Support Vector Regression (SVR), Multivariate Adaptive Regression Splines (MARS), Random Forest (RF), Multi-Layer Perceptron (MLP), one-dimensional Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). ML based models were developed using meteorological observations along with ERA5 inputs at three meteorological stations: Nagpur, Hyderabad and Bhubaneswar. The results indicate that MLP, SVR and CNN outperform other ML algorithms. High performing models, one each (MLP and SVR) from neural network-based models and kernel-based models respectively, are further utilized to scale up the analysis for gridwise ET_O forecasting across the whole of India. The Global Land Evaporation Amsterdam Model (GLEAM) dataset has been used as reference to evaluate gridwise ET_O forecasts. ET_O predicted by MLP model shows better agreement with GLEAM ET_O values during the Rabi cropping season (October-March) (MAE = 0.103 mm/day and NRMSE = 3.9 %) than during the Kharif season (June-September) (MAE = 0.151 mm/day and NRMSE = 4.5 %). As expected, the accuracy of the models drops with increase in the prediction horizon from real-time to seven-day; for instance, with MLP, MAE = 0.146 mm/day, R² = 0.955 for real-time and MAE = 0.173 mm/day, R² = 0.939 for seven-day ahead prediction over arid agro-climatic zone during Rabi season. However, even the minimum forecast performance observed in the semi-arid tropics region during Rabi season is reasonably good (MAE = 0.396 mm/day, R² = 0.704 for real-time evaluation and MAE = 0.445 mm/day, R² = 0.56 for seven-day ahead). This strengthens the potential of the proposed models for multi-step ahead ET_O forecasting across varied cropping seasons and agro-climatic regions without depending on meteorological station data.

How to cite: Mandal, N. and Chanda, K.: Comparison of Neural network based and Kernel based Machine Learning approaches for daily forecasting of reference evapotranspiration in data scarce regions, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-11950, https://doi.org/10.5194/egusphere-egu23-11950, 2023.

09:35–09:45

EGU23-4440

ECS

On-site presentation

Developing a random forest model to quantify streamflow intermittency in Pan-Europe at a spatial resolution of 15 arc-sec

Mahdi Abbasi, Tim Trautmann, and Petra Döll

Intermittent streams, where water ceases to flow during some time, are a unique habitat for freshwater biota that are adapted to these conditions and provide many ecosystem services. Shifts in intermittency patterns, for example due to climate change, are problematic. To quantify streamflow intermittency in all of Europe at a spatial resolution of 15 arc-sec (approx.. 500 m), we developed a machine learning approach that combines daily streamflow observations, the output of a global hydrological model as well as other physiogeographic data to estimate monthly time series of the number of no-flow days.

Daily streamflow observations at initially a selection of initially close to 2000 stations gauging stations across Europe from the SMIRES, GRDC and GSIM databases were used as target for training the ML model. We selected those stations with at least 18 complete (no day is missing) monthly records in the period 1980-2019. Predictors include monthly time series of simulated hydrological indicators at two spatial resolutions, 15 arc-sec (high resolution HR) and 0.5 arc-deg (approx. 50 km, low resolution LR) as well as static HR environmental indicators (e.g. drainage area). The hydrological indicators were derived from the global hydrological model WaterGAP 2.2e. Its native LR output including surface runoff, and groundwater discharge was used for computing HR time series of monthly streamflow across all of Europe. A comparison of streamflow observations shows a reasonable fit to observations. HR hydrological indicators include specific streamflow in current and previous months. Examples for LR hydrological predictors include the groundwater recharge to total runoff ratio and daily streamflow variability with each month.

We considered a sequential statistical modeling approach (in the first stage: binary classification, and in the second stage: multiclass classification) owing to the zero-inflated and imbalanced data issues. In the first stage, a Random Forest (RF) model is built up to classify a binary classification of each month as either intermittent (with at least one no-flow day) or perennial. Then, by taking into account only those stations that were in the first step either predicted or observed to be intermittent, we developed another model to predict four classes of intermittency (e.g. with 1-2, 3-15, 16-27, 28-31 of no-flow days per month). A random oversampling of non-perennial gauging stations was implemented for both stages in order to address the biases in the RF model caused by the class imbalance in the training data. Three cross-validation techniques were applied for estimating the model performance, hyperparameter tuning, and model selection, including non-spatial, spatial, and spatial-temporal cross-validations. Balanced class accuracy, sensitivity, specificity, and precision supported the model selected. The most important predictors for streamflow intermittency will be presented as well as the spatial distribution of the four intermittency classes in Europe (without Russia).

How to cite: Abbasi, M., Trautmann, T., and Döll, P.: Developing a random forest model to quantify streamflow intermittency in Pan-Europe at a spatial resolution of 15 arc-sec, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-4440, https://doi.org/10.5194/egusphere-egu23-4440, 2023.

09:45–09:55

EGU23-11925

ECS

On-site presentation

Forecasting soil moisture on a spatial and temporal scale using machine learning algorithms

Efthymios Chrysanthopoulos, Christos Pouliaris, Ioannis Tsirogiannis, and Andreas Kallioras

While the observations from earth-observing satellites and in-situ weather meteorological and monitoring stations continue to expand, researchers are to deal with an abundance of data and, consequently, with a long modeling procedure. Machine learning algorithms, as universal nonlinear function approximation tools, are effective in analyzing and modeling spatio-temporal environmental data, more efficiently, either in time or in terms of the availability of various variables, than physically-based models.

Soil moisture is an essential climatic parameter, especially for understanding and forecasting variations in surface temperature, precipitation, drought, flood, and the effects of climate change. As a parameter with high spatial and temporal variability, it is a strong necessity for predictive models that embed spatially irregular measurements, which stand for spatially distributed weather meteorological and monitoring stations. To date, most approaches, that have been documented in the literature, model environmental data only at the discrete locations of the monitoring stations.

This research aims to employ a recently proposed methodology, for spatio-temporal prediction of environmental data (Amato et al., 2020), and to propose a new methodology for spatio-temporal prediction of soil moisture in lowland areas, making use of the basic hydrologic premise that precipitation and temperature strongly hinge on topography. The features of the machine learning models used to predict soil moisture within the research area are the meteorological parameters of several agro-meteorological weather stations that have been installed at the site.

The study area is the plain of Arta, located in the Epirus region (NW Greece), and includes the lower part of the watersheds of rivers Aracthos and Louros. The final receptor for upstream surface water and groundwater is a sensitive and complex system of wetlands, the marine ecosystem of Amvrakikos. The research area is receiving a lot of attention because of its agricultural characteristics and water infrastructures (extensive irrigation and drainage network, pumping stations, and hydroelectric dam).

Acknowledgments: This research has been co-financed by the European Union and Greek national funds through the Operational Program Competitiveness, Entrepreneurship and Innovation, under the call SUPPORT OF REGIONAL EXCELLENCE (project code MIS: 5047059).

Amato, F., Guignard, F., Robert, S., & Kanevski, M. (2020). A novel framework for spatio temporal prediction of environmental data using deep learning. Scientific Reports, 10(1), Article 1. https://doi.org/10.1038/s41598-020-79148-7

How to cite: Chrysanthopoulos, E., Pouliaris, C., Tsirogiannis, I., and Kallioras, A.: Forecasting soil moisture on a spatial and temporal scale using machine learning algorithms, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-11925, https://doi.org/10.5194/egusphere-egu23-11925, 2023.

09:55–10:05

EGU23-10575

On-site presentation

Machine learning and hydrological sciences: A systematic overview of review papers

Nilay Dogulu, Adeyemi Oludapo Olusola, and Georgia Papacharalampous

Water sciences have greatly contributed to the proliferation of machine learning in the twenty-first century, especially through engineering hydrology. This process has consequently necessitated transfer of core theory and knowledge of machine learning to the domain of hydrological sciences. In this regard, it is noteworthy that published academic literature played a substantial role in supporting development and learning of hydrologists. Specifically, research articles (and book sections) that review machine learning concepts and algorithms along with their applications in hydrology bolster progress of science by presenting encapsulated information (e.g, in the form of literature synthesis). Despite the rapid increase in the number and scope of such research articles, a systematic understanding of how this line of research publications has evolved with respect to their scientific context, objectives, and methods is still lacking. Hereby, we present an analysis of review papers in hydrology and machine learning based on a systematic search strategy. The overview includes analysis of bibliographic information, review types (objective, focus theme, etc.), review methodologies (narrative, systematic, etc.) as well as thematic context (hydrology subjects and machine learning topics). We believe that our analysis can provide important insights into topics and discussions in hydrology and machine learning that need further exploration by hydrologists. Furthermore, the public online library on Zotero (https://www.zotero.org/groups/4828386/machine_learning_hydrology_review_papers/library) might encourage more participation towards sustainable literature search and active reading on this subject at the intersection of two fundamental disciplines, machine learning and hydrology.

How to cite: Dogulu, N., Olusola, A. O., and Papacharalampous, G.: Machine learning and hydrological sciences: A systematic overview of review papers, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-10575, https://doi.org/10.5194/egusphere-egu23-10575, 2023.

10:05–10:15

Discussion

Coffee break

Chairpersons: Niels Schuetze, Alessandro Amaranto

Machine learning, hybrid models and hydrological models

10:45–10:55

EGU23-14421

ECS

On-site presentation

Modelling and reforecasting real-time reservoir operation and outflow with neural networks: case study of the multi-purpose Sirikit reservoir in Thailand

Chanoknun Wannasin, Claudia Brauer, Remko Uijlenhoet, and Albrecht Weerts

Real-time reservoir operations are highly dependent on decisions made by reservoir operators, which are difficult to simulate accurately with process-based hydrological models. Data-driven models, particularly those based on machine learning (ML), have been shown to be able to overcome the limitations typically encountered in process-based models. Despite a large number of ML studies in reservoir operation modelling, only few studies have focused on ML model performance in real-time reservoir operation and outflow forecasting. This study aims to investigate the capabilities of the Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM) and Gated Recurring Unit (GRU) in simulating and reforecasting real-time (daily) reservoir operation and outflow, considering uncertainties in input data, training-testing periods and different algorithms. A major, multi-purpose reservoir, namely the Sirikit reservoir, in the upper Chao Phraya River basin Thailand was used as the case study. The main inputs for the ML operation models included the daily reservoir outflow, inflow, storage and the month of the year. We applied the distributed wflow_sbm model for inflow simulation (using MSWEP precipitation data) and inflow reforecasting (using ECMWF precipitation data). Daily reservoir storage was obtained from observations and real-time recalculation based on the reservoir water balance. The ML operation models were trained and tested with 10-fold cross-validation. Results show that RNN, LSTM and GRU can reconstruct real-time reservoir operation and provide accurate outflow when training data cover both regular and extreme conditions. For multi-day reforecasting, the model performances are appropriate for the current day up to 2-day lead times for low outflows and up to 6-7 days for high outflows. GRU is potentially the most accurate, robust and convenient model to be used in practice. We conclude that with some further improvements, the ML operation models can be effective and applicable tools to support decision-making for real-time operational water management.

How to cite: Wannasin, C., Brauer, C., Uijlenhoet, R., and Weerts, A.: Modelling and reforecasting real-time reservoir operation and outflow with neural networks: case study of the multi-purpose Sirikit reservoir in Thailand, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-14421, https://doi.org/10.5194/egusphere-egu23-14421, 2023.

10:55–11:05

EGU23-16641

ECS

On-site presentation

Predicting Terrestrial Water Storage Anomalies at the Global Scale with a Machine-Learning Model

Irene Palazzoli, Serena Ceola, and Pierre Gentine

Changes in the level of the world freshwater storage, the Terrestrial Water Storage Anomalies (TWSA), may be induced by natural variability, climate change, and human activities. Since 2002 the Gravity Recovery and Climate Experiment (GRACE) has been measuring the Earth’s gravity field providing estimates of the TWSA at the global scale.

Here, we aim to develop a machine learning model that can reproduce the GRACE monthly time series covering the period between 2002 and 2017 from climate data, identifying to what extent the TWS fluctuations have been caused by climate variability. We used a Long Short-Term Memory (LSTM) neural network trained with meteorological variables (precipitation, air temperature, solar net radiation, snow cover, relative humidity, and leaf area index) and soil properties data (soil porosity, soil texture, and clay, sand, and silt fractions). Our results show that the model is able to consistently reconstruct the observed freshwater anomalies, especially in the humid regions. Furthermore, we observed that as climate change trends are removed from input data, the bias between model predictions and observed data becomes larger, proving the influence of climate change on TWSA.

How to cite: Palazzoli, I., Ceola, S., and Gentine, P.: Predicting Terrestrial Water Storage Anomalies at the Global Scale with a Machine-Learning Model, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-16641, https://doi.org/10.5194/egusphere-egu23-16641, 2023.

11:05–11:15

EGU23-8968

ECS

Virtual presentation

SnoDRI: A Machine Learning Based Index to Measure Snow Droughts

Sinan Rasiya Koya, Kanak Kanti Kar, Shivendra Srivastava, and Tirthankar Roy

Snow plays a significant role in the hydrology of numerous regions across the globe. A major portion of precipitation above 45^O N latitude falls as snow. The accumulated snow melts slowly and contributes to infiltration and runoff processes. Therefore, studying the quantity and fate of water from snowmelt is essential. Reduced snow storage would lead to snow droughts, which can have an enormous impact on the water resources of snow-dominated catchments, such as those in the western United States. For that reason, it is essential to identify the time and severity of snow droughts efficiently. This study proposes SnoDRI, a new index that could identify and measure snow drought events. SnoDRI is a machine learning-based index estimated from several snow-related variables utilizing novel machine learning algorithms. The model uses a combination of mutual information and a self-supervised learning algorithm of an autoencoder. We use random forests for feature extraction for SnoDRI and to assess the importance of each variable. We use NLDAS-2 reanalysis data from 1981 to 2021 for the Western United States to study the efficacy of the new snow drought index. The results are validated by verifying the coincidence of actual snow drought events and the interpretation of our new index. We will discuss how well the new drought index performs and help in better identification of snow droughts.

How to cite: Rasiya Koya, S., Kanti Kar, K., Srivastava, S., and Roy, T.: SnoDRI: A Machine Learning Based Index to Measure Snow Droughts, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-8968, https://doi.org/10.5194/egusphere-egu23-8968, 2023.

11:15–11:25

EGU23-3299

ECS

On-site presentation

A Framework for Water Supply Regulation in Coastal Areas by Avoiding Saltwater Withdrawal Considering Upstream Streamflow Distribution

Haiou Wu, Xinjun Tu, Xiaohong Chen, Vijay P Singh, Leonardo Alfonso, Kairong Lin, Zhiyong Liu, and Rongbiao Lai

Freshwater availability in coastal areas depends on the withdrawal from tidal rivers and is severely threatened by saltwater intrusion, especially in the dry season. Freshwater availability is associated with natural factors and human activities. Although analyses of freshwater availability under saltwater intrusion is problematic, it has received limited attention in the literature. We propose a new framework, i.e. regulation by avoiding saltwater withdrawal (RASW), where the relationships among saltwater intrusion, upstream streamflow, and water supply are established, using hybrid data-driven method coupling wavelet transform and random forest, and considering data on streamflow, tide, wind, salinity of withdrawal stations, capacities of withdrawal projects and reservoirs, and water demand. RASW contains three phases, i.e. estuary salinity-exceedance simulation, upstream streamflow distribution design, and local water supply security analysis. The method is tested on the water supply for Zhuhai-Macao of the Guangdong-Hong Kong-Macao Great Bay Area, South China. Results demonstrate that the salinity-exceedance simulation model using a hybrid data-driven method is quite accurate. The meta-Gaussian copula efficiently simulates the six-dimensional distribution of upstream monthly streamflow and is appropriate for streamflow distribution scenario design. Water supply security benefits greatly from the joint river-reservoir regulation mode. But for a given exceedance frequency of average streamflow, the modes and security situations are diverse, due to various streamflow distributions, i.e. extremely low streamflow and its occurrence time. The proposed framework facilitates integrated decision-making for water supply security in coastal areas. Moreover, the capacities of facilities should be carefully considered according to local conditions, and streamflow distribution design can be utilized as a management tool to regulate water supply system.

How to cite: Wu, H., Tu, X., Chen, X., Singh, V. P., Alfonso, L., Lin, K., Liu, Z., and Lai, R.: A Framework for Water Supply Regulation in Coastal Areas by Avoiding Saltwater Withdrawal Considering Upstream Streamflow Distribution, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3299, https://doi.org/10.5194/egusphere-egu23-3299, 2023.

11:25–11:35

EGU23-3575

ECS

On-site presentation

Hybrid modelling in hydrology by Neural Network-based prediction of conceptual model parameters

Eduardo Acuna, Uwe Ehret, Nicole Bäuerle, and Ralf Loritz

In recent years data-driven techniques, specifically LSTMs, have outperformed conceptual hydrological models for rainfall-runoff prediction. However, even though great progress has been made to explain the internal functioning of the model ((Kratzert, et al., 2019); (Lees, et al., 2022)), their interpretation is still not as straightforward as conceptual models. Additionally, latent variables, different from the target quantity, need postprocessing methods to be extracted. One way to combine the flexibility of data-driven techniques with the interpretability of conceptual models is the use of hybrid models. In our contribution, we will present results from applying a similar technique as (Kraft, Jung, Korner, & Reichstein, 2020) and (Feng, Liu, Lawson, & Shen, 2022), in which an artificial neural network dynamically calculates the parameters of the conceptual model. This approach increases the model flexibility, allows the inclusion of multiple information sources, and compensates for model uncertainty, while maintaining the straightforward interpretability of the conceptual part. In this contribution, we will look at the performance of the hybrid model, analyze the parameter variation over time, and present a technique to avoid parameter cross-compensation.

References

Feng, D., Liu, J., Lawson, K., & Shen, C. (2022). Differentiable, learnable, regionalized process-based models with multiphysical outputs can approach state-of-the-art hydrologic prediction accuracy. Water Resources Research. doi:https://doi.org/10.1029/2022WR032404

Kraft, B., Jung, M., Korner, M., & Reichstein, M. (2020). HYBRID MODELING: FUSION OF A DEEP LEARNING APPROACH AND A PHYSICS-BASED MODEL FOR GLOBAL HYDROLOGICAL MODELING. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 1537--1544. doi:https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-1537-2020

Kratzert, F., Herrnegger, M., Klotz, D., Hochreiter, S., & Klambauer, G. (2019). NeuralHydrology--Interpreting LSTMs in Hydrology. In W. Samek, G. Montavon, A. Vedaldi, L. Hansen, & K.-R. Müller, Explainable AI: Interpreting, Explaining and Visualizing Deep Learning (pp. 347--362). Springer.

Lees, T., Reece, S., Kratzert, F., Klotz, D., Gauch, M., De Bruijn, J., . . . Dadson, S. (2022). Hydrological concept formation inside long short-term memory (LSTM) networks. Hydrology and Earth System Sciences, 3079-3101. doi:https://doi.org/10.5194/hess-26-3079-2022

How to cite: Acuna, E., Ehret, U., Bäuerle, N., and Loritz, R.: Hybrid modelling in hydrology by Neural Network-based prediction of conceptual model parameters, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3575, https://doi.org/10.5194/egusphere-egu23-3575, 2023.

11:35–11:45

EGU23-8499

On-site presentation

A hybrid method to tackle conditional systematic errors of hydrological models

Evangelos Rozos

Machine learning has been used in hydrological applications for decades. Recent studies, after a systematic comparison, have shown that machine learning models (more precisely, deep learning with thousands of nodes) can outperform even the most sophisticated physically based models. Furthermore, one of the basic criticisms, that machine learning produces black box models, has been addressed by researchers, who have indicated how this black box can be made transparent to obtain explainable/interpretable results. However, the main disadvantage of the machine learning approaches (especially deep learning, which may employ hundreds of thousands of parameters) remains the CPU-intensive training process. This disadvantage can be overcome by employing hybrid modelling frameworks that combine simple machine learning models with parsimonious hydrological models. The drawback of these parsimonious approaches is the susceptibility of the latter to conditional systematic errors, which propagate through the modelling framework and cannot be eliminated by simple machine learning networks (employing complex networks would nullify the sought benefit of reduced CPU times). In this study, we suggest methods to cope with this kind of error and achieve a modelling performance close to the best achievable with the available data.

How to cite: Rozos, E.: A hybrid method to tackle conditional systematic errors of hydrological models, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-8499, https://doi.org/10.5194/egusphere-egu23-8499, 2023.

11:45–11:55

EGU23-13200

ECS

On-site presentation

Adaptive Mesh Refinement of 3D saturated-unsaturated hydrological and transport models

Gillien Latour, Pierre Horgue, Romain Guibert, François Renard, and Gérald Debenest

Unsaturated water flows at watershed scale or Darcy-scale are generally described by the Richardson-Richards equation. This equation is highly non-linear and simulation domains are limited by computational costs. The porousMultiphaseFoam toolbox, is a Finite Volume tool capable of modeling multi-phase flows in porous media, including the solving of the Richardson-Richards equation combined with the transport equation. As it has been developed using the OpenFOAM environment, the software is natively fully parallelized and can be used on super computers. Despite all its perks, the toolbox still suffer from expensive computational cost in 3D, highlighted by a calibration method developed last year to validate the 3D model against the historic 2D and 1D+2D models. In an attempt to reduce those, Adaptive Mesh Refinement (AMR) method has been included into the toolbox. The main issue faced by the method is the highly anisotropic mesh, with cells having horizontal characteristic lengths up to 100 times bigger than the vertical one. We present here the results obtained using the AMR, and tools to evaluate its efficiency on anisotropic meshes. Particularly, we take interest in the total CPU time, the mesh size and the number of linear iteration required to solve the problem. We show results using both 3D hydrological and transport solvers.

How to cite: Latour, G., Horgue, P., Guibert, R., Renard, F., and Debenest, G.: Adaptive Mesh Refinement of 3D saturated-unsaturated hydrological and transport models, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-13200, https://doi.org/10.5194/egusphere-egu23-13200, 2023.

11:55–12:05

EGU23-2910

ECS

On-site presentation

Thermodynamic integration via Replica Exchange Hamiltonian Monte Carlo for faster sampling and model comparison

Damian M. Ndiwago, Remko Nijzink, Christophe Ley, Stanislaus J. Schymanski, and Jack S. Hale

Hydrologists often need to choose between competing hypotheses or weight the predictions of different models when averaging models. Several criteria for choosing and weighting models have been developed, which balance model complexity and goodness of fit by penalising the number of model parameters. The penalty is explicit for information theory approaches or implicit for Bayesian model selection based on marginal likelihood and, by extension, the Bayes factor. The Bayes factor is the ratio of the marginal likelihoods of two competing models. Also, the Bayes factor can be used for non-nested models in contrast to information-theoretic approaches. However, marginal likelihood estimation is computationally intensive and slow for dynamic models with multiple modes. This study uses Replica Exchange Hamiltonian Monte Carlo and thermodynamic integration for fast, simultaneous calculation of marginal likelihood and parameter identification of dynamic rainfall-runoff models. Using synthetic data, the method selected the true model in our numerical experiments. The technique was also applied to real data from Magela Creek in Australia. The selected model was not the model with the highest or lowest number of parameters for real data. The method is implemented using the differentiable programming software ''TensorFlow Probability". This implementation can be applied to other types of models for fast simultaneous parameter estimation and model comparison.

How to cite: M. Ndiwago, D., Nijzink, R., Ley, C., J. Schymanski, S., and S. Hale, J.: Thermodynamic integration via Replica Exchange Hamiltonian Monte Carlo for faster sampling and model comparison, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2910, https://doi.org/10.5194/egusphere-egu23-2910, 2023.

12:05–12:15

EGU23-2543

ECS

On-site presentation

Urban non-point source pollution modelling: A physics-informed neural network approach

Sijie Tang, Yin Wan, Fangze Shang, Shuo Wang, and Jiping Jiang

Over the past decades, urban non-point source (NPS) pollution has been the most severe threat to the urban water environment. The sharp increase of impervious surface and the high level of particulate matter from massive human activities exacerbated the water quality of surface runoff, leading to the significant urban NPS pollution globally whereby it is of importance to have a deep knowledge on the accumulation and transport of pollutants. A series of traditional physical models have been developed to simulate the runoff generating as well as the NPS pollution. However, a disadvantage of process-based modelling is its great demand for a large amount of field data which may normally be inaccessible, as well as the demand for the expertise in applying appropriate modelling method on specific study area. Empirical models do not characterize complex physical processes of NPS pollution and thus require fewer data and modelling skills. Nevertheless, the limitation is that these modelling approaches are region-sensitive and spatially untransferable. It is challenging to fill the gap between the requirement of urban water environment management and existing modelling performance on NPS pollution, in the absence of a more effective model with high accuracy, easy employment, and spatial transferability.

Machine learning approach has been utilized in environmental studies for decades, and was originally believed to be a black box that can barely provide any physical insight into environmental processes. However, an approach named physics-informed neural networks (PINN) was proposed lately and then applicated in dynamical system. This approach embeds differential equations of priori knowledge into neural network to make modelling interpretable and generalizable. In this study, a physical process embedded LSTM network was proposed to formulate the cumulation and transport of urban NPS pollution in rainfall runoff, based on the coupling of LSTM and differential equations of classic exponential build-up/wash-off processes. Water quality data of urban runoff from sampling and continuous real-time monitoring campaigns distributed in China, USA and New Zealand were collected and fed into proposed network to model the primary NPS pollutant TSS. The results revealed that the hybrid PINN model excels the vanilla LSTM approach and auto-calibrated SWMM approach in accuracy and convenience. The interpretable model also enhanced the cross-catchment transferability of model for urban water management in data-poor area. In addition, the trained parameters of network units were found consistent with the prior knowledge of accumulation and transport of NPS pollutants, indicating the deep coupling of neural network and physical process. As a very early case of hybrid AI modelling in urban NPS pollution, this study provided a new perspective on water quality modelling and can help in improving the standards of urban environment governance.

How to cite: Tang, S., Wan, Y., Shang, F., Wang, S., and Jiang, J.: Urban non-point source pollution modelling: A physics-informed neural network approach, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2543, https://doi.org/10.5194/egusphere-egu23-2543, 2023.

12:15–12:25

Discussion

Lunch break

Chairpersons: Alessandro Amaranto, Claudia Bertini

Data analysis, optimisation and Information Theory

14:00–14:10

EGU23-641

ECS

Virtual presentation

Application of Enhanced Search Technique: Chaos-Directed Genetic Algorithm in Optimal Design of Water Distribution Network

Seelam Naga Poojitha and Vinayakam Jothiparakash

The water distribution network (WDN), a vital component of the water supply system, is an essential urban infrastructure distributing potable water to society. Its design, a non-deterministic polynomial-hard problem, has been a widely studied complex research problem for decades, with various optimization models proposed for its optimal design. Recent advancements in enhancing the computational efficiency of stochastic optimization algorithms by introducing chaotic force have elevated the scope of formulating chaos-directed evolutionary algorithms (EAs). The present study proposes one such approach, the chaos-directed genetic algorithm (CDGA) model, to improve the search mechanism of the genetic algorithm (GA) in solving the complex optimization problem of WDN optimal design.

In one of our recent works, the influence of chaotic maps with high-dimensionality, the Henon and Lorenz maps, are explored and compared to the low-dimensional Logistic map in improving the performance of GA. With the one-dimensional Logistic map demonstrating better computational improvement of GA, the present study considers it for formulating the CDGA model. The Logistic map is a non-linear first-order difference equation. Its dynamics evolve into various possible states of system range without repetition. For the search mechanism of the optimization technique to explore different regions of search space, this particular characteristic forms the most favorable feature. Consequently, by incorporating the chaotic force of the Logistic map into GA’s evolutionary mechanism by replacing every random search phenomenon, the CDGA model is formulated. A novel method of non-sequential allocation of chaotic dynamics is employed to induce chaotic force. Notably, the method is unique, using the same initial characteristics of the Logistic equation, retaining the chaos ergodicity for the evolutionary search.

To demonstrate the computational efficiency of the CDGA model, the enormously studied benchmark problem, the Hanoi network (HN), is considered. HN is a 34-dimensional problem having a complex search space with multiple locally optimal solutions. Defining the WDN optimization problem as the single-objective design framework subjected to linear and non-linear constraints of governing laws, the principal objective is to minimize the investment cost of HN pipes. While minimizing the pipe investment cost, the constraints levied ensure that the HN is hydraulically adequate to deliver the design demands. Thus, the optimization model formulated is the integrated framework with the simulation tool to simulate the WDN's hydraulic conditions. The code for the CDGA model is written in MATLAB R2015a and combined with the simulation software, EPANET 2.0, using the EPANET-MATLAB toolkit.

The computational results demonstrate the convergence precision of the CDGA model over its traditional GA, converging to the optimal cost of 6,081,564 units, the previous best solution reported for HN in the literature. Moreover, it outperforms many stochastic optimization models reported in the literature with computational efficiency in solving HN, particularly simulated annealing, shuffled complex, shuffled frog leaping, ant colony, particle swarm, harmony search, krill herd, and cuckoo search algorithms. Hence, from the results, the study suggests formulating chaos-directed optimization algorithms to improve their traditional model's computational efficiency in solving complex optimization problems.

Keywords: Water distribution network optimal design; Evolutionary algorithms; Genetic algorithm; Chaotic maps; Logistic equation; Hanoi network

How to cite: Poojitha, S. N. and Jothiparakash, V.: Application of Enhanced Search Technique: Chaos-Directed Genetic Algorithm in Optimal Design of Water Distribution Network, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-641, https://doi.org/10.5194/egusphere-egu23-641, 2023.

14:10–14:20

EGU23-5491

ECS

On-site presentation

Assessing water observation network settings by hydro-geological sub-sampling of a large data set for Sweden

Ellen Gute, Luisa Ickes, and Ilias Pechlivanidis

Developing cost-effective methods for hydrological observations is an identified research objective of the WMO Hydrological Research Strategy 2022-2030. Network settings of hydrological observational networks are central for data collection and monitoring efforts to fulfill observation needs. In this work, we ask the question: Where and how densely (location and time) measurement stations need to be placed to gain sound scientific insights into hydro-meteorological conditions of a region?

We address this question through information theory concepts and calculate entropy, joint entropy, and mutual information for an existing large dataset of hydro-meteorological parameters. The dataset spans 36 years (1981-2017) of daily data for Sweden based on the national S-HYPE hydrological model. Hydrological data include runoff, inflow, and streamflow as computed values and meteorological data encompass temperature and precipitation as measured and corrected data. We chose Sweden as a study domain to look at a Nordic region with a large number of water basins and an overall well-sampled region allowing to assess interesting network settings through sub-sampling.

Sub-sampling and analysis for potential network settings is done for the seven hydrological clusters across Sweden as they are defined in Girons Lopez et al. (2021). Random sub-sampling (by 10%, 25%, 50%, 75%, and 90%) of each of the seven clusters shows a narrow range of (Shannon) entropy indicating excellent assignment of catchments to the seven clusters.

Focusing on three clusters, which span Sweden’s North-South extend and mainly feature forested areas and a cluster covering mostly coastal lakes, we assess how much information remains in the data set if sub-sampled by hydro-geological parameters, such as baseflow and flashiness. Such tests allow us to determine ideal and minimum network settings with respect to observational and computational efforts based on different criteria relevant to scientific investigations and decision-making needs.

Girons Lopez, M., Crochemore, L., and Pechlivanidis, I. G.: Benchmarking an operational hydrological model for providing seasonal forecasts in Sweden, Hydrol. Earth Syst. Sci., 25, 1189–1209, https://doi.org/10.5194/hess-25-1189-2021, 2021

How to cite: Gute, E., Ickes, L., and Pechlivanidis, I.: Assessing water observation network settings by hydro-geological sub-sampling of a large data set for Sweden, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-5491, https://doi.org/10.5194/egusphere-egu23-5491, 2023.

14:20–14:30

EGU23-3428

ECS

On-site presentation

Calibration of the SWAT Hydrological Model with the Particle Swarm Optimization Technique

Fatmanur Çakır, Alper Elçi, and Melis Somay-Altaş

Hydrological models are important tools for management of water resources at the basin scale. However, the outputs of these models might come with significant inaccuracies, often due to uncertainties in model parameters. One of the biggest challenges in working with a hydrological model is that these models require rigorous calibration, validation, and uncertainty analysis. In recent years, there has been an increase in the use of heuristic optimization techniques in water resources research. These techniques can yield more accurate and more reliable modeling results by searching the global optimum of multiple model parameter sets.

This study describes the application of a heuristic optimization method, the Particle Swarm Optimization (PSO), on a hydrological model, the Soil and Water Assessment Tool (SWAT). The model is applied to the Fetrek Stream watershed in western Turkiye, which is under environmental stress due to excessive groundwater abstraction and pollution from numerous wastewater discharges. The model includes data related to 41 point sources, and two inflowing tributaries. The model is configured with 8 sub-basins, and 484 hydrologic response units. Hydrological fluxes are obtained for a 30-year simulation period. The sensitivities of the model parameters and uncertainties of model results are investigated. PSO is used to calibrate sensitive model parameters, followed by a comparison with the calibration outcome using the SUFI-2 (Sequential Uncertainty Fitting) algorithm, which is the usual choice in calibrating SWAT models. The performances of both optimization approaches are evaluated with the Kling-Gupta Efficiency (KGE), the regression coefficient (R²), and the bias percentage (PBIAS) Model results are presented with their associated prediction uncertainties in the form of the so-called p-factor and r-factor statistics, which represent envelopes of good model solutions. The results show that the PSO approach can achieve satisfactory results on a monthly time-scale thereby offering an alternative calibration with less parameters and a wider interval of the 95% prediction uncertainty.

This study is supported by the PRIMA program under grant agreement No: 2024 Project TRUST (management of industrial Treated wastewater ReUse as mitigation measures to water Scarcity in climaTe change context in two Mediterranean regions). The PRIMA program is supported by the European Union.

How to cite: Çakır, F., Elçi, A., and Somay-Altaş, M.: Calibration of the SWAT Hydrological Model with the Particle Swarm Optimization Technique, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3428, https://doi.org/10.5194/egusphere-egu23-3428, 2023.

14:30–14:40

EGU23-3826

ECS

On-site presentation

Influence of large-scale climate modes and local hydrometeorological factors in predicting basin scale rainfall and streamflow: A Bayesian network approach

Prabal Das and Kironmala Chanda

This article reports the findings of a recent study by Das and Chanda (2022), wherein a Bayesian Network (BN) approach was applied to analyze the influence of large-scale climate modes and local hydro-meteorological variables on streamflow and rainfall in four river basins in India. Bayesian Networks (BN) offers a thorough conditional independence structure that can improve comprehension and forecasting of hydroclimatic systems. This served as the main impetus for the work, which explored the relative contributions of large-scale climate modes and local hydro-meteorological variables for the prediction of rainfall and streamflow at the basin scale. Once the conditional independence structure is developed, variables possessing a ‘directed arc’ from the target variable were selected as the potential predictors for developing the prediction models. The results showed that the most important predictors for streamflow were rainfall, u-wind, and soil moisture, while the most important predictors for rainfall were u-wind, air temperature, geo-potential height, precipitable water, vertical velocity, and relative humidity. The analysis also revealed that the influence of large-scale climate modes on the target variables was generally insignificant, except for the Pacific Decadal Oscillation and El-Niño Southern Oscillation. Furthermore, the network structure showed that about 87 and 97% of the initial inputs are redundant. The accuracy of the prediction models are comparable across all of the basins and is higher for rainfall (Refined index of agreement (MD) ranging from 0.61 to 0.81) than for streamflow (MD ranging from 0.61 to 0.78). The study also found that dry, intermediate, and wet months can be satisfactorily classified using two drought indices, the Standardized Drought Index (SDI) for streamflow and the Standardized Precipitation Anomaly Index (SPAI) for rainfall.

Keywords: Large-scale climate modes, local hydro-meteorological variables, Bayesian Networks (BN), Standardized Drought Index (SDI), Standardized Precipitation Anomaly Index (SPAI)

Reference:

Das P, Chanda K (2022) A Bayesian network approach for understanding the role of large-scale and local hydro-meteorological variables as drivers of basin-scale rainfall and streamflow. Stoch Environ Res Risk Assess 2:. https://doi.org/10.1007/s00477-022-02356-2

How to cite: Das, P. and Chanda, K.: Influence of large-scale climate modes and local hydrometeorological factors in predicting basin scale rainfall and streamflow: A Bayesian network approach, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3826, https://doi.org/10.5194/egusphere-egu23-3826, 2023.

14:40–14:50

EGU23-16978

ECS

On-site presentation

A new software for spatio-temporal analysis of gridded data sources

Mauricio Zambrano-Bigiarini and Sebastián Bernal Vallejos

Several long-term gridded datasets have become available in the last decades on a global scale, at increasing spatial and temporal resolution, with low latency times. These datasetshave opened new opportunities to advance Earth Sciences modelling studies at different spatial and temporal scales, especially in poorly gauged areas. However, working with (hundreds of) thousands of raster time series (e.g., for (sub)daily precipitation), usually in different vectorial and raster formats, impose high computational challenges to efficiently analyse all the gridded datasets.

In this work we introduce a new R package for easy processing and analysis of raster time series, to bring the use of gridded data closer to the Earth Sciences community. This package expands the large number of spatial functions provided by the terra package by taking advantage of the time attribute of raster objects. A particular emphasis of the package is exploring and comparing gridded datasets of hydrological variables with different time frequencies (e.g., sub-hourly, hourly, daily, monthly, seasonal, annual).

General purpose functions include temporal subsetting, resampling, cropping, extracting time series for points or polygons, comparing two datasets using summary statistics, and exporting a raster time series as a collection of daily/monthly/annual files, each one of them with several layers of a higher temporal frequency. Also, temporal aggregation is possible from sub-hourly to hourly/daily/weekly/monthly/annual, among others. Most functions can take advantage of multi-core computers and network clusters, to reduce the computational burden.

To illustrate the use of this new package, we compared different state-of-the-art gridded precipitation products (CHIRPSv2, CMORPH v1.0, IMERGv06B, MSWXv1.0, ERA5, ERA5-Land, CR2METv2.5), all of them with different data formats and spatial resolutions, using continental Chile as a case study. However, based on the package's flexibility and ease of use, we hope the broader community of hydro-scientists and water-engineers will use it to visualise the spatio-temporal variation of key hydrological/environmental variables,to carry out time series analysis, to combinedifferent types of models and data sources, and to improve our integrated knowledge of the water cycle.

How to cite: Zambrano-Bigiarini, M. and Bernal Vallejos, S.: A new software for spatio-temporal analysis of gridded data sources, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-16978, https://doi.org/10.5194/egusphere-egu23-16978, 2023.

14:50–15:00

EGU23-16938

ECS

On-site presentation

Assessment of GPM IMERG Satellite Precipitation Estimationunder Complex Climatic and Topographic Conditions

Chengxuan Lu

Satellite precipitation estimation provides crucial information for those places lacking rainfall observations from ground–based sensors, especially in terrestrial or marine areas with complex climatic or topographic conditions. This is the case over much of Western China, including Upper and Middle Lancang River Basin (UMLRB), an extremely important transnational river system in Asia (the Lancang–Mekong River Basin) with complex climate and topography that has limited long–term precipitation records and high–elevation data, and no operational weather radars. In this study, we evaluated three GPM IMERG satellite precipitation estimation (IMERG E, IMERG L and IMERG F) over UMLRB in terms of multi–year average precipitation distribution, amplitude consistency, occurrence consistency, and elevation–dependence in both dry and wet seasons. Results demonstrated that monsoon and solid precipitation mainly affected amplitude consistency of precipitation, aerosol affected occurrence consistency of precipitation, and topography and wind–induced errors affected elevation dependence. The amplitude and occurrence consistency of precipitation were best in wet seasons in the Climate Transition Zone and worst in dry seasons in the same zone. Regardless of the elevation–dependence of amplitude or occurrence in dry and wet seasons, the dry season in the Alpine Canyon Area was most positively dependent and most significant. More significant elevation–dependence was correlated with worse IMERG performance. The Local Weighted Regression (LOWERG) model showed a nonlinear relationship between precipitation and elevation in both seasons. The amplitude consistency and occurrence consistency of both seasons worsened with increasing precipitation intensity and was worst for extreme precipitation cases. IMERG F had great potential for application to hydroclimatic research and water resources assessment in the study area. Further research should assess how the dependence of IMERG’s spatial performance on climate and topography could guide improvements in global precipitation assessment algorithms and the study of mountain landslides, floods, and other natural disasters during the monsoon period.

How to cite: Lu, C.: Assessment of GPM IMERG Satellite Precipitation Estimationunder Complex Climatic and Topographic Conditions, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-16938, https://doi.org/10.5194/egusphere-egu23-16938, 2023.

15:00–15:10

EGU23-4028

ECS

Virtual presentation

The spatial-temporal variations of rainfall-streamflow linkage across North America: A functional data analysis approach

Ali Ameli, Joseph Janssen, Shizhe Meng, Jiguo Cao, and William Welch

Achieving improved predictions in ungauged-basins or inferring the effects of climate and land-use changes on streamflow requires hydrologists to first learn the underlying mechanisms behind streamflow generation in gauged-basins. One way to characterize streamflow generation is by quantifying how catchments filter rainfall into streamflow. A simple and popular technique that displays the rainfall-streamflow linkage is the unit hydrograph. Though one could characterize and classify catchments based on their unit hydrographs, this approach implicitly implies that the function and response-time that link rainfall to streamflow are time-invariant. The celerity, and the function that links rainfall to streamflow in a given catchment, could vary from catchment to catchment as well as from season to season. This is primarily due to variations in antecedent wetness, temperature, vegetation transpiration and the ways climatic factors interact with biophysical factors, over time and over space. In this study, we utilize sparse historical functional linear models to quantify the time-variant rainfall-streamflow response function, across hundreds of catchments in North America. The function reflects the temporally varying relationship between rainfall and streamflow and can be used to infer temporally varying response times. We then attempt to relate catchment characteristics such geology, climate, and topography to the characteristics of rainfall-streamflow response function and response time, spatially and temporally. We argue that our study extracts generalizable and robust process understanding in a novel data-driven manner.

How to cite: Ameli, A., Janssen, J., Meng, S., Cao, J., and Welch, W.: The spatial-temporal variations of rainfall-streamflow linkage across North America: A functional data analysis approach, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-4028, https://doi.org/10.5194/egusphere-egu23-4028, 2023.

15:10–15:20

EGU23-4297

ECS

On-site presentation

Investigating the change of hydrological patterns of streamflow by using HDCE method

ChunTa Wen and Jiing-Yun You

The change of streamflow patterns is one of the important information to water resources management. Especially, in the last few years, climate changes have not only caused increases in the intensity and frequency of extreme hydrological events, but also disrupted the monotony of climate which could lead to unexpected consequences and economic losses to our society. However, only a little research has paid attention to the change in patterns or schemes of the hydrological cycle. The issue is even more serious in Taiwan due to the uneven spatial-temporal distribution of rainfall. This research aims to discover the change in patterns in Taiwan. We proposed a HDCE framework which is composed of Hierarchical cluster, Dynamic time warping, Change point detection, and Empirical mode decomposition. With this framework, we apply the hierarchical cluster with different distance matrices, and obtain the optimal clustering number and linkage method according to clustering valid indexes. Dynamic time warping is used as the measure of distance to investigate the pattern of time series By this way, this framework determines the optimal cluster of patterns for the historical inflow data. After clustering, the AMOC (At Most One Change) is used to analyze the structure of the pattern. With AMOC, the change time point in each period is examined under the structure of each cluster. In the end, the empirical mode decomposition is adopted to determine the trend of the pattern change. With the proposed framework, this research applies these schemes to main inflow observation gage stations in Taiwan, and the results demonstrate that the groups and activities of positions of the stations indirectly affect the pattern of the inflow values, instead, the clusters formed are mainly affected by the region and geographical area of the locations. Furthermore, we will explain the results showing the different trends in changing time in each region and the correlation of breaks at each station. In this way, the results of HDCE not only examine the occurrence of droughts but provide information that is useful to develop the strategy to reduce the loss through better water management.

How to cite: Wen, C. and You, J.-Y.: Investigating the change of hydrological patterns of streamflow by using HDCE method, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-4297, https://doi.org/10.5194/egusphere-egu23-4297, 2023.

15:20–15:30

EGU23-9056

ECS

On-site presentation

Information entropy for assisting decision-making for critical events in surface water quality management

Tianrui Pang, Jiping Jiang, Leonardo Alfonso, Peng Wang, and Tong Zheng

The information entropy method, based on mass sampling data processing, has been widely applied in environment monitoring and management. However, previous efforts have been mainly limited to the optimization of stations in discrete space with no association to critical events and their associated temporal scale. In particular, further research on integrating water quality monitoring under critical events (such a spillway accident) and the related cost-benefit analysis of environment management decisions are needed. In this study, we give an entropy-based paradigm of water quality reaction criteria R, which is analogous to the definition of Gibbs free energy (ΔG) in thermodynamics. Then we propose a systematic framework of entropy prisms (HPrisms) with four entropy indexes: dilution index (E), flux index (F), spatial entropy index (G_x) and temporal entropy index (G_t). They describe the pollutants transport process in water bodies from different perspectives, facing different water environmental management decisions. The corresponding reaction criteria of these four entropy indexes for different water quality management scenarios are defined for different spatiotemporal scales where different criteria are applicable. The method has value in emergency monitoring in rivers and lakes, useful for anomaly detection, key point identification and other water environment management scenarios. This study is a generic theoretical framework so far, and we will present specific critical cases for management reaction criteria to find the quantitative relationship between reaction criteria with information entropy indexes.

How to cite: Pang, T., Jiang, J., Alfonso, L., Wang, P., and Zheng, T.: Information entropy for assisting decision-making for critical events in surface water quality management, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-9056, https://doi.org/10.5194/egusphere-egu23-9056, 2023.

15:30–15:40

Discussion

15:40–15:45

Session closure

Posters on site: Tue, 25 Apr, 08:30–10:15 | Hall A

Chairperson: Niels Schuetze

A.33

EGU23-774

ECS

Synergizing Machine Learning with Conceptual Model for Daily Reservoir Storage Predictions in the Data-scarce catchments

Pravin Bhasme and Udit Bhatia

Reservoirs play a crucial role in water resources management. However, quantifying reservoir storage prediction is challenging, especially when reservoir inflow data is unavailable. In recent years, Machine Learning (ML) models have shown their successful application in hydrological predictions, although these models are criticized for their inability to follow physical constraints. On the other hand, conceptual models are applied widely in various hydrological studies due to their simplistic structure yet inclusive of various hydrological processes. However, these conceptual models show limited predictive skills. Thus, synergizing domain knowledge from a conceptual model with the predictive ability of the ML model can help for better physical consistent outputs. We developed the Physics Informed Machine Learning (PIML) model for reservoir storage predictions. This model combines the predictability of Long Short Term Memory (LSTM) with domain understanding of the conceptual (SIMHYD) model. The applicability of the PIML model is demonstrated on two United States reservoirs where reservoir inflow data is unavailable. Our results show that the PIML model outperforms the SIMHYD model in the reservoir storage predictions while being mindful that reservoir storage will not be more than the maximum storage capacity. This study may be helpful in better-informed reservoir operation in the data-scarce catchments.

How to cite: Bhasme, P. and Bhatia, U.: Synergizing Machine Learning with Conceptual Model for Daily Reservoir Storage Predictions in the Data-scarce catchments, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-774, https://doi.org/10.5194/egusphere-egu23-774, 2023.

A.34

EGU23-844

ECS

Streamflow Estimation in Ungauged Catchments in Brazil using Machine Learning Approaches

Rafael Barbedo, Mino Sorribas, and Walter Collischonn

Knowing river flows in space and time is fundamental for several hydrological and environmental applications. One of the greatest challenges in hydrology, however, is having this information at every river stretch, as we can only focus our resources in obtaining measurement at particular sites. Several research initiatives have been developed over the next years to address this problem, a notorious one being the prediction in ungauged basins (PUB) by the International Association of Hydrological Sciences (IAHS).

One of the most used approaches for PUB is using catchment descriptors – such as elevation, slope, land cover, and soil types – in statistical (data-driven) models to estimate hydrological signatures – such as mean annual streamflow, flow-duration curves, and high/low flows. There is a wide range of statistical methods that can be used in this regard, either by grouping catchments of similar characteristics and applying regression equations, using geospatial interpolation techniques, among others. In recent years, regression techniques based on Machine Learning (ML) approaches have been extensively developed, presenting great results in all areas of knowledge. In hydrological sciences, particularly for PUB, the potential of using these techniques is enormous, and yet, they have not been much explored.

In this context, we’ve built a ML regression modelling pipeline to estimate mean annual flows and low flows, and tested it in several different catchments covering the whole of Brazil, using different models to compare the results. The pipeline consists in (1) collecting environmental data for the catchments, (2) selecting the best descriptors, (3) tunning the hyperparameters of the ML model, (4) evaluating the performance of the model, (5) computing the importance of the predictors, and (6) assessing the uncertainty of the estimations. Also, the pipeline is model-independent, i.e., it can be applied to any ML regression model.

We evaluated results against consistent streamflow data from 1069 gauges spread across the country that cover distinct characteristics, using 100-fold cross validation, obtaining R² scores of ~0.8 for mean annual flows and ~0.7 for low flows, for all ML models except multiple linear regression, which didn’t present good results. Average and low precipitations were the main drivers for predicting both flow variables, although using these alone didn’t yield in good metrics. Other important predictors were linked to soil types, land cover, wetlands, and drainage density. We extrapolated the results to all catchments in Brazil, along with uncertainty estimations.

Acknowledgement:

The authors would like to acknowledge the financial support provided by the Brazilian Coordination for the Improvement of Higher Education Personnel (CAPES) and the Brazilian National Water and Sanitation Agency (ANA), the latter in the context of the project "Technological Cooperation for Hydrological Assessments in Brazil" (grant number: TED-05/2019-ANA). Additional acknowledgements to the Google LLC for making available the Google Earth Engine (GEE) platform, and all data providers for the global products used in this study.

How to cite: Barbedo, R., Sorribas, M., and Collischonn, W.: Streamflow Estimation in Ungauged Catchments in Brazil using Machine Learning Approaches, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-844, https://doi.org/10.5194/egusphere-egu23-844, 2023.

A.35

EGU23-1143

ECS

Reconstructing floods from large-scale atmospheric variables with neural networks in high latitude climates

Jenny Sjåstad Hagen, Ramin Hasibi, Etienne Leblois, Deborah Lawrence, and Asgeir Sorteberg

Climate change is expected to alter the occurrence of floods in high latitude countries; evidence of earlier spring floods and more frequent rainfall-driven floods has already been detected in Norway. While the state-of-the-art hydrological climate-impact model chain embeds explicit assumptions about stationarity, machine learning offers a complementary approach to hydrological climate-impact modelling by facilitating direct downscaling from large-scale atmospheric variables to streamflow, thus making downscaling and bias-correction implicit. While applications of machine learning algorithms for streamflow and flood modelling are well documented in the scientific literature, few studies have linked large-scale atmospheric variables directly to streamflow without including observed streamflow as part of the input variable selection. Such autoregressive models have limited application for climate-impact studies, as future streamflow is yet to observe. Furthermore, most studies linking large-scale atmospheric forcing to catchment response have focused on monthly, seasonal, or annual streamflow. This study presents the application of feed-forward and recurrent neural networks for daily streamflow and flood reconstruction from atmospheric reanalysis data with comparable spatiotemporal resolution to global climate model outputs. Two widely applied neural network types, namely multilayer perceptron (MLP) and long short-term memory (LSTM), were benchmarked against gradient boost regression tree models. Catchment-specific, physically-based input variable selections representing the dominant flood-drivers were identified for 27 catchments in Norway. The selected catchments have low degrees of basin development and anthropogenic influence so that the established statistical links only reflect the forcing-response relationship between the atmosphere and the catchments. Overall, the LSTM obtained the highest accuracy, with a median Nash Sutcliffe Efficiency (NSE) of 0.88 on the training set (1950-2000) and 0.76 on the testing set (2006-2010). However, the MLP proved more robust, with a smaller drop in NSE from training (0.76) to testing (0.72), indicating that further restricting the input variables based on hydrological theory and physical interpretability may increase the robustness of neural networks in the context of daily streamflow modelling. The median NSE of the regression tree models was lower on both the training set (0.73) and the testing set (0.66). The results point to the potential of neural networks for hydrological climate-impact modelling in catchments where both snowmelt and rainfall constitute flood-drivers in the present climate. This research provides a springboard for future studies employing neural networks for hydrological climate-impact modelling in high latitude countries. Future research should assess the potential for regionalization by including catchment characteristics through clustering techniques like Kohonen Self-Organizing Maps.

How to cite: Hagen, J. S., Hasibi, R., Leblois, E., Lawrence, D., and Sorteberg, A.: Reconstructing floods from large-scale atmospheric variables with neural networks in high latitude climates, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-1143, https://doi.org/10.5194/egusphere-egu23-1143, 2023.

A.36

EGU23-1540

ECS

Predicting estuarine salt intrusion with a long short-term memory model

Bas Wullems, Claudia Brauer, Fedor Baart, and Albrecht Weerts

Estuarine salt intrusion causes problems with freshwater availability in many deltas. For water managers to mitigate and adapt to salt intrusion, they require timely and accurate forecasts. Data-driven models derived with machine learning can help with this, as they can mimic complex non-linear systems and are computationally very efficient. We set up such a model for salt intrusion in the Rhine-Meuse delta. The model predicts chloride concentrations at Krimpen aan den IJssel, an important location for freshwater provision. As input features, we selected observations of water level, discharge, chloride concentration and wind speed. We then used the Boruta algorithm to select a subset of relevant features. We set up a Long Short-Term Memory network (LSTM) to make predictions of chloride concentrations one day ahead and ran the resulting model multiple times to simulate a multi-day forecast. This model predicts baseline concentrations and peak timing well, but peak height is underestimated, a problem that gets worse with increasing lead time. Because this model is reasonably successful, we aim to extend it to other locations in the delta. We also expect a similar setup can work in other deltas, especially those with a similar or simpler geometry. A more complete version of this model should finally be made suitable for use in an operational forecasting system.

How to cite: Wullems, B., Brauer, C., Baart, F., and Weerts, A.: Predicting estuarine salt intrusion with a long short-term memory model, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-1540, https://doi.org/10.5194/egusphere-egu23-1540, 2023.

A.37

EGU23-1819

Classifier-assisted level-based learning evolutionary search for heat extraction optimization of enhanced geothermal systems

Guodong Chen, Jiu Jimmy Jiao, and Xin Luo

Enhanced geothermal systems are essential to provide sustainable and long-term geothermal energy supplies and reduce carbon emissions. Optimal well-control scheme for effective heat extraction and improved heat sweep efficiency plays a significant role in geothermal development. However, the optimization performance of most existing optimization algorithms deteriorates as dimension increases. To solve this issue, a novel surrogate-assisted level-based learning evolutionary search algorithm (SLLES) is proposed for heat extraction optimization of enhanced geothermal systems. SLLES consists of classifier-assisted level-based learning pre-screen part and local evolutionary search part. Specifically, the classifier-assisted level-based learning strategy employs probabilistic neural network as the classifier to classify the offspring into pre-set number of levels. The offspring in different levels uses level-based learning strategy to generate more promising and informative candidates pre-screened by classifier to conduct real simulation evaluations. In the local evolutionary search part, a surrogate model is constructed at the local promising area. The optimum of the surrogate model obtained by the optimizer is selected to conduct real simulation evaluations. The cooperation of the two parts is able to achieve balance between the exploration and exploitation during the optimization process. After iteratively sampling from the design space, the robustness and effectiveness of the algorithm are proven to be improved significantly. To the best of our knowledge, the proposed algorithm holds state-of-the-art simulation-involved optimization framework. Comparative experiments have been conducted on benchmark functions, a two-dimensional fractured reservoir and a three-dimensional enhanced geothermal system. The proposed algorithm outperforms other five state-of-the-art surrogate-assisted algorithms on all selected benchmark functions. The results on the two heat extraction cases also demonstrate that SLLES can achieve superior optimization performance compared with traditional evolutionary algorithm and other surrogate-assisted algorithms. This work lays a solid basis for efficient geothermal extraction of enhanced geothermal system and sheds light on the model management strategies of data-driven optimization in the areas of energy exploitation.

How to cite: Chen, G., Jiao, J. J., and Luo, X.: Classifier-assisted level-based learning evolutionary search for heat extraction optimization of enhanced geothermal systems, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-1819, https://doi.org/10.5194/egusphere-egu23-1819, 2023.

A.38

EGU23-2110

ECS

Satellite gravimetry helps monitor the operation of large reservoirs

Jessica Besnier, Augusto Getirana, Nishan Biswas, and Venkataraman Lakshmi

All over the world, water levels are constantly changing. From lakes, to rivers, to oceans, the patterns of the water levels change due to different factors. With hydrological extremes increasing in intensity and duration around the world, it is important to understand what changes these levels in order to better predict and mitigate the negative impacts of changing water levels.

The goal of this study is to use estimates of terrestrial water storage (TWS) variability from the Gravity Recovery and Climate Experiment (GRACE) satellite mission to predict reservoir operation in Brazil. To do this, reservoir water elevations are derived from multi-satellite radar altimetry (RA) data and used as a proxy of their operation. 30 reservoirs in Southern Brazil are considered. For each reservoir, the Pettitt test was used to identify the point break within the TWS data, and the Mann-Kendall test was used to identify trends before and after these breaks.

A machine learning approach was used to reconstruct RA-based water elevations using GRACE data. The approach considered numerous geomorphologic and meteorologic characteristics of reservoirs, including reservoir area, volume, location, extent, depth, drainage area, and elevation, in addition to precipitation and temperature. Break points of time series and trends were also computed for each reservoir to explain why some reservoirs present a better fit than others. The findings of this study will give insight into what variables affect the relationship between TWS and RA height in the Parana Basin in Southern Brazil.

How to cite: Besnier, J., Getirana, A., Biswas, N., and Lakshmi, V.: Satellite gravimetry helps monitor the operation of large reservoirs, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2110, https://doi.org/10.5194/egusphere-egu23-2110, 2023.

A.39

EGU23-2328

ECS

Real-time gradual leakage detection system for water distribution networks based on MIMO-ANN

Xi Wan, Raziyeh Farmani, and Edward Keedwell

Leakage detection is a critical issue in water management for water distribution systems (WDSs). With the availability of real-time monitoring data, leakage detection for WDSs based on data-driven methods has received increasing attention in recent years. Current data-driven leakage detection methods are based on a single-step prediction model that only focuses on burst events that are characterized by sudden changes in flow or pressure data in a very short time. However, gradual leakage events that develop from small seeps to noticeable leaks could last for weeks or even months, and these gradual events will cause more water loss and do more harm to the WDS. Furthermore, the gradual leakage events are more challenging to be detected due to its slowly changing pattern. Therefore, this work presents an early warning system for gradual leakage events based on a multistep forecasting strategy. A multi-input multi-output (MIMO) artificial neural network (ANN) is developed to capture the diurnal, weekly and seasonal patterns in the flow monitoring data. The generated forecasting vector is further compared with the observed measurements based on the cosine distance. The residual vector is further analyzed by exponential weighted moving average (EWMA) to smooth the spikes and noises. The final statistics are then used to raise alarms for the monitoring data. The method has been applied to a hypothetical town called L-Town to demonstrate its applicability. The results showed that the proposed method is capable of detecting gradual leakage events with a very small growth rate. In addition, all gradual leakage events are detected with short detection time, high detection accuracy, and low false alarms.

How to cite: Wan, X., Farmani, R., and Keedwell, E.: Real-time gradual leakage detection system for water distribution networks based on MIMO-ANN, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2328, https://doi.org/10.5194/egusphere-egu23-2328, 2023.

A.40

EGU23-2666

ECS

Hybrid Multi Models Ensemble Framework Based on Clustering Algorithms for Runoff Reconstruction

Ujjwal Singh, Petr Maca, and Martin Hanel

Runoff is the key hydrological process, which is vital to the sustaining of human life on earth in examining
the climate change scenario. There are a lot of hydrological models available to simulate the runoff, but
these models’ outputs have biases due to uncertainty. Most machine learning algorithms cannot capture
the runoff generated by the real-world complex hydrological system accurately. The hybrid model combines
the efficiency of hydrological, machine learning, and ensemble modeling to minimize the bias of output [1],
[2]. The recent development of evolutionary computation in hybrid modelling frameworks combines the
efficiency of different components such as hydrological models, spatial autocorrelation, machine learning,
and machine learning ensemble to estimate robust and less biased runoff [1]. However, these components
need to significantly capture the heterogeneity and similarity of the catchment properties, which are highly
linked with the spatial variation of various hydrological patterns. Clustering is a technique that can group
similar types of hydrological patterns, which can be integrated within a hybrid modeling framework.
However, there is rarely found literature on the hybrid framework, which consists of different clustering
techniques and their ensemble. These clustering algorithms are based on different categories. We proposed
the hybrid ensemble framework based on extended input data, hydrological models, different clustering
algorithms, deep learning, and an ensemble of deep learning to reconstruct the minimum biased surface
runoff. We tested our proposed hybrid framework, which is robust compared to previously developed
frameworks. This proposed hybrid framework methodology will help to develop a new hybrid algorithm
to estimate the less biased surface runoff using various available climate data to understand the dynamics
of surface runoff for different spatial-temporal scales and climates.

[1] U. Singh, P. Maca, M. Hanel, et al., “Hybrid multi-model ensemble learning for reconstructing gridded
runoff of europe for 500 years,” vol. Available at SSRN: doi: 10 . 2139 / ssrn . 4188518. [Online].
Available: http://dx.doi.org/10.2139/ssrn.4188518.
[2] S. M. Hauswirth, M. F. Bierkens, V. Beijk, and N. Wanders, “The suitability of a hybrid framework
including data driven approaches for hydrological forecasting,” Hydrology and Earth System Sciences
Discussions, pp. 1–20, 2022.

How to cite: Singh, U., Maca, P., and Hanel, M.: Hybrid Multi Models Ensemble Framework Based on Clustering Algorithms for Runoff Reconstruction, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-2666, https://doi.org/10.5194/egusphere-egu23-2666, 2023.

A.41

EGU23-3296

ECS

Large-scale comparison of machine and statistical learning algorithms for blending gridded satellite and earth-observed precipitation data

Georgia Papacharalampous, Hristos Tyralis, Anastasios Doulamis, and Nikolaos Doulamis

An established way for improving the accuracy of gridded satellite precipitation products is to “correct” them by exploiting ground-based precipitation measurements, together with machine and statistical learning algorithms. Such corrections are made in regression settings, where the ground-based measurements are the dependent variable and the satellite data are predictor variables. Comparisons of machine and statistical learning algorithms in the direction of obtaining the most useful precipitation datasets by performing such corrections are regularly conducted in the literature. Nonetheless, in most of these comparisons, a small number of machine and statistical learning algorithms are considered. Also, small geographical regions and limited time periods are examined. Thus, the results provided tend to be of local importance and to not offer more general guidance. To provide results that are generalizable, we compared eight state-of-the-art machine and statistical learning algorithms in correcting satellite precipitation data for the entire contiguous United States and for a 15-year period. We used monthly data from the PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) gridded dataset and the Global Historical Climatology Network monthly database, version 2 (GHCNm). Our results suggest that extreme gradient boosting (XGBoost) and random forests are more accurate than the remaining algorithms, which can be ordered as follows from the best to the worst ones: Bayesian regularized feed-forward neural networks, multivariate adaptive polynomial splines (poly-MARS), gradient boosting machines (gbm), multivariate adaptive regression splines (MARS), feed-forward neural networks, linear regression.

How to cite: Papacharalampous, G., Tyralis, H., Doulamis, A., and Doulamis, N.: Large-scale comparison of machine and statistical learning algorithms for blending gridded satellite and earth-observed precipitation data, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3296, https://doi.org/10.5194/egusphere-egu23-3296, 2023.

A.42

EGU23-3320

Fusion of satellite precipitation products and ground-based measurements using LightGBM with a focus on extreme quantiles

Hristos Tyralis, Georgia Papacharalampous, Anastasios Doulamis, and Nikolaos Doulamis

Satellite precipitation products are not accurate in representing the actual precipitation measured by gauges. To improve their accuracy, machine learning algorithms are applied in regression settings with ground-based measurements as dependent variables and satellite precipitation data as predictor variables. Here we examine the case of light gradient-boosting machine (LightGBM) for correcting daily IMERG (Integrated Multi-satellitE Retrievals for GPM) and PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) precipitation data using daily precipitation measurements in the contiguous US. Our demonstration especially focuses on the estimation of quantiles of the conditional probability distribution of daily precipitation at given points, with emphasis on extreme values.

How to cite: Tyralis, H., Papacharalampous, G., Doulamis, A., and Doulamis, N.: Fusion of satellite precipitation products and ground-based measurements using LightGBM with a focus on extreme quantiles, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-3320, https://doi.org/10.5194/egusphere-egu23-3320, 2023.

A.43

EGU23-4463

A comparative study of machine learning approaches with wavelet transforms for groundwater level modeling (Case study: Unconfined Tehran aquifer, Iran)

(withdrawn)

Leyla Ghasemi, Meysam Vadiati, and Ozgur Kisi

A.44

EGU23-5096

A Development of High-Resolution Long-Term Gridded Meteorological Data for South Korea using Deep Learning

Yookyung Jeong and Kyuhyun Byun

Climate change has a considerable impact on socioeconomic fields as well as on the natural environment. To effectively respond and adapt to climate change, we should analyze the long-term climate change trends and future impacts according to plausible climate scenarios. For this, the production of high-quality and high-resolution gridded meteorological data based on observation is essential, which is important for developing high-quality downscaled future projections. However, South Korea lacks the long term gridded meteorological data because a dense network from ASOS (Automated Synoptic Observing System) and AWS (Automated Weather System) Stations was available only after 2000. To address this problem, this study aims to produce high quality gridded meteorological data for a historical period (1973-1999), which could have been generated if a dense network existed. Specifically, we reconstruct spatial variations and features of meteorological variables for the historical (1973-1999) period by relating the gridded products for more recent period (2000-21) to that for preceding period (1973-1999) based on a deep learning algorithm. For this, MK-PRISM, an interpolation method for quantifying the effects of meteorological factors based on elevation in South Korea was applied to produce two different version of gridded products based on two different observation networks: a sparse network (ASOS) and a dense network (ASOS+AWS) over the recent 22 years (2000-2021). Then, we develop the Long-Short Term Memory (LSTM) for each grid cell using the gridded products by the sparse network as input and the denser network as output layer. Finally, we generate meteorological variables for the period of 1973-1999 using gridded product by a sparse network as input of the developed LSTM model for each grid cell. Our preliminary results showed that Nash-Sutcliffe Efficiency (NSE) was higher than 0.9 in most grid climate prediction models. Therefore, our development in this study has a potential to calculate high-quality and long-term meteorological data which can be used as important data to analyze the long-term climate trends and variability.

Acknowledgement:

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2022R1A4A3032838).

How to cite: Jeong, Y. and Byun, K.: A Development of High-Resolution Long-Term Gridded Meteorological Data for South Korea using Deep Learning, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-5096, https://doi.org/10.5194/egusphere-egu23-5096, 2023.

A.45

EGU23-5841

ECS

B-AMA: a new Python protocol for hydrological predictions using data-driven models

Alessandro Amaranto and Maurizio Mazzoleni

The objective of this interactive poster session is to show the main features of B-AMA (Basic dAta-driven Models for All), an easy, flexible, fully coded Python-written protocol for the application of data-driven models (DDM) in hydrology. The protocol is specifically tailored for early career scientists with little background in coding, to foster them through the development of DDMs for hydrological forecasting while ensuring that none of the fundamental methodological steps is overlooked.

While a Jupyter notebook is already available online to guide the users through the protocol employment, during the session the interested audience can learn the main features of the software (data splitting, feature selection, hyperparameter optimization, and performance metrics) by running several practical hydrological workflows. The session will couple the visual representation B-AMA’s methodology with some laptop-based experiments, including rainfall-runoff, hydropower, and groundwater forecasts. We also allow loading customized csv data to deliver a first-hand experience of the protocol forecasting ability on the user’s specific case study, thanks also to the embedded visualization tools, which facilitate the efficient investigation and communication of results.

How to cite: Amaranto, A. and Mazzoleni, M.: B-AMA: a new Python protocol for hydrological predictions using data-driven models, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-5841, https://doi.org/10.5194/egusphere-egu23-5841, 2023.

A.46

EGU23-8534

ML approaches to flood susceptibility mapping at the country scale

Geoffrey Dawson, Junaid Butt, Paolo Fraccaro, and Anne Jones

Flooding is one of the most costly disasters in the UK, and its impact is projected to increase under climate change. Detailed, accurate and high resolution modelling and mapping of flood hazards are therefore essential to enable climate change adaptation. However, high resolution physics-based flood inundation models are extremely computationally intensive to run, presenting a challenge when mapping flood risk at the country scale, especially when working with ensembles of driving scenarios to account for uncertainty. Furthermore, efficient physical modelling for a target location and/or event required a priori categorisation of dominant flood type (for example fluvial or pluvial), which determines the selection and configuration of appropriate models. In reality, floods at scales beyond a local level are often a combination of multiple flood types. In recent years, machine learning approaches to mapping flood susceptibility have grown in popularity, enabled by large volumes of geospatial and weather/climate data from which explanatory flood factors can be derived. In this study, we develop a pluvial/fluvial flood susceptibility model for England, using high quality open datasets (elevation, land use, soil type, location of water bodies, rainfall) to derive hydrologically-meaningful features, and an open flood inventory dataset to sample flooded/non-flooded points. We train and test the model with grouped cross-validation hyper-parameter tuning for repeated samples of the data on a regular grid, where testing is carried out on unseen grid squares. We discuss the relative performance of different machine learning algorithms, including Random Forest and XG Boost, and assess the computational intensity and scalability of the model across training and inference phases. We also consider the potential of machine learning approaches to provide uncertainty estimates and, via explainable AI techniques, the sensitivity of the predicted flood probability to explanatory flood factors at any given location. Finally, we reflect on the part the modelling approach can play as part of a range of tools to meet the needs of consumers of flood risk information across multiple economic sectors.

How to cite: Dawson, G., Butt, J., Fraccaro, P., and Jones, A.: ML approaches to flood susceptibility mapping at the country scale, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-8534, https://doi.org/10.5194/egusphere-egu23-8534, 2023.

A.47

EGU23-8698

ECS

Long-Short Term Memory networks as observation operator for the states in a conceptual hydrological model

Olivier Bonte, Hans Lievens, and Niko Verhoest

During the last decades, data assimilation has demonstrated its merit for updating hydrological models with remotely sensed observations. Generally, physically-based models are used as these contain model states that can effectively be observed. Yet, remote sensing data, such as microwave backscattering, often needs to be converted to these model states using an observation operator, which often is a physically-based retrieval algorithm. When using conceptual models, the problem becomes more complicated as the model states cannot be related to actual physical properties in the field. Because of this, the observation operator is often an empirical relation between a hydrological (state) variable and a model state. In this poster presentation we demonstrate the use of a Long-Short Term Memory (LSTM) network as alternative observation operator that allows to convert remotely sensed observations into model state estimations of the Probability Distributed Model (PDM). Therefore, Sentinel-1 observations averaged at the catchment scale and for each land use type within the catchment, along with other data sources (such as LAI, precipitation, …) are fed to an LSTM in order to estimate a critical capacity of the probability distributed soil moisture reservoir. These data are then used to update the PDM through a classical data assimilation method (i.e. the ensemble Kalman Filter).

How to cite: Bonte, O., Lievens, H., and Verhoest, N.: Long-Short Term Memory networks as observation operator for the states in a conceptual hydrological model, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-8698, https://doi.org/10.5194/egusphere-egu23-8698, 2023.

A.48

EGU23-9962

Refining linear interpolation of water level data with the use of autoregressive models

Tomasz Niedzielski and Michal Halicki

Although linear interpolation is the simplest method for inputing hydrograph data, there are evidences for its efficiency in hydrology. It works well at edges of no-data gaps because the inputation is limited by bounds. However, it does not reconstruct the hydrologic variability of water levels recorded before and after a no-data gap.

In this paper, we combine linear interpolation with autoregressive models in order to account for both controlling bounds as well as anticipating irregular variation of hydrograph. We check the performance of this approach using hourly water level time series collected between 2016 and 2022 at 28 gauges located in the Odra/Oder River basin in Poland. For the purpose of validation, we produce missing data gaps artificially, using the moving window approach. By considering root mean square errors (RMSE) of interpolation as a function of gap length and investigating differences between these RMSE values computed using linear interpolation with/without autoregression, we identify cases in which the postulated approach refines purely linear interpolation. Initial studies suggest that the combination of methods reveals slightly better skills than the linear interpolation itself for short no-data gaps, the length of which does not exceed 24 hours.

The research has been conducted in frame of the project no. 2020/38/E/ST10/00295 within the Sonata BIS programme of the National Science Centre, Poland.

How to cite: Niedzielski, T. and Halicki, M.: Refining linear interpolation of water level data with the use of autoregressive models, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-9962, https://doi.org/10.5194/egusphere-egu23-9962, 2023.

A.49

EGU23-11973

ECS

A deep learning approach based on Bayesian optimization for prediction of stable isotope concentrations in stream and groundwater

Amir Sahraei, Tobias Houska, and Lutz Breuer

High temporal resolution (i.e., sub-daily) stable isotope concentrations of multiple stream and groundwater sources reveal small-scale, rapid transport and mixing processes that are not discernible at coarser resolution. However, long-term, routine sampling of multiple water sources at high temporal resolution is far from widespread. In recent years, the rise of deep learning offers the opportunity to further improve the prediction accuracy of infrequently measured data owing to its capability to efficiently abstract interrelationship patterns in complex and non-linear systems. In this research, we explore the potential of a Long Short-Term Memory (LSTM) deep learning model to predict high-resolution (3 h) isotope concentrations of multiple stream and groundwater sources in the Schwingbach Environmental Observatory (SEO), Germany. The key objective of this study is to examine the predictive performance of the LSTM that is simultaneously trained on multiple sites with a set of explanatory data that are more convenient and less expensive to collect in comparison to the stable water isotopes. The explanatory data comprise meteorological data, soil moisture, and natural tracers (i.e., water temperature, pH, and electrical conductivity). A sensitivity analysis is applied to investigate the model performance under different input data and sequence lengths. A Bayesian optimization algorithm is employed to optimize the hyperparameters of the LSTM to ensure an efficient model performance. The main outcome of our study shows that the LSTM enables the prediction of stable isotopes in streams and groundwater by using only a short sequence (6 hours) of recorded water temperature, pH, and electrical conductivity. The best performing LSTM reached on average an RMSE of 0.7‰, MAE of 0.4‰, R² of 0.9, and NSE of 0.7. The proposed model can be used to predict continuous time series of stable water isotope concentrations, either for gap filling or in cases when continuous data collection is not possible. This is very worthwhile in practice since measurements of tracers used in our LSTM are still much cheaper than those of stable water isotopes and can be carried out continuously with relatively low associated maintenance. In future research, the pre-trained LSTM should be applied through transfer learning to other catchments at which the length and resolution of available data are not sufficient to build a standalone model.

How to cite: Sahraei, A., Houska, T., and Breuer, L.: A deep learning approach based on Bayesian optimization for prediction of stable isotope concentrations in stream and groundwater, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-11973, https://doi.org/10.5194/egusphere-egu23-11973, 2023.

A.50

EGU23-16266

ECS

Meta-learning for water level prediction

Asma Slaimi, Michael Scriney, Susan Hegarty, Fiona Regan, and Noel E. O’Connor

In the Artificial intelligence (AI) sense, meta-learning is the ability of an artificially intelligent machine first to learn how to conduct different complex tasks, taking the principles it utilised to learn one task and applying them to other different tasks. Hence, the general concept of "learning how to learn". Machine learning provides capabilities to learn from past data and generates models for future prediction, which can be helpful for multiple catchment management tasks, such as water elevation monitoring and flood prediction.

Our initial studies focused on predicting and evaluating the ML-based hydrologic time-series models based on their predictive performance. We used eight machine learning algorithms to predict river water levels, including Baseline, Linear, Dense, MultiDense, CNN, RNN, GRU and LSTM techniques. The eight models were employed for one hour ahead of river water level forecasting in 70 hydrometric stations in Ireland. The results show that the NN-based models generally performed well in predicting the water level, with some differences in each model's performance for different stations. These results suggest that a single machine learning model may be sufficient for forecasting river water levels in one location and perform poorly in another. Hence, there is no overall best model; and the selected model may significantly impact the desired results.

This study's main goal was to investigate a meta-learning-based approach for water level prediction. The proposed Meta-learning approach comprises two phases; Learning and meta-learning. The meta-learning process uses the outcomes of the previous experiments to accomplish the Learning Training and Practising phases of the meta-learner. Later the outcome of the previous step will be the Databases to create the learner (learning about learning phase).

Creating meta-learning models can help AI models to generalise learning methods and acquire new skills more quickly. We expect the meta-learning model to adjust well when generalising to previously unknown datasets and environments that have never been encountered during training.

Keywords: Machine learning (ML), meta-learning, water-level prediction, hydrologic time-series forecasting.

How to cite: Slaimi, A., Scriney, M., Hegarty, S., Regan, F., and E. O’Connor, N.: Meta-learning for water level prediction, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-16266, https://doi.org/10.5194/egusphere-egu23-16266, 2023.

A.51

EGU23-16995

Multi-period and multi-variable calibration of SWAT+ using gridded input datasets and a novel R package

Rodrigo Marinao and Mauricio Zambrano-Bigiarini

For more than two decades, multi-objective optimisation (MOO) has imposed a new paradigm in the calibration of hydrological models, and over the years different algorithms and calibration approaches have been developed that aim to obtain consistent parameters for some specific hydrological model. However, the development of flexible multi-objective calibration tools has been scarce, making it difficult to spread these approaches to a wide range of researchers.

The objective of this work is to show the application of a new multi-objective and multi-platform R package (hydroMOPSO) for the calibration of SWAT+, a widely used semi-distributed hydrological model. In particular, in this work hydroMOPSO is used beyond the traditional adoption of multiple objective functions. Instead, a multi-period (dry and wet years), and multi-variable (point streamflows and gridded soil moisture and evapotranspiration) are used as objectives, to illustrate the flexibility of hydroMOPSO to be linked with different model input and outputs, both with different file formats and temporal frequencies. Similar approaches could be applied with other hydrological models available in R (e.g., TUWmodel, airGR, topmodel) or any other model that can be run from the system console (e.g., Raven, MODFLOW, WEAP).

How to cite: Marinao, R. and Zambrano-Bigiarini, M.: Multi-period and multi-variable calibration of SWAT+ using gridded input datasets and a novel R package, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-16995, https://doi.org/10.5194/egusphere-egu23-16995, 2023.

Posters virtual: Tue, 25 Apr, 08:30–10:15 | vHall HS

Chairperson: Amin Elshorbagy

vHS.3

EGU23-6685

Improving sub-seasonal drought forecasting via machine learning to leverage climate data at different spatial scales

Francesco Bosso, Claudia Bertini, Matteo Giuliani, Dimitri Solomatine, and Schalk Jan van Andel

Droughts are one of the most dangerous natural hazards that are affecting societies, with an economic impact amounting to over 9 billion euros per year in Europe. Drought events usually originate from a precipitation deficit, which can then cause water shortages, agricultural losses, and environmental degradation. Despite the numerous efforts and recent advances in predicting weather and extreme weather events, accurately forecasting rainfall remains a challenge, especially at sub-seasonal lead-times. In this case, the reference period is short enough for the atmosphere to retain a memory of its initial conditions, but also long enough for oceanic variability to affect atmospheric circulation. However, the relative contribution of climate teleconnections and local atmospheric conditions to the genesis of total precipitation at sub-seasonal scale remains unclear. In this work, we aim to address this gap by advancing the Climate State Intelligence (CSI) framework to examine the impact of both teleconnection patterns and local atmospheric conditions on monthly total precipitation. We then use the information gained to forecast total precipitation with a one-month lead time, and we test three different Machine Learning (ML) models: (i) Extreme Learning Machine (ELM); (ii) Fully Connected Neural Network; (iii) Convolutional Neural Network (CNN). We finally assess the skill of our ML-based precipitation forecasts in predicting the Standardized Precipitation Index (SPI), using the ECMWF Extended Range forecasts as a benchmark. Our framework is developed within the CLImate INTelligence (CLINT) project and applied in the Rhine Delta area, in the Netherlands. Initial findings indicate that combining global and local climate contexts into ML-based models significantly improves state-of-the-art drought forecast accuracy, thus representing a promising option to timely prompt anticipatory drought management measures.

How to cite: Bosso, F., Bertini, C., Giuliani, M., Solomatine, D., and van Andel, S. J.: Improving sub-seasonal drought forecasting via machine learning to leverage climate data at different spatial scales, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-6685, https://doi.org/10.5194/egusphere-egu23-6685, 2023.

vHS.4

EGU23-9790

ECS

Deep Learning for Probabilistic Forecasts Using Features from Rainfall Objects: A Case Study in the Amazon Basin

Omesh Persaud, Gerald Corzo Perez, Dimitri Solomatine, Eliana Torres, Vinícius Alencar Siqueira, and Ingrid Petry

Hydrological forecasting is of global importance, especially with the spotted increasing trend of flood-related disasters as seen in the last two decades. The causative rainfall events of these extreme events are primarily analysed in a one-dimensional method. However, through an object-based approach, more data on these rainfall fields can be generated and studied to link them to the hydrological response observed. Through an object-based methodology ST-CORA, features from rate of change of rain intensity in space and time can be extracted by simple visual inspection. Every side of an object provides time variations that can be used as images that contain features not easy to extract. In general, rainfall events in previous studies have used aggregated information, like the duration, area, volume, maximum intensity, and the centroid. In this work, more information is captured that describes the spatial and temporal properties of the event. The main objective of this research is to use these 3D objects and their features with a deep learning model to produce a 15-day hydrological probabilistic forecast for flood prediction.

A calibrated version of a large-scale hydrological model (MGB) is used to study an Amazon subbasin. The model is forced with the 50-member perturbed forecast from the TIGGE dataset for the period 2006 to 2014 (from ECMWF). The purpose of using the large-scale model is to better capture the spatio-temporal characteristics over a wider area in an effort to reduce the uncertainty in the analysis. For data-driven models, there is a need for sufficiently large databases, in this case for both the causative rainfall events and the observed hydrological responses. As such, the first two steps relate to the data generation. The first database is developed from the daily streamflow which is generated from the calibrated hydrological model at specific locations of interest with the known higher performance metrics. Second, the ST-CORA methodology is applied to extract the features from the rainfall events in order to develop a database of the rainfall objects. Third, an analysis on the statistics of the features of the objects to understand the rainfall which occurs within the study area. The final part of the research involves the effective use of these features and objects with a deep learning model. From the average annual rainfall from 2001 to 2020, three distinct precipitation patterns are observed. For the streamflow, the subbasin shows a relatively fast response which is captured within a 15-day window.

A convolutional LSTM deep learning model is developed to handle 3D rainfall objects as sequences of images representing space time sequences. The outcome of this research contributes to the end-to-end deep learning model which receives the forecasted rainfall as objects and generates a corresponding hydrograph at the area of interest for which it has been trained. A potential contribution of this Conv-LSTM network is that it may provide an efficient and automated approach for streamflow forecasting in basins where there is known complexity and non-linearity, which is especially useful for early warning systems.

How to cite: Persaud, O., Corzo Perez, G., Solomatine, D., Torres, E., Siqueira, V. A., and Petry, I.: Deep Learning for Probabilistic Forecasts Using Features from Rainfall Objects: A Case Study in the Amazon Basin, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-9790, https://doi.org/10.5194/egusphere-egu23-9790, 2023.

vHS.5

EGU23-724

ECS

Reservoir Operation and Multi-Attribute Decision-Making – Web-based Tool

Moola Rajasree and Roshan Srivastav

Reservoir operation plays a significant role in effectively managing water resources, especially mitigating future droughts. Several simulation, and optimization models are used to obtain optimal reservoir operation solutions based on hedging rules. These solutions are usually in the form of Pareto front and are derived to satisfy multiple objectives related to water supply measures. However, it is challenging to achieve a single solution when various performance indices such as reliability, resilience and vulnerability are considered in evaluation. Therefore, this study proposes a web-based application for reservoir operation with Multi-Attribute Decision Making (MADM) methods to evaluate and provide rankings for reservoir operation solutions. It includes two components: (i) reservoir operation module based on simulation-optimization framework with hedging policies; and (ii) decision-making module that includes evaluation and ranking of solutions obtained from first module and comparison of rankings obtained from various MADM methods. Overall the tool provides a set of solutions for different water supply reduction measures/hedging policies and ranking the solutions for performance indices which would help reservoir operators, practitioners, and researchers for optimal water allocation and decision-making.

How to cite: Rajasree, M. and Srivastav, R.: Reservoir Operation and Multi-Attribute Decision-Making – Web-based Tool, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-724, https://doi.org/10.5194/egusphere-egu23-724, 2023.

vHS.6

EGU23-9409

ECS

Interpolation of hydrological time series via Dynamic Mode Decomposition

Giulia Libero and Valentina Ciriello

The distribution and availability of water resources all around the world are strongly affected by climate change. To deal with any negative impact, the research community is asked to provide accurate information to guide adaptation and mitigation strategies. The effort is supported by the increasing availability of data, which is fueling studies about climate-related phenomena. A massive contribution comes from satellite technologies, which have evolved rapidly in the past few decades, and now provide data with improved spatial coverage and time resolution. However, an important issue related to this type of product is still represented by missing data. The gap between data, especially if long-lasting, breaks the continuity of the observations and limits further application of the time series. Different data-driven methods have been tested to bridge these gaps, and even to reconstruct the series in the past. A new viable approach could be represented by the dynamic mode decomposition (DMD), a data-driven model reduction technique that originated in the fluid dynamics community, capable of extracting coherent structures directly from spatiotemporal complex system data. The DMD method allows to automatically embed seasonal variations and capture trends in the data, for this reason, it is used for the detection of patterns, the extraction of reduced order models, and the prediction of time series based on previous observations. A suite of DMD algorithms is available to handle different applications. Here, we use different DMD algorithms and analyze their capability to reconstruct and interpolate time series of total water storage anomalies as provided by the Gravity Recovery and Climate Experiment (GRACE) satellite mission. The mission is focused on monitoring mass distribution changes on Earth through the measurement of Earth’s gravity field variations. Changes in gravity detected by GRACE can be used to derive estimates of water distribution on the planet and hence provide pioneering data to draw an integrated global view of how Earth’s water cycle is evolving. GRACE data are freely available and provided to the users as global matrices of centimeters of equivalent water thickness anomalies relative to a baseline mean. The native resolution is 3 degrees, but a matrix of scale factors can be applied to adapt the data on a global regular grid at a resolution of 0.5 degrees in both latitude and longitude. Data are available from April 2002 to the present, on a monthly scale, but the series is affected by some short-term gaps and a major interruption of approximately 1 year, due to the transition between the first GRACE mission, flown from March 2002 to October 2017, and the GRACE Follow-On (GRACE‐FO) mission launched in May 2018. In this study, the DMD method is applied to capture the hidden information embedded in the large amount of data collected by GRACE missions and then to use them to interpolate the short-term gaps and bridge the larger one-year gap.

How to cite: Libero, G. and Ciriello, V.: Interpolation of hydrological time series via Dynamic Mode Decomposition, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-9409, https://doi.org/10.5194/egusphere-egu23-9409, 2023.

vHS.7

EGU23-13411

ECS

A novel approach for predicting spring locations using machine learning algorithms in Indian Himalayan Region

Pradeep Gairola, Arabinda Maiti, Srikanta Sannigrahi, Anand Bhatt, Soban Singh Rawat, Sudhir Kumar, Deepak Singh Bisht, and Sandeep Bhatt

Due to the concerning effects of climate change, groundwater will be one of the significant sources of water for both primary and secondary use in the future. Therefore, identifying the spatial patterns of groundwater distribution might help implement practical water resources management projects. Springs are a potential source of groundwater in the Indian Himalayan Region. The main objective of the current study is to explore a novel methodological approach that utilizes the Variance Inflation factor (VIF) to perform a feature selection procedure and most used machine learning (ML) algorithms, including Random Forest (RF), Gradient Boosting Machine (GBM), and Neural Network (NN) for generating a groundwater spring potential map of the Ravi Basin in Himachal Pradesh, India. Used, 1834 spring and non-spring locations were selected from the field and split into two groups. Of 1834 samples, 70% (1283) were used for model training, and 30% (551) were used for model validation. The model’s overall accuracy of 0.89, 0.87, and 0.88 for RF, GBM, and NN, respectively, around 10% area, has a very high potential for spring occurrence. The novel methodology can be employed to find the initial information for GW exploitation for inaccessible areas and the lack of data sources in this area.

How to cite: Gairola, P., Maiti, A., Sannigrahi, S., Bhatt, A., Singh Rawat, S., Kumar, S., Singh Bisht, D., and Bhatt, S.: A novel approach for predicting spring locations using machine learning algorithms in Indian Himalayan Region, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-13411, https://doi.org/10.5194/egusphere-egu23-13411, 2023.