Session 6 | Big data science in hydrological research

Session 6

Big data science in hydrological research
Conveners: Ana Maria Tarquis, Ralf Kunkel
Orals
| Thu, 15 Jun, 09:00–10:15|Saints Marcellino and Festo
Poster
| Attendance Thu, 15 Jun, 10:45–11:30|Poster area
Orals |
Thu, 09:00
Thu, 10:45
In the past decade, hydrological and critical zone observatories have been established that produce massive amounts of data for a range of critical zone processes. As it remains challenging to analyze such data sets, we solicit submissions that present novel strategies to support critical zone studies in the light of big data.

Invited speaker: Steffen Zacharias, Germany (steffen.zacharias@ufz.de)

Orals: Thu, 15 Jun | Saints Marcellino and Festo

Chairpersons: Ralf Kunkel, Ana Maria Tarquis
Keynote presentation
09:00–09:15
|
GC8-Hydro-25
|
keynote
Steffen Zacharias, Harry Vereecken, Jaana Bäck, and Michael Mirtl

We live in a world of rapid social, economic and ecosystem change, facing major environmental challenges such as global warming, biodiversity loss and pressures on natural resources. Addressing these topics requires world-class ecosystem research by a well-connected, extensive community of experts, supported by advanced sites and facilities, openly shared and easily accessible data and capacity building programs. This is the goal of the Integrated European Long-Term Ecosystem, critical zone and socio-ecological system Research Infrastructure (eLTER RI).

eLTER RI will adopt a fundamentally systemic approach to observe and analyse the environmental system, encompassing biological, geological, hydrological and socio-ecological perspectives. It will be the first research infrastructure capturing and analysing holistically the integrated impacts of climate change alongside other pressures on a wide variety of European ecosystems. Ca. 200 eLTER research sites will provide a wide scale and systematic coverage of major European terrestrial, freshwater and transitional water ecosystem. eLTER RI will allow in-situ, co-located gathering of Essential Variables ranging from bio-physico-chemical to biodiversity and socio-ecological data. Ecosystem change caused by long-term pressures and short-term pulses will be investigated in a nested design from the local to the continental scale. With the huge number of in-situ sites and platforms and the harmonized and standardized observation concept, the eLTER RI offers outstanding new perspectives for hydrological research in Europe.

One of the major aims of long term ecosystem monitoring and research is to provide quality controlled and reliable data to support scientific analyses and enable input for designing environmental policies and assessing their impacts. Both the concept and in-situ design as well as the basic architectures and tools of the eLTER RI to support data providers and data users will be presented.

How to cite: Zacharias, S., Vereecken, H., Bäck, J., and Mirtl, M.: eLTER RI – A new European Research Infrastructure addressing today’s environmental challenges – a new perspective for European hydrological research, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-25, https://doi.org/10.5194/egusphere-gc8-hydro-25, 2023.

Global and Open Data
09:15–09:25
|
GC8-Hydro-55
|
Stephan Dietrich, Claudia Färber, Matthias Zink, Philipp Saile, and Ulrich Looser

The exchange of data and information on freshwater-related observations has been a key issue for scientists and hydrologists for decades. Although significant improvements have been made in the observation of the global hydrological cycle, the Global Climate Observation System (GCOS), in its latest Implementation Plan 2022, still identifies the need for further improvements in the exchange of hydrological data. This message has been echoed by the COP27 Sharm-el-Sheik 2022 Cover Decision. The main barriers are long known and related to a) lack of capacity to apply international standards for data and metadata exchange and b) restrictive data policies that hinder data exchange.

The Global Terrestrial Network - Hydrology has been established in 2001 to support a range of climate and water resource objectives, building on existing networks and data centres and creating integrated products on the global water cycle. Today, GTN-H comprises 12 data centres and networks, such as GRDC, the International Soil Moisture Network (ISMN), the Global Environment Monitoring System for Freshwater (GEMS/Water), IGRAC’s Global Groundwater Monitoring Network (GGMN) or FAO AQUASTAT. The data and information provided by the GTN-H Global Data Centres are an essential source of information for the UN, regional and national programmes and projects in support of development and science. GTN-H is a joint effort of the Global Climate Observing System (GCOS) and the World Meteorological Organisation (WMO).

In this presentation, we will summarise past and recent efforts to improve data sharing in Europe and globally. Additionally, we will present recent developments in agendas and technical implementation efforts at the UN level to improve data sharing. These include:

 

  • Some historic background: In the 1980s, the UNESCO FRIEND-Water (Flow Regimes of International Experimental and Network Data) global water community has been established to collect and share hydrological observations for scientific assessment of flow regimes. These activities led to projects such as EURO-FRIEND's European Water Archive and SA-FRIEND's Southern Africa Flow Database. Both datasets have been integrated into GRDC’s Global Runoff Database.
  • An introduction to the global acting data centers federated within the GTN-H, focussing on hydrological, climate and environmental observations worldwide.
  • The World Meteorological Organisation emphasises the importance of open data policies and interoperability. We will provide an insight into the recent efforts of the GEMS/Water Data Centre to improve the interoperability of water quality data and show the success of the GRDC in implementing a new data portal. We will also present a concept for tiered networks and how to assess their maturity.
  • The IX. Phase of UNESCO's Intergovernmental Hydrological Programme, which aims to fill the data knowledge gaps in hydrology, particularly by engaging the scientific community.
  • Finally, we will report on the UN 2023 Water Conference, with outcomes that will focus on sharing of water observation data and information to achieve the goals of SDG6.

keywords: global data centres; operational hydrology; data analysis; open data; data sharing

How to cite: Dietrich, S., Färber, C., Zink, M., Saile, P., and Looser, U.: The Global Terrestrial Network - Hydrology (GTN-H): network of networks for integrated observations of the global water cycle, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-55, https://doi.org/10.5194/egusphere-gc8-hydro-55, 2023.

09:25–09:35
|
GC8-Hydro-57
|
Matthias Zink, Fay Boehmer, Tunde Olarinoye, Wolgang Korres, Kasjen Kramer, Irene Himmelbauer, Daniel Aberer, Roberto Sabia, Raffaele Crapolicchio, Philippe Goryl, Klaus Scipal, Wouter Dorigo, and Stephan Dietrich

Soil moisture is recognized as an Essential Climate Variable (ECV) because it is crucial for assessing water availability for plants and hence food production. Having long time series of freely available soil moisture data with global coverage enables scientists, farmers and decision makers to detect trends, assess the impacts of climate change, and develop adaptation strategies.

The collection, harmonization and archiving of in situ soil moisture data was the motivation to establish the International Soil Moisture Network (ISMN) at TU Wien, with the financial support of the European Space Agency (ESA), in 2009 as a community effort. The ISMN became an essential source for validating and improving global satellite products, and climate, land surface, and hydrological models. In 2021 permanent funding for the ISMN operations was secured through the German Government (Ministry of Digital and Transport).

The transfer of the ISMN to its new host, i.e., the International Centre for Water Resources and Global Change (ICWRGC)/German Federal Institute of Hydrology (BfG), took place during 2021/2022. The takeover posed the challenge to migrate an operational service between two different teams, locations/hardware and organisations. Finally, the ISMN started serving data from its new host in December 2022 while keeping the service continuously running throughout the migration. In parallel the team in Vienna developed and launched a new dataviewer. This presentation aims at showcasing new ISMN features as well as recent data contributions as well as next evolution of the ISMN based on synergies and science outcome of the Research and Development activities performed by ESA in the context of the Fiducial Reference Measurements for Soil Moisture (FRM4SM) project.

How to cite: Zink, M., Boehmer, F., Olarinoye, T., Korres, W., Kramer, K., Himmelbauer, I., Aberer, D., Sabia, R., Crapolicchio, R., Goryl, P., Scipal, K., Dorigo, W., and Dietrich, S.: Ensuring ISMN’s permanent service for delivering long-term, in situ soil moisture data, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-57, https://doi.org/10.5194/egusphere-gc8-hydro-57, 2023.

Machine Learning & Scaling Issues
09:35–09:45
|
GC8-Hydro-22
|
ECS
Mathilde de FLEURY, Laurent Kergoat, Martin Brandt, Rasmus Fensholt, Ankit Kariryaa, Gyula Mate Kovács, Stéphanie Horion, and Manuela Grippa

Inland surface water, especially lakes and small water bodies, are essential resources and have impacts on biodiversity, greenhouse gases and health. This is particularly true in the semi-arid Sahelian region, where these resources remain largely unassessed, and little is known about their number, size and quality. Remote sensing monitoring methods remain a promising tool to address these issues at the large scale, especially in areas where field data are scarce. Thanks to technological advances, current remote sensing systems provide data for regular monitoring over time and offer a high spatial resolution, up to 10 metres.

Several water detection methods have been developed, many of them using spectral information to differentiate water surfaces from soil, through thresholding on water indices (MNDWI for example), or classifications by clustering. These methods are sensitive to optical reflectance variability and are not straight forwardly applicable to regions, such as the Sahel, where the lakes and their environment are very diverse. Particularly, the presence of aquatic vegetation is an important challenge and source of error for many of the existing algorithms and available databases.

Deep learning, a subset of machine learning methods for training deep neural networks, has emerged as the state-of-the-art approach for a large number of remote sensing tasks. In this study, we apply a deep learning model based on the U-Net architecture to detect water bodies in the Sahel using Sentinel-2 MSI data, and 86 manually defined lake polygons as training data. This framework was originally developed for tree mapping (Brandt et al., 2020, https://doi.org/10.1038/s41586-020-2824-5).

Our preliminary analysis indicate that our models achieve a good accuracy (98 %). The problems of aquatic vegetation do not appear anymore, and each lake is thus well delimited irrespective of water type and characteristics. Using the water delineations obtained, we then classify different optical water types and thereby highlight different type of waterbodies, that appear to be mostly turbid and eutrophic waters, allowing to better understand the eco-hydrological processes in this region.

This method demonstrates the effectiveness of deep learning in detecting water surfaces in the study region. Deriving water masks that account for all kind of waterbodies offer a great opportunity to further characterize different water types. This method is easily reproducible due to the availability of the satellite data/algorithm and can be further applied to detect dams and other human-made features in relation to lake environments.

How to cite: de FLEURY, M., Kergoat, L., Brandt, M., Fensholt, R., Kariryaa, A., Kovács, G. M., Horion, S., and Grippa, M.: Sentinel-2 MSI for mapping Sahelian water bodies using a U-Net network, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-22, https://doi.org/10.5194/egusphere-gc8-hydro-22, 2023.

09:45–09:55
|
GC8-Hydro-123
|
Yijian Zeng, Fakhereh Alidoost, Bart Schilperoort, Yang Liu, and Zhongbo Su

Climate projections strongly suggest that the 2022 sweltering summer may be a harbinger of the future European climate. Climate extremes (e.g., droughts and heatwaves) jeopardize terrestrial ecosystem carbon sequestration and hinder EU's goal of being climate-neutral by 2050. The construction of an open digital twin of the soil-plant system helps to monitor and predict the impact of extreme events on ecosystem functioning, the resulting information from which can be used to recommend measures and policies to increase the resilience of ecosystems to climate-related challenges. There are three main components of the soil-plant digital twin:  i) The soil-plant model for a digital representation of the soil-plant system; ii) Physics-aware machine learning algorithms to approximate the soil-plant model; and iii) Data assimilation framework to digest Earth Observation data to update the states of the soil-plant system. This paper will present a prototype of this open soil-plant digital twin.

How to cite: Zeng, Y., Alidoost, F., Schilperoort, B., Liu, Y., and Su, Z.: Towards an Open Digital Twin of Soil-Plant System, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-123, https://doi.org/10.5194/egusphere-gc8-hydro-123, 2023.

09:55–10:05
|
GC8-Hydro-83
Enrico Zorzetto, Sergey Malyshev, Nathaniel Chaney, and Elena Shevliakova

The land components of Earth System Models (ESMs) are increasingly used to predict changes in surface climate and hydrological processes at the global scale. However, due to their coarse resolution, these models still struggle in representing the fine-scale spatial variability of key land variables important for hydrological applications, such as snow cover, land surface temperature, and soil moisture. To address this limitation, we test here a sub—grid model structure recently developed for the Geophysical Fluid Dynamics Laboratory (GFDL) ESM4.1 land model. This novel model structure employs a machine learning technique and high-resolution terrain data to partition each land surface model grid in a set of land ‘tiles’ with homogeneous physical properties, which are then used to learn land processes at scales finer than the nominal land model resolution. This technique is especially relevant over complex terrain, where we can use elevation information to refine model predictions of the local energy balance and hydrological processes. Over each topography-aware model tile, the land model can thus provide local estimates of relevant land variables which can be then combined to produce high resolution maps and learn their spatial variance. As a proof of concept, here we will compare this modelling approach with satellite observations of land surface temperature, evaluating the skill of the model in reproducing land heterogeneity over complex topography regions. This analysis can be extended to other variables of hydrological interest, in particular soil moisture. As land models of increasing spatial resolution are being developed, our analysis here underlines the importance of evaluating not only grid average model output, but also predicted spatial variability using observational datasets.

How to cite: Zorzetto, E., Malyshev, S., Chaney, N., and Shevliakova, E.: Can Earth System Models represent the spatial variability of land surface processes over complex terrain?, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-83, https://doi.org/10.5194/egusphere-gc8-hydro-83, 2023.

10:05–10:15
|
GC8-Hydro-126
Ernesto Sanz, Juan José Martín Sotoca, Antonio Saa-Requejo, Carlos H. Díaz-Ambrona, Margarita Ruiz-Ramos, Alfredo Rodríguez, Andres Almeida, Rubén Moratiel, and Ana M. Tarquis

Soil-vegetation-atmosphere transfer (SVAT) schemes explicitly consider the role of vegetation in affecting water and energy balance by considering its physiological properties. However, most current SVAT schemes and hydrological models do not consider vegetation a dynamic component. The seasonal and monthly evolution of the physiological parameters is kept constant year after year. This fact is likely crucial in transient climate simulations for hydrological models used to study climate change impact. Therefore, the analysis of vegetation dynamics became crucial to study these scenarios.

Vegetation dynamics, especially over large scales, can be monitored using remote sensing. The Normalised Difference Vegetation Index (NDVI) is still the most well-known and frequently used spectral indices derived from remote sensing, identifying vegetated areas and their condition. NDVI is based on plants' differential reflectance for different parts of the solar radiation spectrum.

In this work, we present a classification of rangelands in Spain based on the NDVI time series using them, like the result of SVAT and defining metrics and the Hurst Exponent from detrended fluctuation analysis. These areas are located in different precipitation and temperature regimen but with a Mediterranean climate with different aridity grades: Huescar, Castuera and Lozoya. K-means and unsupervised random forest were used to cluster the pixels using time series metrics and Hurst exponents. The clustering results will be discussed by comparing them to climate and topographical data.

References

Sanz E, Sotoca JJM, Saa-Requejo A, Díaz-Ambrona CH, Ruiz-Ramos M, Rodríguez A, Tarquis AM. Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence. Remote Sensing. 2022; 14(19):4949. https://doi.org/10.3390/rs14194949

Acknowledgements

Financial support from the project "CLASIFICACIÓN DE PASTIZALES MEDIANTE MÉTODOS SUPERVISADOS - SANTO" code RP220220C024, by Universidad Politécnica de Madrid, is highly appreciated.

How to cite: Sanz, E., Martín Sotoca, J. J., Saa-Requejo, A., Díaz-Ambrona, C. H., Ruiz-Ramos, M., Rodríguez, A., Almeida, A., Moratiel, R., and Tarquis, A. M.: Clustering Rangelands Based on NDVI Annual Patterns with different aridity grades, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-126, https://doi.org/10.5194/egusphere-gc8-hydro-126, 2023.

Poster: Thu, 15 Jun, 10:45–11:30 | Poster area

Chairperson: Ana Maria Tarquis
P11
|
GC8-Hydro-65
|
ECS
Daniele Lepore, Christopher Conrad, Vincenzo Allocca, Delia Cusano, Johannes Löw, Lèonard El-Hokayem, and Pantaleone De Vita

Remote sensing is recognized as the most feasible means to provide regional information on land surfaces and monitor soil parameters such as soil moisture and evapotranspiration. The use of satellite-derived products can be crucial for groundwater resources in karst aquifers, particularly in regions, such as southern Italy, where groundwater availability drives economic and social development and there is a lack of monitored data. This study aims to expand the classical hydrogeological approach, used for the estimation of groundwater recharge of karst aquifers, to the understanding of the hydrological role of soil coverings by the integration of field monitoring and products derived by remotely sensed data. The research was conducted on the representative Mts. Soprano-Vesole-Chianello karst aquifer (Campania, southern Italy). Copernicus Global Land Services Soil Water Index (SWI) and Moderate Resolution Imaging Spectroradiometer (MODIS) Evapotranspiration products were explored to assess soil water content and evapotranspiration regimes. The analysis included time series gathered by a monitoring network consisting of 5 soil moisture multi-profile probes, working since 2021. The SWI1km provides daily soil water content information at 1 km resolution. Depending on the uncertain calculation, not considering evapotranspiration and soil texture, the SWI1km product provides 8-SWI estimations and the related quality factor values. Instead, the MOD16A2 is based on MODIS data and provides 8-day evapotranspiration estimation at 0.5 km resolution. The product collection is based on the logic of the Penman-Monteith equation, which integrates inputs of daily meteorological re-analysis data along with products derived by (MODIS) including vegetation property dynamics, albedo, and land cover.

Both products showed zones of no-data occurring across the mountain areas of the karst aquifers. This limitation is related to the algorithms that consider several parameters such as topography (slope aspect and angle) and occurrence of clouds for product generation.

The primary outcome of this study was the extraction of SWI values and the calculation of a mean value for the 8-SWI values, weighted by the related quality factor (SWIw). SWIw showed a constant difference of about -20% in comparison to the daily average values obtained by field monitoring. Despite this discrepancy, the annual trend of the SWIw was found being very consistent with the soil moisture probe measurements (corr. > 0.68) and displaying a good response to rainfall events.

Moreover, the MODIS ET data displayed the expected pattern of evapotranspiration with a temporal resolution not achievable in other ways considering the lack of local meteorological data.

In order to cope with missing data across the mountain areas of the karst aquifer, a spatial interpolation of SWIw and MODIS ET was carried out by different geostatistical techniques.

The findings suggest that SWI1km and MODIS16A2 are useful in monitoring soil water content and evapotranspiration of soils covering karst aquifers and controlling groundwater recharge. Although there are limitations due to missing data, both products can be still effectively utilized if properly interpolated. Therefore, they can be considered fundamental for assessing patterns of groundwater recharge in karst aquifers, especially in areas which are not extensively monitored as in the case of southern Italy.

How to cite: Lepore, D., Conrad, C., Allocca, V., Cusano, D., Löw, J., El-Hokayem, L., and De Vita, P.: Integration of remotely sensed and field monitoring data for characterizing the hydrological regime of soils covering karst aquifers and assessing groundwater recharge, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-65, https://doi.org/10.5194/egusphere-gc8-hydro-65, 2023.

P12
|
GC8-Hydro-86
Maria Chiara Sole and Alessandro Lotti

The importance of mapping Research Infrastructures (RIs) is widely recognized, there are numerous and diverse observatories and research infrastructures dealing with water challenges, both at local, national and European levels.

With the aim to create a network, since 2013, a lot of work has already been done within the Water JPI, an intergovernmental initiative whose mission is to strengthen water RDI collaboration amongst Member States in order to spur Europe’s leadership and competitiveness in the water sector. 

A Water JPI Infrastructures Platform was developed with the aim to support and facilitate the dissemination of information, assessing the existing RIs, to promote active collaboration among institutions and to provide access to world-leading research infrastructures that will enable excellent interdisciplinary research in water topic.

Today, the new European Partnership Water4All - Water Security for the Planet, is making great efforts to continue pursuing this goal: a first mapping report has been produced for assessing possible gaps and identifying synergies between existing structures. One of the main objectives of this mapping is the facilitation of sharing and accessing large-scale and long-term environmental data, in order to cooperate with relevant EU and national actors for developing observation data, their distribution and services for broadening implementation and to enhance the European observing capacity and predicting capabilities of the water cycle at global, regional and basin scales and its impacts on ecosystems.

In parallel, Water4All is developing a platform and a toolbox for water related data by integrating various existing databases and data collected and developed in the research projects funded by Water4All. The objective is to manage water related data thus providing a platform for a more efficient use of the information collected in Water4All following the FAIR (Findability, Accessibility, Interoperability and Reuse) principles.

How to cite: Sole, M. C. and Lotti, A.: The importance of networks of observatories and research infrastructures in hydrological data collection: the example of Water JPI and Water4All, A European vision for hydrological observations and experimentation, Naples, Italy, 12–15 Jun 2023, GC8-Hydro-86, https://doi.org/10.5194/egusphere-gc8-hydro-86, 2023.