So many catchments, so many choices: challenges with large-sample hydrological datasets &nbsp;

Franziska Clerc-Schwarzenbach; Marc Vis; Ilja van Meerveld; Jan Seibert

doi:https://doi.org/10.5194/egusphere-egu26-11939

[Back] [Session HS2.4.2]

EGU26-11939, updated on 14 Mar 2026

https://doi.org/10.5194/egusphere-egu26-11939

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

So many catchments, so many choices: challenges with large-sample hydrological datasets

Franziska Clerc-Schwarzenbach, Marc Vis, Ilja van Meerveld, and Jan Seibert

Franziska Clerc-Schwarzenbach et al.

University of Zurich, Department of Geography, Zurich, Switzerland (franziska.clerc@geo.uzh.ch)

The field of large-sample hydrology is developing at a rapid pace. The increasing availability of hydrometeorological data and information on catchment attributes is primarily thanks to the authors of the various CAMELS datasets and similar datasets, who have put tremendous efforts into creating these resources. This progress has enabled the use of datasets from various regions to conduct large-sample studies. For some regions, large-sample datasets are becoming available at sub-daily resolutions, which will further expand the possibilities for studies in large-sample hydrology.

Many of these datasets offer multiple time series for a certain variable: for example, precipitation time series originating from different sources, or potential evapotranspiration time series calculated using different equations. Furthermore, there is a considerable number of catchments that are represented in multiple large-sample datasets, either because there is more than one dataset for a particular country (which is the case for Brazil) or because they are included in overarching datasets such as Caravan or EStreams. While this wealth of data is a real treasure, it also poses significant challenges to users of large-sample hydrological data because decisions need to be taken on what data to use (and for what reason). Furthermore, questions on the reliability of the different data are inevitable. Many users end up doing individual data checks on their own or taking more or less random decisions. This reduces the comparability between different studies.

In this contribution, we present examples of the challenges that arise when working with large-sample hydrological data. We show the results of comparisons between different data sources that are meant to represent the same variable and how these affect model simulations. The presentation aims to stimulate discussion about a more uniform approach to making decisions on which data to use when working with large-sample hydrological datasets.

How to cite: Clerc-Schwarzenbach, F., Vis, M., van Meerveld, I., and Seibert, J.: So many catchments, so many choices: challenges with large-sample hydrological datasets , EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-11939, https://doi.org/10.5194/egusphere-egu26-11939, 2026.