EGU21-16108
https://doi.org/10.5194/egusphere-egu21-16108
EGU General Assembly 2021
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

On the data synergy effect of large-sample multi-physics catchment modeling with machine learning

Chaopeng Shen, Farshid Rahmani, Kuai Fang, Zhi Wei, and Wen-Ping Tsai
Chaopeng Shen et al.
  • Pennsylvania State University, Civil and Environmental Engineering, University Park, United States of America (cshen@engr.psu.edu)

Watersheds in the world are often perceived as being unique from each other, requiring customized study for each basin. Models uniquely built for each watershed, in general, cannot be leveraged for other watersheds. It is also a customary practice in hydrology and related geoscientific disciplines to divide the whole domain into multiple regimes and study each region separately, in an approach sometimes called regionalization or stratification. However, in the era of big-data machine learning, models can learn across regions and identify commonalities and differences. In this presentation, we first show that machine learning can derive highly functional continental-scale models for streamflow, evapotranspiration, and water quality variables. Next, through two hydrologic examples (soil moisture and streamflow), we argue that unification can often significantly outperform stratification, and systematically examine an effect we call data synergy, where the results of the DL models improved when data were pooled together from characteristically different regions and variables. In fact, the performance of the DL models benefited from some diversity in training data even with similar data quantity. However, allowing heterogeneous training data makes eligible much larger training datasets, which is an inherent advantage of DL. We also share our recent developments in advancing hydrologic deep learning and machine learning driven parameterization.

How to cite: Shen, C., Rahmani, F., Fang, K., Wei, Z., and Tsai, W.-P.: On the data synergy effect of large-sample multi-physics catchment modeling with machine learning, EGU General Assembly 2021, online, 19–30 Apr 2021, EGU21-16108, https://doi.org/10.5194/egusphere-egu21-16108, 2021.