Large-sample hydrologic models poorly simulate interannual variability in seasonal catchments, despite high Nash-Sutcliffe and Kling-Gupta Efficiencies

Sacha Ruzzante; Wouter Knoben; Thorsten Wagener; Tom Gleeson; Markus Schnorbus

doi:https://doi.org/10.5194/egusphere-egu26-932

EGU26-932, updated on 13 Mar 2026

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Sacha Ruzzante¹, Wouter Knoben², Thorsten Wagener³, Tom Gleeson⁴, and Markus Schnorbus⁵

¹Department of Civil Engineering, University of Victoria, Canada (sruzzante@uvic.ca)
²Department of Civil Engineering, University of Calgary, Canada
³Institute for Environmental Science and Geography, University of Potsdam, Germany
⁴Department of Civil Engineering & Earth and Ocean Sciences, University of Victoria, Canada
⁵Pacific Climate Impacts Consortium, University of Victoria, Canada

Variability in river flow can be understood as the sum of irregular, seasonal, and interannual variance components. Skillful simulation of irregular variance is needed to accurately predict short-duration events such as floods, while skillful simulation of interannual variance is required to accurately predict long-term change and long-duration droughts. However, popular performance metrics such as the Nash-Sutcliffe Efficiency (NSE) and Kling-Gupta Efficiency (KGE) do not distinguish between these three variance components. We analyse streamflow simulations from 18 process-based, machine learning, and hybrid hydrologic models from around the globe (22,089 simulated time series in total) to investigate how well large-sample hydrologic models represent each variance component. We find that in highly seasonal (tropical, alpine, and polar) catchments these models achieve very high NSE and KGE values but produce worse-than-average simulations of interannual and irregular variance. Year-to-year variability in streamflow extremes and monthly mean flows is consistently simulated more poorly in highly seasonal catchments than in less seasonal catchments. This suggests that these hydrologic models have limited skill in predicting long-term responses to climate change in alpine, polar, and tropical regions, which are among the regimes most vulnerable to climate change. There is a need to rethink the value of efficiency scores such as NSE and KGE in large-domain model evaluation, and to complement such approaches with more detailed and more process-based investigations of model performance.
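
The point that an aggregate score can hide poor interannual skill can be illustrated with a small synthetic sketch. The abstract does not state how the authors decompose the series or score each component, so everything below is an assumption made for illustration: monthly flows are split additively into a mean calendar-month cycle (seasonal), annual-mean anomalies (interannual), and a residual (irregular), and each component is scored with NSE. The function names `decompose`, `nse`, and `kge` and all data values are hypothetical; `kge` uses the standard correlation, variability-ratio, and bias-ratio form.

```python
import numpy as np


def decompose(q):
    """Split a (n_years, 12) array of monthly flows into additive components.

    Assumed decomposition (not taken from the abstract):
    seasonal    = mean calendar-month cycle minus the overall mean
    interannual = annual mean of each year minus the overall mean
    irregular   = whatever remains
    """
    mean = q.mean()
    seasonal = np.broadcast_to(q.mean(axis=0) - mean, q.shape)
    interannual = np.broadcast_to(q.mean(axis=1, keepdims=True) - mean, q.shape)
    irregular = q - mean - seasonal - interannual
    return seasonal, interannual, irregular


def nse(obs, sim):
    """Nash-Sutcliffe Efficiency: 1 minus error variance over observed variance."""
    return 1.0 - np.sum((obs - sim) ** 2) / np.sum((obs - obs.mean()) ** 2)


def kge(obs, sim):
    """Kling-Gupta Efficiency from correlation, variability ratio and bias ratio."""
    r = np.corrcoef(obs.ravel(), sim.ravel())[0, 1]
    alpha = sim.std() / obs.std()
    beta = sim.mean() / obs.mean()
    return 1.0 - np.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)


# Synthetic, strongly seasonal "observations" (all values are made up).
rng = np.random.default_rng(0)
n_years = 30
cycle = 12.0 + 8.0 * np.sin(2.0 * np.pi * np.arange(12) / 12.0)
year_effect = rng.normal(0.0, 1.0, size=(n_years, 1))   # wet and dry years
noise = rng.normal(0.0, 0.5, size=(n_years, 12))        # month-to-month noise
obs = cycle + year_effect + noise

# A "model" that reproduces only the seasonal cycle, identical every year.
sim = np.tile(cycle, (n_years, 1))

obs_seas, obs_inter, obs_irr = decompose(obs)
sim_seas, sim_inter, sim_irr = decompose(sim)

print(f"overall NSE:     {nse(obs, sim):.3f}")
print(f"overall KGE:     {kge(obs, sim):.3f}")
print(f"interannual NSE: {nse(obs_inter, sim_inter):.3f}")
print(f"variance shares (seasonal, interannual, irregular): "
      f"{obs_seas.var() / obs.var():.3f}, "
      f"{obs_inter.var() / obs.var():.3f}, "
      f"{obs_irr.var() / obs.var():.3f}")
```

In this synthetic case the climatology-only simulation scores highly on both NSE and KGE because the seasonal cycle dominates the total variance, while its interannual-component NSE is approximately zero because it carries no year-to-year signal. With complete years of monthly data, the three components are mutually orthogonal, so their variances sum to the total variance.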

How to cite: Ruzzante, S., Knoben, W., Wagener, T., Gleeson, T., and Schnorbus, M.: Large-sample hydrologic models poorly simulate interannual variability in seasonal catchments, despite high Nash-Sutcliffe and Kling-Gupta Efficiencies, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-932, https://doi.org/10.5194/egusphere-egu26-932, 2026.

This contribution takes part in the OSPP contest.

Supplementary materials

Supplementary materials version 2, uploaded on 03 May 2026.