EGU26-21949, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-21949
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Monday, 04 May, 14:45–14:55 (CEST)
 
Room 2.44
Bridging prediction and causal attribution in large-scale river water-quality networks
Ke Yu1,2, Ziyue Li2, and Shen Qu1
Ke Yu et al.
  • 1Center for Energy and Environmental Policy Research, Beijing Institute of Technology, Beijing, China
  • 2Heilbronn Data Science Center, Technical University of Munich, Heilbronn, Germany

Water-quality forecasting and attribution at large spatial scales remain challenging because observations are sparse and heterogeneous, monitoring networks are nonstationary, and river systems impose strong directional and time-delayed connectivity. These constraints are further complicated by abrupt pollution shocks that cannot be explained by routine upstream–downstream propagation, yet are critical for risk-aware water management. Consequently, existing approaches often face a trade-off between predictive accuracy and interpretability: data-driven models capture complex spatiotemporal patterns but provide limited insight into underlying drivers, while causal analyses offer mechanistic understanding but are difficult to operationalize for real-time, multi-step forecasting across large monitoring networks.

Here we develop a physically constrained, topology-aware causal forecasting framework that unifies large-scale water-quality prediction with driver attribution in river networks. The framework explicitly represents three defining characteristics of fluvial systems: unidirectional upstream-to-downstream transport, travel-time-dependent propagation delays, and dynamic monitoring configurations in which stations appear or disappear over time. By embedding physical flow constraints into a data-driven causal representation, the framework jointly learns evolving spatiotemporal dependencies while remaining robust to extreme data sparsity and uneven sampling typical of water-quality observations.

We apply the framework to China’s national surface-water monitoring network, comprising more than 1,900 stations and multiple water-quality indicators, together with hydro-meteorological covariates. The framework achieves strong multi-step predictive skill across the full network under realistic data gaps, while providing an interpretable decomposition of dynamics into local persistence, upstream propagation, and externally driven disturbances. This decomposition enables real-time identification of dominant drivers of water-quality change at individual stations, distinguishes systemic trends from short-lived pollution shocks, and localizes influential upstream contributors consistent with river-network connectivity. Beyond forecasting, the framework supports event-oriented causal diagnostics and prioritization of high-information monitoring locations, helping to optimize sampling strategies and enhance early-warning capability under limited monitoring resources.

Our results demonstrate that physically constrained causal forecasting can bridge the long-standing divide between prediction and explanation in water-quality modelling at scale. Crucially, the framework remains operational under nonstationary station availability, enabling consistent forecasting and attribution as monitoring configurations evolve. By integrating topology-aware learning with interpretable attribution, the proposed framework establishes a coherent pathway from forecasting to diagnosis and source localization, supporting proactive, data-informed water-quality management and rapid response to pollution events in complex river networks.

How to cite: Yu, K., Li, Z., and Qu, S.: Bridging prediction and causal attribution in large-scale river water-quality networks, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-21949, https://doi.org/10.5194/egusphere-egu26-21949, 2026.