EGU22-12914
https://doi.org/10.5194/egusphere-egu22-12914
EGU General Assembly 2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Predicting high-frequency nutrient dynamics in the Danube River from surrogates with sensors and machine-learning

Jingshui Huang, Binh Yen Tran, and Leonardo F. Arias-Rodriguez
Jingshui Huang et al.
  • Chair of Hydrology and River Basin Management, Technical University of Munich, Arcisstrasse 21, 80333 Munich, Germany (jingshui.huang@tum.de)

Nutrient dynamics play an essential role in aquatic ecosystems. Despite advances in sensor technology, nutrient concentrations are difficult and expensive to monitor in-situ and in real-time. Emerging data-driven methods may provide surrogate measures for nutrient concentrations. In this work, 4-year high-frequency (15-min interval) regularly monitored variables and 2 data-driven algorithms are used to build surrogate measures for nitrate, orthophosphate, and ammonium at 2 stations in the German part of the Danube River. The variables used as input futures are dissolved oxygen (DO), temperature (Temp), conductivity (EC), pH, discharge rate (Q), and chlorophyll-a (Chl-a). Multiple linear regression (MLR) and Random Forest Regression (RF) are trained and cross-validated for the concentration predictions of nutrient constituents. Prior to training, pre-processing procedures were implemented, including removing outliers and filling missing values by linear interpolation. This work presented a thorough description of the workflow, including intermediate steps for feature engineering, feature selection, hyper-parameter optimization. The results of the 12 surrogate models (2 algorithms * 3 constituents * 2 stations) are compared. The results show that the RF algorithm can reproduce the environmental phenomena and contribute to water quality management. The RF algorithm already outperformed MLR when adding at least three predictors in this work. The five-fold CV has identified the reliable and stable prediction of the targets NO3--N (R2 = 0.9967 and 0.9992), NH4+-N (R2 = 0.9861 and 0.9927), PO43--P (R2 = 0.9638 and 0.9643).

How to cite: Huang, J., Tran, B. Y., and Arias-Rodriguez, L. F.: Predicting high-frequency nutrient dynamics in the Danube River from surrogates with sensors and machine-learning, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-12914, https://doi.org/10.5194/egusphere-egu22-12914, 2022.