EGU26-5814, updated on 13 Mar 2026
https://doi.org/10.5194/egusphere-egu26-5814
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Tuesday, 05 May, 15:15–15:25 (CEST)
 
Room 3.29/30
Machine Learning and Citizen Science for Catchment-Scale Water-Quality Assessment
Xinyu Liu1, Heri Chisute2, Fred Nyongesa3, Rochi Mkole4, Stuart Warner5, Gerogio Emmanuel Nader1, Amedeo Boldrini1, Alessio Polvani1, Riccardo Gaetano Cirrone1,6, Luisa Galgani1,7, and Steven Arthur Loiselle1,7,8
Xinyu Liu et al.
  • 1university of siena, Department of Chemistry and Pharmaceutical science, siena, Italy (x.liu@student.unisi.it)
  • 2Ministry of Water, Dodoma, Tanzania
  • 3Water Resoruces Agency, Nairobi Kenya
  • 4Ministry of Water, Musoma, Tanzania
  • 5Global Environment Monitoring Unit, United Nations Environment Programme (UNEP), Nairobi, Kenya
  • 6Department of Earth and Marine Sciences, University of Palermo, Via Archirafi, 22 90123 Palermo -Italy
  • 7Center for Colloids and Surface Science (CSGI)-Siena Research Group, University of Florence, Via della Lastruccia 3, 50019 Firenze, Italy
  • 8Earthwatch Europe, 102-104 St Aldate's, Oxford, OX1 1BT, United Kingdom

Assessing and managing water quality in data-scarce tropical basins remains challenging due to rapid land-use change, intensifying human pressures, and increasing climate variability. In the transboundary Mara River Basin (MRB), these factors strongly influence sediment and nutrient dynamics, yet traditional monitoring networks lack the temporal and spatial resolution needed to characterize pollution sources, transport pathways, and event-driven responses. To address these gaps, this study integrates citizen-science observations with satellite-derived hydro-climatic and land-use variables to model turbidity (NTU), nitrate (NO₃), and phosphate (PO₄) across 40 sub-basins. Two machine-learning approaches: Random Forests (RF) and Artificial Neural Networks (ANN) were employed to evaluate water-quality variability and identify dominant drivers under heterogeneous environmental conditions. RF outperformed ANN across all indicators, providing more robust predictions under noisy and nonlinear data constraints. SHAP analyses revealed that precipitation and river flow velocity dominate short-term, event-based fluctuations of turbidity, while population density represents persistent drivers of NO₃ concentration. These findings highlight the basin’s sensitivity to climate-driven changes in rainfall intensity and seasonality and demonstrate how hybrid monitoring–modelling frameworks can enhance the identification of nutrient hotspots, improve source attribution, and support adaptive water-quality management under land-use and climate-change scenarios.

How to cite: Liu, X., Chisute, H., Nyongesa, F., Mkole, R., Warner, S., Nader, G. E., Boldrini, A., Polvani, A., Cirrone, R. G., Galgani, L., and Loiselle, S. A.: Machine Learning and Citizen Science for Catchment-Scale Water-Quality Assessment, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-5814, https://doi.org/10.5194/egusphere-egu26-5814, 2026.