EGU25-18468, updated on 15 Mar 2025
https://doi.org/10.5194/egusphere-egu25-18468
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
An automated machine learning framework for multi-depth soil moisture prediction using hydro-meteorological datasets
Vidhi Singh1, Abhilash Singh2, and Kumar Gaurav1
Vidhi Singh et al.
  • 1Indian Institute of Science Education and Research Bhopal, Earth and Environmental Sciences, Bhopal, India (kgaurav@iiserb.ac.in)
  • 2University of Leeds, UK (a.singh4@leeds.ac.uk)

 Soil moisture, one of the essential climate variables, forms a fundamental bridge between hydro-meteorological processes and influence climate dynamics. It is extremely variable and is driven by numerous hydrological, agricultural and ecological factors. Soil moisture subsequently impacts soil forming processes, root zone water availability, infiltration rates, runoff, groundwater storage and vegetation-soil interaction. Despite its significant contribution in hydro-ecological interaction, its variability at subsurface is not yet explored adequately. Precise estimation of soil moisture at various depths is crucial because it affects water retention characteristics and modulates the vertical and lateral movement of water within the soil profile. This subsurface information is integral to understanding recharge rates, groundwater interactions, and the overall water balance within a catchment. In this study, we present an automated machine learning framework designed to predict soil moisture at multiple depths of 10 cm, 20 cm, 30 cm, and 40 cm leveraging Bayesian optimization. We collected data from our hydrological observatory set up constituting an automatic weather station, a pan evaporimeter and a soil moisture recorder. To evaluate model performance, we categorized the dataset into four scenarios (S1, S2, S3, and S4), with each subsequent scenario incorporating a greater number of observations and rainfall events. We used 11 input features to train this AutoML model by integrating several hydrological and meteorological variables with in-situ soil moisture data. Among the predictor variables, humidity, dew point, and rainfall emerged as the most influential factors driving soil moisture variability. The model was trained to calculate the performance metrices for the entire dataset and for subsets containing only rainfall instances. Our optimized model demonstrated superior performance, with an R² of 0.88–0.99 and RMSE < 0.022 for the overall dataset, and R² of 0.76–1.00 with RMSE < 0.06 for rainfall-specific data across all soil moisture depths.

How to cite: Singh, V., Singh, A., and Gaurav, K.: An automated machine learning framework for multi-depth soil moisture prediction using hydro-meteorological datasets, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-18468, https://doi.org/10.5194/egusphere-egu25-18468, 2025.