Building saturated hydraulic conductivity maps with machine learning and geostatistics
- Geological Survey of Spain (IGME-CSIC), Madrid, Spain
Hydraulic conductivity (Ks) is one of the most challenging, time-consuming, and expensive soil hydraulic properties to estimate. Pedotransfer functions (PTFs) of general use for Ks estimation are often site and sample-scale specific and perform poorly when extrapolated to different regions and extents. The present work develops a stepwise methodology for topsoil Ks mapping at a catchment scale based on easy, fast, and inexpensive measurements and auxiliary data (Aguilera et al., 2022). It includes a double-scale sampling of the Ks to account for small-scale variability in the spatial geostatistical interpolation. A supervised selection of variables through correlation analysis and hierarchical clustering of variables precedes the development of site-specific PTFs with machine learning (ML) techniques. The ML model is then used to generate new Ks point data predictions to extend the spatial coverage for mapping. Finally, the consistency of the final Ks map is assessed in terms of a geomorphological base map.
The variable selection process filtered out four predictor variables from the initial pool of fourteen predictors. An artificial neural network (ANN) provided the best Ks prediction model with one hidden layer and six input variables (latitude, longitude, silt, clay, medium sand, and land use). Latitude and longitude coordinates and land use are surrogates for other physical and environmental (e.g., anthropic) factors. The relative importance of input variables in the ANN was determined as the sum of the product of raw input-hidden, hidden-output connection weights across all hidden neurons using Olden's algorithm. Longitude, percentage of clay, and percentage of medium sand presented a stronger positive relationship with Ks, while irrigated and dry land uses together with the percent of silt were the variables with a more significant negative influence on Ks. The fact that Ks was positively related to clay content is surprising, and it appears to be related to soil plowing before sampling.
The ANN was used to estimate new Ks values from a subsequent sampling of model covariates, which doubled the input information for spatial interpolation using ordinary kriging. Overall, the spatial distribution of Ks was consistent with the lithological variability and other superimposed anthropic factors, as the method adequately considers the spatial variability of Ks added by anthropization to the already high natural heterogeneity. The produced maps will help in the hydraulic planning and flood risk management in the study area where high and low Ks, respectively, are clearly outlined.
Reference:
- Aguilera, C. Guardiola-Albert, L. Moreno Merino, C. Baquedano, E. Díaz-Losada, P. Agustín Robledo Ardila, J.J. Durán Valsero, Building inexpensive topsoil saturated hydraulic conductivity maps for land planning based on machine learning and geostatistics, CATENA, Volume 208, 2022, 105788, https://doi.org/10.1016/j.catena.2021.105788.
This work is performed within the framework of the RESERVOIR project, part of the PRIMA Programme supported by the European Union. The PRIMA programme is supported under Horizon 2020 the European Union's Framework Programme for Research and Innovation. PRIMA RESERVOIR Grant Agreement number is 1924.
How to cite: Aguilera, H., Guardiola-Albert, C., Moreno Merino, L., Baquedano, C., Díaz-Losada, E., Robledo Ardila, P. A., de la Losa, A., and Durán Valsero, J. J.: Building saturated hydraulic conductivity maps with machine learning and geostatistics, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-4032, https://doi.org/10.5194/egusphere-egu22-4032, 2022.