EGU25-4425, updated on 14 Mar 2025
https://doi.org/10.5194/egusphere-egu25-4425
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Monday, 28 Apr, 14:45–14:55 (CEST)
 
Room B
Predicting saturated hydraulic conductivity from particle size distributions using machine learning
Alraune Zech1, Valerie de Rijk1, Jelle Buma2, and Hans Veldkamp2
Alraune Zech et al.
  • 1Utrecht University, Geoscience, Earth Science, Utrecht, Netherlands (a.zech@uu.nl)
  • 2TNO, Princetonlaan 6, 3584CB Utrecht, Netherlands

Estimating saturated hydraulic conductivity Kf from particle size distributions (PSD) is very common with empirical formulas, while the use of machine learning for that purpose is not yet widely established. We evaluate the predictive power of six machine learning algorithms, including tree-based, regression-based and network-based methods in estimating Kf from the PSD solely. We use a dataset of 4600 samples from the shallow Dutch subsurface for training and testing. The extensive dataset provides not only PSD, but also measured conductivities from permeameter tests. Besides training and testing on the entire data set, we apply the six algorithms to data subsets for the soil types sand, silt and clay. We further test different feature/target-variable combinations such as reducing the input to PSD-derived characteristic grain diameters d10 , d50 and d60 or estimating porosity from PSD. We test feature importance and compare results to Kf estimates from a selection of empirical formulas. We find that all algorithm can estimate Kf from PSD at high accuracy (up to R2/NSE of 0.89 for testing data and 0.98 for the entire data set) and outperform empirical formulas. Particularly, tree-based algorithms are well suited and robust. Reducing information in the feature variables to grain diameters works well for predicting Kf of sandy samples, but is less robust for silt and clay rich samples. d10 also shows to be the most influential feature here. An interesting, but not surprising outcome is that PSD is not a suitable predictor for porosity. Overall, our results confirm that machine learning algorithms are a powerful tool for determining Kf from PSD. This is promising for applications to e.g. deep-drilling data sets or low-effort and robust Kf -estimation of single samples.

How to cite: Zech, A., de Rijk, V., Buma, J., and Veldkamp, H.: Predicting saturated hydraulic conductivity from particle size distributions using machine learning, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-4425, https://doi.org/10.5194/egusphere-egu25-4425, 2025.