EGU23-16985
https://doi.org/10.5194/egusphere-egu23-16985
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Predicting bacterial transport through saturated porous media using an automated machine learning model 

Fengxian Chen1, Bin Zhou2, Liqiong Yang1, Xijuan Chen1, and Jie Zhuang3
Fengxian Chen et al.
  • 1Key Laboratory of Pollution Ecology and Environmental Engineering, Institute of Applied Ecology, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China
  • 2Chair of model-based environmental exposure science, Faculty of Medicine, University of Augsburg, Augsburg 86159, Germany
  • 3Department of Biosystems Engineering and Soil Science, Center for Environmental Biotechnology, The University of Tennessee, Knoxville, Tennessee 37996, United States

Escherichia coli, as an indicator of fecal contamination, can move from manure-amended soil to groundwater under rainfall or irrigation events. Predicting its vertical transport in the subsurface is essential for the development of engineering solutions to reduce the risk of microbiological contamination. In this study, we collected 302 datasets from 39 published papers addressing E. coli transport through saturated porous media and trained an automated machine learning model (H2O AutoML) to predict bacterial transport. Bacterial concentration, porous medium type, particle size, ionic strength, pore water velocity, and column length were used as input variables while the first-order attachment coefficient and spatial removal rate were set as target variables. The six input variables have low correlations with the target variables, namely, they cannot predict target variables independently. However, with the automated machine learning model, input variables can effectively predict the target variables. Among 20 candidate models, Gradient Boosting Machine showed the best performance. Among the six input variables, pore water velocity, ionic strength, particle size, and column length were more important than bacterial concentration and porous medium type. This method of using historical literature data to train automated machine learning models provides a new avenue for predicting the transport of other contaminants in the environment.

How to cite: Chen, F., Zhou, B., Yang, L., Chen, X., and Zhuang, J.: Predicting bacterial transport through saturated porous media using an automated machine learning model , EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-16985, https://doi.org/10.5194/egusphere-egu23-16985, 2023.