EGU26-16578, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-16578
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Tuesday, 05 May, 08:30–10:15 (CEST), Display time Tuesday, 05 May, 08:30–12:30
 
Hall X3, X3.108
Soil salinization prediction for heterogeneous environments: an environmental similarity–based modeling framework
Xiaolin She3, Amaury Frankl2,3, and Geping Luo1,3
Xiaolin She et al.
  • 1Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, China (xiaolin.she@UGent.be; luogp@ms.xjb.ac.cn)
  • 2Department of Geography, Gent, Belgium (amaury.frankl@ugent.be)
  • 3Sino-Belgian Joint Laboratory of Geo-information, Urumqi, China

Soil salinization dynamics are driven by complex interactions among climatic conditions, hydrological processes, and anthropogenic activities. Due to this complexity, traditional single global models often struggle to capture spatial heterogeneity, leading to high prediction uncertainty and limited robustness at the pixel scale.

To address these challenges, this study proposes a multi-source data-driven framework based on environmental similarity matching to enhance prediction adaptability in heterogeneous environments. We compiled a dataset of approximately 35,000 topsoil samples from arid and semi-arid regions and constructed a multidimensional covariate system grounded in soil-forming factor theory. The framework comprises three components: (1) heterogeneity-based stratification, partitioning samples by climate and land use; (2) model library construction, developing candidate machine learning ensembles within each stratum via repeated cross-validation; and (3) similarity-based prediction, which employs Gower distance to quantify environmental similarity between target locations and training samples to select the optimal model.

Evaluations indicate that the Random Forest algorithm exhibits robust stability across stratified regions. Compared to single models, the environment similarity–constrained selection strategy significantly improved performance in heterogeneous regions; notably, the coefficient of determination (R2) in arid cropland areas increased from 0.748 to 0.807. Feature contribution analysis supports the necessity of stratified modeling, revealing that soil salinity in arid regions is primarily driven by vegetation variables and geographic, whereas remote sensing indices and soil pH dominate in semi-humid regions. The methodological framework developed in this study provides a new approach for high-precision soil salinity mapping.

KEYWORDS: Soil salinization; Environmental similarity; Heterogeneous environments; Machine learning.

How to cite: She, X., Frankl, A., and Luo, G.: Soil salinization prediction for heterogeneous environments: an environmental similarity–based modeling framework, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-16578, https://doi.org/10.5194/egusphere-egu26-16578, 2026.