EGU26-16339, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-16339
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Friday, 08 May, 14:50–15:00 (CEST)
 
Room D1
Development of a Soil pH Prediction Model for South Korean Agricultural Fields Using Digital Soil Mapping
Eunjin Lee, Myung-Sook Kim, Ha-il Jung, and Tae-Goo Lee
Eunjin Lee et al.
  • National Institute of Agricultural Sciences, Division of Soil and Water Environment, Korea, Republic of (eunjin0219@korea.kr)

Agricultural soils in South Korea are prone to acidification due to the leaching of exchangeable bases caused by intensive rainfall. To mitigate this issue, the government subsidizes lime fertilizers based on soil test data. However, for untested fields lacking analytical data, a standardized application rate is uniformly applied, which limits the precision of soil management. This study aims to develop an optimal prediction model and establish environmental and geographical covariates for the precise estimation of soil pH in untested fields using Machine Learning (ML)  techniques. A dataset comprising 203,941 soil pH measurements collected between 2022 and 2024 was utilized for model training. Based on the SCORPAN framework, we constructed environmental and geographical covariates across a grid covering South Korea, incorporating Soil (s), Climate (c), Organisms (o), Relief (r), Parent material (p), and Spatial position (n) factors. The Random Forest (RF) algorithm was employed as the primary prediction model. Furthermore, to account for spatial autocorrelation, a hybrid model combining RF with Ordinary Kriging (OK) of the prediction residuals (RF+OK) was developed, and its performance was compared with the standalone RF model. Variable importance analysis indicated that geographical variables (distance information) had the most significant influence on pH prediction, followed by organism variables (NDVI and crop cultivation density) and climatic variables. In the model performance evaluation, the standalone RF model achieved an R² of 0.509 and an RMSE of 0.550. However, the hybrid RF+OK model demonstrated significantly improved prediction precision, achieving an R² of 0.590 and an RMSE of 0.492. The big data-driven pH prediction model established in this study is expected to eliminate policy blind spots in areas lacking soil test information and enable the precise calculation of lime requirements for sustainable soil management.

How to cite: Lee, E., Kim, M.-S., Jung, H., and Lee, T.-G.: Development of a Soil pH Prediction Model for South Korean Agricultural Fields Using Digital Soil Mapping, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-16339, https://doi.org/10.5194/egusphere-egu26-16339, 2026.