EGU26-14583, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-14583
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Friday, 08 May, 09:45–09:55 (CEST)
 
Room D1
Field-scale digital soil mapping in the era of tabular foundation models
Viacheslav Barkov1,2, Jonas Schmidinger1,2, Robin Gebbers2, and Martin Atzmueller1,3
Viacheslav Barkov et al.
  • 1Joint Lab Artificial Intelligence and Data Science, Osnabrück University, Osnabrück, Germany
  • 2Department of Agromechatronics, Leibniz Institute for Agricultural Engineering and Bioeconomy (ATB), Potsdam, Germany
  • 3Research Department Cooperative and Autonomous Systems (CAS), German Research Center for Artificial Intelligence (DFKI), Osnabrück, Germany

Digital soil mapping at the field-scale faces a fundamental challenge of building accurate predictive models from small, high-dimensional tabular datasets where training sample sizes are limited by cost and labor constraints. Traditional machine learning methods like Random Forest have long dominated pedometrics, but recent advances in artificial neural network architectures challenge this status. To investigate this, we develop a comprehensive evaluation framework built upon LimeSoDa, our diverse and fully open-access collection of field-scale digital soil mapping datasets. This allows us to assess the application of modern neural networks in pedometrics under realistic conditions of data scarcity. Our results demonstrate that contemporary architectures consistently outperform classical methods when coupled with specific methodological enhancements that address training instability. In-context learning tabular foundation models, such as TabPFN, show particular promise and surpass established baselines even on very small datasets. We go further and investigate the application of tabular foundation models on datasets with unfavorable feature-to-sample ratios typical in soil spectroscopy. Building upon principal component analysis and partial least squares, we propose hybrid strategies that effectively address the challenges posed by soil spectroscopy datasets. Going beyond purely tabular regression modeling, we extend our framework to incorporate spatial information through Kriging prior Regression, integrating geostatistical features into tabular machine learning predictions and further improving accuracy when sensor data alone provide limited information. Our findings establish a new baseline for field-scale digital soil mapping and offer methodological insights applicable to any precision agriculture domain constrained by small tabular datasets.

How to cite: Barkov, V., Schmidinger, J., Gebbers, R., and Atzmueller, M.: Field-scale digital soil mapping in the era of tabular foundation models, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-14583, https://doi.org/10.5194/egusphere-egu26-14583, 2026.