EGU26-16358, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-16358
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Tuesday, 05 May, 08:30–10:15 (CEST), Display time Tuesday, 05 May, 08:30–12:30
 
Hall X3, X3.135
Dimensionality reduction of soil vis–NIR spectra: implications for soil health assessment and mapping
Sarem Norouzi1, Lis Wollesen de Jonge1, Per Moldrup2, Mogens Humlekrog Greve1, and Sebastian Gutierrez1
Sarem Norouzi et al.
  • 1Aarhus University, Department of Agroecology, Agroecology, Tjele, Denmark (sarem.nrz@gmail.com)
  • 2Department of the Built Environment, Aalborg University, Aalborg, Denmark

The measured soil spectrum in the visible and near-infrared (vis–NIR) range contains information on various soil physical properties, as well as mineralogical and chemical composition. Therefore, this approach can serve as a rapid and cost-effective alternative for assessing soil health. However, the raw soil spectrum is high-dimensional and less suitable for modeling purposes. In this study, we compared the performance of four methods, including PCA, kernel PCA (KPCA), autoencoders (AE), and convolutional autoencoders (CNN-AE), based on their reconstruction error, which directly measures information loss during dimensionality reduction. We used a comprehensive soil dataset from Denmark comprising 7,009 vis–NIR spectra covering a wide range of land uses and soil types. Our analysis shows that reconstruction error decreases as the number of latent variables (LVs) increases, as expected, due to the greater capacity to preserve information. Notably, across all latent dimensionalities, nonlinear methods consistently outperformed PCA. At low latent dimensionalities (5–10 LVs), nonlinear methods achieved approximately 45–55% lower reconstruction error than PCA. As the number of LVs increases, this performance gap decreases. However, PCA still consistently shows lower performance than the other methods. Given the strong representation of soil chemical, physical, and hydrological properties in the spectra, we mapped the latent variables across Denmark using diverse spatial predictors. The mapped latent variables captured complex, nonlinear soil–landscape relationships that reflect key mineral and organic soil components. Overall, our results suggest that spectral representations could serve as a scalable approach to support national soil mapping.

How to cite: Norouzi, S., de Jonge, L. W., Moldrup, P., Greve, M. H., and Gutierrez, S.: Dimensionality reduction of soil vis–NIR spectra: implications for soil health assessment and mapping, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-16358, https://doi.org/10.5194/egusphere-egu26-16358, 2026.