- 1Indian Institute of Technology, Kharagpur, Agricultural and Food Engineering, Kharagpur, India (singhrachna239662@kgpian.iitkgp.ac.in)
- 2School of Earth, Environment and Sustainability, Statesboro, GA, USA
Soil Organic Carbon (SOC) is a critical indicator of soil health, yet conventional laboratory-based estimation methods remain costly, time-consuming, and environmentally burdensome. This study evaluates a rapid, low-cost, and environmentally friendly alternative for SOC estimation using high-resolution color information acquired from a Nix Spectro 2 handheld sensor, integrated with machine learning and generative data augmentation approaches. A total of 641 soil samples collected across diverse agro-ecological regions of West Bengal, India, were analyzed using Random Forest, Gradient Boosting, XGBoost, and Artificial Neural Network models. To address data imbalance and limited sample representation at higher SOC ranges, synthetic datasets were generated using Gaussian Mixture Models (GMM), Generative Adversarial Networks (GAN), k-nearest neighbors–based augmentation, and bootstrapping techniques. Among the evaluated models, Random Forest achieved the best baseline performance (R² = 0.71), which further improved with GMM-based data augmentation (R² = 0.77). The results demonstrate the strong potential of combining handheld color sensing with generative artificial intelligence to develop more accurate, robust, and scalable SOC prediction frameworks.
How to cite: Singh, R., Chakraborty, S., and Weindorf, D. C.: From Soil Color to Carbon: A Generative AI and Nix Sensor Approach, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-2786, https://doi.org/10.5194/egusphere-egu26-2786, 2026.