Sensitivity of machine-learning crop-type mapping to feature selection and hyper-parameter tuning.

Mayra Perez; Frédéric Satgé; Jorge Molina; Renaud Hostache; Ramiro Pillco; Elvis Uscamayta; Diego Tola; Lautaro Bustillos; Celine Duwig

doi:https://doi.org/10.5194/egusphere-egu26-928

[Back] [Session HS3.6]

EGU26-928, updated on 13 Mar 2026

https://doi.org/10.5194/egusphere-egu26-928

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Sensitivity of machine-learning crop-type mapping to feature selection and hyper-parameter tuning.

Mayra Perez^1,3, Frédéric Satgé², Jorge Molina³, Renaud Hostache², Ramiro Pillco³, Elvis Uscamayta⁵, Diego Tola⁴, Lautaro Bustillos³, and Celine Duwig¹

Mayra Perez et al.

¹IRD, IHH, ESPACE-DEV, Univ Montpellier, UGA, Univ Grenoble Alpes, IHH, Univ Mayor de San Andres, La Paz, Bolivia (mayraperez5197@gmail.com)
²ESPACE-DEV, University Montpellier, IRD, University Antilles, University Guyane, University Réu-nion, 34093 Montpellier, France
³IHH Instituto de Hidráulica e Hidrología, UMSA Universidad Mayor de San Andrés, La Paz, Bolivia
⁴Programa de Doctorado en Recursos Hídricos (PDRH), Universidad Nacional Agraria La Molina, Lima 15024, Peru
⁵Graduate Program in Geology and in Applied Geosciences and Geodynamics, Geoscience Institute, University of Brasilia, Asa Norte, Brasilia 70910-900, DF, Brazil

To improve crop yields and economic incomes, farmers consistently adapt their practices to climate and market fluctuations, resulting in highly variable crop field distribution and coverage in space and time. As these dynamics ilustrate, up-to-date crop-type mapping is essential to understand farmers’ needs and supporting them in adopting sustainable practices. With global coverage and frequent temporal observations, remote sensing data are generally integrated in machine learning models to monitor crop-type mapping dynamics. Unlike physical-based models that rely on straightforward use, the implementation of machine-learning approaches depends on deep interaction with users. In this context, the study assesses the output sensitivity of these models to features selection and hyper-parameter calibration, both of wich rely on user consideration. To do so, Sentinel-1 (S1) and Sentinel-2 (S2) features are integrated into five distinct models (RF, SVM, LGB, HGB, XGB), considering different features selection (VIF and SFS) and hyper-parameter calibration set-up. Results show that pre-process modeling VIF feature selection discards features that wrapped SFS feature selection keeps, resulting in less reliable crop-type mapping compared to using SFS. Additionally, hyper-parameter calibration appears to be sensitive to the input feature and its consideration after any the feature selection improved the crop-type mapping. In this context a three-step nested modelling set-up including a first hyper-parameters calibration followed by a wrapped feature selection (SFS) and another hyper-parameter calibration, lead to the most reliable model outputs. Across the considered region, LGB and XGB (SVM) are the most (less) suitable model for crop-type mapping and models reliability improved when integrated S1 and S2 features rather than the consideration of S1 or S2 alone. Finally, crop-type maps are derived across different regions and periods to highlight the benefits of the proposed method to monitor crops’ dynamics in space and time.

How to cite: Perez, M., Satgé, F., Molina, J., Hostache, R., Pillco, R., Uscamayta, E., Tola, D., Bustillos, L., and Duwig, C.: Sensitivity of machine-learning crop-type mapping to feature selection and hyper-parameter tuning., EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-928, https://doi.org/10.5194/egusphere-egu26-928, 2026.