EGU24-16460, updated on 09 Mar 2024
https://doi.org/10.5194/egusphere-egu24-16460
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

Self-supervised learning for mapping rice areas reduces the need for field surveys

Mukund Narayanan, Ankit Sharma, and Idhayachandhiran Ilampooranan
Mukund Narayanan et al.
  • Indian Institute of Technology Roorkee, Indian Institute of Technology Roorkee, Water Resources Development and Management, India (mukund_n@wr.iitr.ac.in)

Traditional approaches to mapping rice areas depend on detailed field surveys to gather label data. This data gathering, while thorough, is often time-consuming, and costly. Further, for long-term rice area mapping, especially at large scales, ground validation can only be made in recent years, which may lead to uncertainties in rice areas of the past. To solve this limitation, this study introduces a novel method for classifying rice cultivation areas without the need for field data by employing a self-supervised learning framework within Google Earth Engine. Using MODIS data to calculate indices such as the Normalized Difference Vegetation Index (NDVI), the Enhance Vegetation Index (EVI), and the Land and Surface Water Index (LSWI), a pseudo-label boundary was delineated. The delineation of the boundary involved marking flooded regions, where LSWI + 0.05 ≥ NDVI EVI. Within these flooded regions, instances when EVI in the first 40 days was at least half of the peak EVI during the growing season were assigned as rice. In contrast, regions that did not meet these criteria were considered for non-rice. As proof of concept, the delineated pseudo-label boundaries of rice and non-rice were randomly sampled and trained on several machine learning models like random forests, support vector machines, gradient-boosted trees, and decision trees, to classify rice areas in Punjab, India, from 2003 to 2022. The random forest model demonstrated superior performance, achieving an Area Under the Curve of receiver-operating characteristics (AUC) of 0.71, compared to other models (AUC of ~0.55). Furthermore, comparing the self-supervised models against the same machine learning models, which were traditionally trained on field survey data (228 ground points: 164 rice, 64 non-rice was collected), the self-supervised models showed ~10% higher performance than their traditionally supervised counterparts. Therefore, this study demonstrates that using this self-supervised modeling framework reduces the need for field-based annotations, while still providing reasonably accurate rice area maps.

How to cite: Narayanan, M., Sharma, A., and Ilampooranan, I.: Self-supervised learning for mapping rice areas reduces the need for field surveys, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-16460, https://doi.org/10.5194/egusphere-egu24-16460, 2024.