EGU25-7220, updated on 14 Mar 2025
https://doi.org/10.5194/egusphere-egu25-7220
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Monday, 28 Apr, 10:00–10:10 (CEST)
 
Room -2.92
Ground-level land surface classification and thermal analysis using foundation models
Konlavach Mengsuwan1,2 and Masahiro Ryo1,2
Konlavach Mengsuwan and Masahiro Ryo
  • 1Research Area Simulation & Data Science, Leibniz Centre for Agricultural Landscape Research (ZALF), Müncheberg, Germany (konlavach.mengsuwan@zalf.de)
  • 2Environment and Natural Sciences, Brandenburg University of Technology Cottbus‐Senftenberg, Cottbus, Germany (konlavach.mengsuwan@zalf.de)

Foundation models have shown substantial potential in enhancing Earth observation by providing high accuracy while minimizing the need for manual annotation. However, the combined application of multiple foundation models for processing ground-truthing data remains largely underexplored. This study introduces a new approach for ground-level land use classification and ground-truthing by integrating a vision foundation model, the Segment Anything Model (SAM), with a general-purpose large language model (GPT-4o). Using high-resolution thermal and RGB imagery captured from human-eye height with a handheld camera, the proposed method generates object-level land use classifications and surface temperature profiles. Data collection was conducted in the Lusatia region of Germany, covering diverse land use types. SAM was utilized to segment complex landscape structures into meaningful elements such as roads, water bodies, and trees, followed by GPT-4o, which classified these segments into custom-defined land use categories. At a broad level (7 classification types), the workflow achieved approximately 80% accuracy, with high F1 scores for categories such as Road (0.89), Vegetation (0.82), and Built Structure (0.81). At a finer level (28 classification types), the method attained around 64% accuracy, effectively classifying detailed sub-classes such as Asphalt-Concrete Road (F1 = 0.85), Brick Road (F1 = 0.86), Tree (F1 = 0.74), and Arable Land (F1 = 0.68). By overlaying thermal imagery with classified segments, the method revealed distinct microclimatic patterns across land use types, with agricultural land showing the lowest surface temperatures (p < 0.001). The proposed workflow underscores the potential of combining SAM and GPT-4o to deliver robust ground-truthing data using portable cameras, advancing AI-enabled environmental monitoring.

How to cite: Mengsuwan, K. and Ryo, M.: Ground-level land surface classification and thermal analysis using foundation models, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-7220, https://doi.org/10.5194/egusphere-egu25-7220, 2025.