EGU25-11003, updated on 15 Mar 2025
https://doi.org/10.5194/egusphere-egu25-11003
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Monday, 28 Apr, 16:15–18:00 (CEST), Display time Monday, 28 Apr, 14:00–18:00
 
Hall X3, X3.36
Recognizing any Land surface anomaly with multi-modal foundation model
Jingtao Li
Jingtao Li
  • wuhan, China (jingtaoli@whu.edu.cn)

Various Land surface anomalies have destroyed the stable and balanced state of human living, resulting in fatalities and serious destruction of property. Remote sensing technique has been proven useful in many studies with time-series and large-scale observation advantages. However, existing studies are limited in anomaly recognition of certain categories, lacking the important generalization ability to recognize rare or unseen anomalies. To tackle this problem, we have built a multi-modal land surface anomaly recognition foundation model, which connects the images and anomaly caption words in an open-world setting. A global scale multi-modal dataset is constructed to train the model, which refers to 1000 large-scale monitoring regions covering over 2000 km2 in total, with rich text caption collected from offical news report. After the self-supervised contrastive learning with image and text modalities, the foundation model can describe both the anomaly category and attributes directly given any monitoring image, without the need for further tuning. These open-world and tuning-free settings promote the ability of rapid anomaly monitoring.

How to cite: Li, J.: Recognizing any Land surface anomaly with multi-modal foundation model, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-11003, https://doi.org/10.5194/egusphere-egu25-11003, 2025.