EGU24-18300, updated on 11 Mar 2024
https://doi.org/10.5194/egusphere-egu24-18300
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

Geospatial Foundation Models for Efficient Retrieval of Remote Sensing Images

Thomas Brunschwiler, Benedikt Blumenstiel, Viktoria Moor, and Romeo Kienzler
Thomas Brunschwiler et al.
  • IBM Research, Rüschlikon, Switzerland (tbr@zurich.ibm.com)

This work explores the potential of content-based image retrieval to enable efficient search through vast amounts of satellite data. Images can be identified across multiple semantic concepts without needing specific annotations. We propose to use Geospatial Foundation Models (GeoFM), for remote sensing image retrieval and evaluated the models on two datasets. The GeoFM named Prithvi uses six bands and outperforms other RGB-based models by achieving a mean Average Precision of 61% on ForestNet-4 and 98% on BigEarthNet-19. The results demonstrate that the model efficiently encodes multi-spectral data and generalizes without requiring further fine-tuning. Additionally, this work evaluates three compression methods: i) binary embeddings, ii) trivial hashing, and iii) locality-sensitive hashing. Compression with binarized embeddings isthe best option for balancing retrieval speed and accuracy. It matches the latency of much shorter hash codes while maintaining the same accuracy as floating-point embeddings.

How to cite: Brunschwiler, T., Blumenstiel, B., Moor, V., and Kienzler, R.: Geospatial Foundation Models for Efficient Retrieval of Remote Sensing Images, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-18300, https://doi.org/10.5194/egusphere-egu24-18300, 2024.