EGU24-8657, updated on 08 Mar 2024
https://doi.org/10.5194/egusphere-egu24-8657
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

An innovative SAM-ViT based tool for the automatic detection of litter items on sandy beaches

Angelo Sozio1, Angela Rizzo1,2, Vincenzo Mariano Scarrica3, Pietro Patrizio Ciro Aucelli3, Giorgio Anfuso4, Giovanni Barracane5, Luca Antonio Dimuccio6, Rui Ferreira6, Marco La Salandra1, Antonino Staiano3, Maria Pia Tarantino1, and Giovanni Scicchitano1,2
Angelo Sozio et al.
  • 1Department of Earth and Geoenvironmental Sciences, University of Bari Aldo Moro, Bari, 70125, Italy (angelo.sozio@uniba.it)
  • 2Interdepartmental Research Centre for Coastal Dynamics, University of Bari Aldo Moro, Bari, 70125, Italy
  • 3Department of Sciences and Technology, University of Naples Parthenope, Naples, 80133 Italy
  • 4Departamento de Ciencias de la Tierra, Facultad de Ciencias del Mar y Ambientales, Universidad de Cádiz, Puerto Real, 11510, Cádiz, Spain
  • 5Environmental Surveys s.r.l., Bari, Spin-off of University of Bari Aldo Moro, 70125, Italy
  • 6Centre of Studies in Geography and Spatial Planning (CEGOT), Department of Geography and Tourism, University of Coimbra, Coimbra, 4150-564, Portugal

Machine learning (ML) techniques in the field of Computer Vision turned out to be well-performing tools for the automatic detection of beach litter (BL) items on high resolution UAVs images. This study was carried out in the frame of the RiPARTI project (funded by Apulia Region) proposes an innovative approach based on the combination of the aero-photogrammetric surveys with a newly proposed ML tool.

A series of experiments were conducted with a Mask-Regional Convolutional Neural Network-based (Mask-RCNN) algorithm using an image dataset acquired UAVs fights performed on different coastal sites, in Italy, Portugal, and Spain. Preliminary detection experiments were conducted using three BL items categories, “Bottles”, “Worked Wood” and “Nets”. Subsequently, a comparison with algorithms available in QGIS software confirmed the great potential of Computer Vision techniques. Indeed, in previous studies (Sozio et al., 2023), the performance of the Mask-RCNN based algorithm resulted higher than performances of algorithms available in QGIS software, but still not enough to obtain a definitive ML tool for BL automatic detection.

The novel ML tool here proposed exploits the powerful dataset of Segment Anything (SAM) (Kirillov et al., 2023) developed by Meta AI, as segmentation algorithm and Visual Transformer (ViT) for the classification task. A first experiment was conducted with a dataset derived from UAVs images acquired in five different sites, i.e., Capitolo and Torre Guaceto beach (Italy), Leirosa beach (Portugal), Valdelagrana (Spain), and Cala del Cefalo beach (Italy). Aero-photogrammetric surveys were carried out at different flight heights for each site so, the final images resolution ranges from 0.3 cm/pixel to 0.7 cm/pixel. Moreover, the different color of the sand (background) represents a parameter which could affect the performance of segmentation process. Orthomosaics in .tiff format were split in 1000-pixel square tile and segmented by SAM. It executed a panoptic segmentation that produced 450 masks, both concave and convex, corresponding to objects identified on images. These masks were catalogued according to 11 labels (Bottles, Nets, Polystyrene, Worked wood, Vials, Buckets, Building waste, Ethernit, Sand, Vegetation, and Water), accounting for both the most common litter categories and natural assets. Subsequently, masks so gathered were used to train ViT, the classification algorithm and to perform the test phase, which was carried out on 450 masks, with a ratio of training, validation and test split of 7/10, 1/10 and 2/10, respectively. A preliminary experiment produced output images classified by ViT with an accuracy of 0.93 and an f1 score equal to 0.6. Data considered for this last experiment are more complex for number of classes and amount of data, so performance are better in projection” also considering the different images resolution and the background texture. Finally, identified items are georeferenced with a projected reference system. The method outstanding a very reliable performance for the BL detection task and could represent a useful and definitive approach for the assessment of the BL distribution and as well as for the identification of the main accumulation zones so as to make possible the development of tailored coastal management actions. 

How to cite: Sozio, A., Rizzo, A., Scarrica, V. M., Aucelli, P. P. C., Anfuso, G., Barracane, G., Dimuccio, L. A., Ferreira, R., La Salandra, M., Staiano, A., Tarantino, M. P., and Scicchitano, G.: An innovative SAM-ViT based tool for the automatic detection of litter items on sandy beaches, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-8657, https://doi.org/10.5194/egusphere-egu24-8657, 2024.