EGU25-16261, updated on 15 Mar 2025
https://doi.org/10.5194/egusphere-egu25-16261
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Machine Learning for identification and classification of Foraminifera: testing on monothalamids
Alessandra Negri1, Anna Sabbatini1, Francesca Caridi1, and Domenico Potena2
Alessandra Negri et al.
  • 1Università Politecnica delle Marche, Dipartimento di Scienze della Vita e dell'Ambiente - DISVA, Ancona, Italy
  • 2Università Politecnica delle Marche, Dipartimento di Ingegneria dell'Informazione - DII, Ancona, Italy

Here we propose an AI-based approach using Machine Learning (ML) to assist species identification and reduce morphotype redundancy in the study of monothalamous foraminifera. In fact, this group of protists, is often overlooked in taxonomic studies due to their morphological simplicity and diversity. These single-celled organisms with "soft" tests are poorly studied, with only a few species identified, while many morphotypes remain undescribed. Taxonomic research on monothalamids is limited by challenges in identification, lack of fossilization, and the time-intensive nature of the work. This gap may lead to underestimating biodiversity and hinder detecting ecosystem degradation. Despite these challenges, monothalamids play key roles in marine ecosystems, making their diversity crucial for conservation and resource management. With this in mind, we analyzed images from the scientific literature, extracting key morphological traits, such as chamber shape, shell type, composition, and aperture type, through objective human annotation to build a dataset processed by ML algorithms. Clustering techniques, such as K-Means, revealed that basic shape, followed by shell type and composition, were the primary features distinguishing clusters. This approach enabled more objective morphotype classification, improving consistency and reducing human bias. These findings align with recent taxonomic revisions and demonstrate that applying unsupervised ML methods enhances species identification accuracy and streamlines the analysis of high-dimensional datasets.

How to cite: Negri, A., Sabbatini, A., Caridi, F., and Potena, D.: Machine Learning for identification and classification of Foraminifera: testing on monothalamids, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-16261, https://doi.org/10.5194/egusphere-egu25-16261, 2025.