WBF2026-111, updated on 10 Mar 2026
https://doi.org/10.5194/wbf2026-111
World Biodiversity Forum 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Tuesday, 16 Jun, 08:45–09:00 (CEST)| Room Aspen 1
Towards a mountain biodiversity knowledge graph
Anisia Kantiskaia1, Mark Snethlage2,3, Yousra El-Bachir1, Carl Remlinger1, Davnah Urbach2,3, and Luis Salamanca4
Anisia Kantiskaia et al.
  • 1Swiss Data Science Centre, Federal Institute of Technology Lausanne, Switzerland
  • 2Global Mountain Biodiversity Assessment, University of Bern, Institute of Plant Sciences, Switzerland
  • 3Global Mountain Biodiversity Assessment, University of Lausanne, Centre Interdisciplinaire de Recherche sur la Montagne, Switzerland
  • 4Swiss Data Science Centre, Federal Institute of Technology Zurich, Switzerland

Ongoing changes in mountain biodiversity have important consequences for the future provision of ecosystem services across scales and for human livelihoods and wellbeing worldwide, calling for effective action. However, the formulation of environmental policies and measures that address the challenges of sustainable management and conservation of mountain ecosystems relies on knowledge that is ‘trapped’ inside a vast and rapidly increasing corpus of unstructured text - the scientific literature, which to date is not accessible to machine-based approaches. Our objective is to develop MoBiKo, an open access global mountain biodiversity knowledge graph built from the entities and relations extracted from the corpus of mountain biodiversity literature. This knowledge graph will ‘liberate’ and structure available knowledge pertaining to the state of, trends in, and drivers of mountain biodiversity. By following principles of findability, accessibility, interoperability, and reusability, we enable broad usage, its expansion with new entities of interest, and its application for varied downstream tasks. Here, we present ongoing work towards achieving a first version of MoBiKo with (i) an approach to improve named-entity recognition based on a hybrid framework that combines structured resources with large language models, and (ii) a preliminary attempt towards relationship extraction using models that are pre-trained on existing datasets and fine-tuned on synthetically generated mountain biodiversity triplets. In addition, we present the domain-specific gazetteers used to address widespread issues of heterogeneous terminologies and enable targeted inference and efficient pre-filtering of relevant sentences, and we provide examples of the contribution of such gazetteers to linked open data and to the systematic mapping of mountain biodiversity literature. Our preliminary results highlight the potential of hybrid and iterative natural language processing pipelines to bridge rule-based and generative methods. By developing this structured, curated, and digitally accessible knowledge base, we aim to support scientific research and inform policy as well as conservation efforts. We further contribute to “opening up what is known about biodiversity” and thereby support the Disentis Roadmap 2024 vision to fully “leverage the power of biodiversity knowledge from research publications within an open science framework”.

How to cite: Kantiskaia, A., Snethlage, M., El-Bachir, Y., Remlinger, C., Urbach, D., and Salamanca, L.: Towards a mountain biodiversity knowledge graph, World Biodiversity Forum 2026, Davos, Switzerland, 14–19 Jun 2026, WBF2026-111, https://doi.org/10.5194/wbf2026-111, 2026.