Large-scale integration of data for distribution models (and beyond)

Robert O'Hara; Philip Mostert; Sam Perrin; Kwaku Adjei; Ron Togunov; Anders Finstad

doi:https://doi.org/10.5194/wbf2026-779

[Back] [Session IND11]

WBF2026-779, updated on 10 Mar 2026

https://doi.org/10.5194/wbf2026-779

World Biodiversity Forum 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Oral | Wednesday, 17 Jun, 10:30–10:45 (CEST)| Room Sanada 1

Large-scale integration of data for distribution models (and beyond)

Robert O'Hara¹, Philip Mostert¹, Sam Perrin¹, Kwaku Adjei², Ron Togunov¹, and Anders Finstad¹

Robert O'Hara et al.

¹NTNU, Dept. of Mathematical Sciences, (bob.ohara@ntnu.no)
²NINA, Trondheim, Norway

One of the most important knowledge gaps for nature management is the species diversity in an area. In order to know the spatial distribution of richness of different species groups, observations of species are needed. This is challenging, because for many species groups it is resource-intensive to systematically map the occurrence of species over large areas. Instead, citizen science data has become more important, but as it is mostly collected opportunistically , there is a high geographical bias in the collection. If this is not taken into account, one gets the impression that the most species-rich areas are where there are the most people. This bias can be estimated and corrected if citizen science data is correctly combined with survey data, e.g. from national monitoring programs.

Integrated species distribution models (iSDM) are relatively new statistical tools that combine opportunistically collected data with systematically collected data with information about collection intensity. These models have great potential to benefit from the strengths of different types of data, while at the same time being able to take into account some of the challenges, but the models have not been used to a large extent in large-scale modeling of species diversity. Large-scale data presents both opportunities and problems. It allows us to leverage information across species, but the size of the data makes it infeasible to model every species together.

Here we will present our approach to this large-scale modelling, using iSDMs across species to produce fine-scale maps of communities over Norway, along with their associated uncertainties and biases. The model is based on a state space, using ideas from point processes, which has proved flexible for different data types and for which there are efficient and flexible fitting methods. We will outline the model and the methods we have used to overcome the problems of large data and the requirement for fine spatial scales. We will also outline how this is being extended to include time and topologies such as river networks.

How to cite: O'Hara, R., Mostert, P., Perrin, S., Adjei, K., Togunov, R., and Finstad, A.: Large-scale integration of data for distribution models (and beyond), World Biodiversity Forum 2026, Davos, Switzerland, 14–19 Jun 2026, WBF2026-779, https://doi.org/10.5194/wbf2026-779, 2026.