EGU23-13160
https://doi.org/10.5194/egusphere-egu23-13160
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Automatic global building completeness assessment of OpenStreetMap using remote sensing data

Laurens Jozef Nicolaas Oostwegel1, Nicolas Garcia Ospina2,1, Tara Evaz Zadeh1, Simantini Shinde1, and Danijel Schorlemmer1
Laurens Jozef Nicolaas Oostwegel et al.
  • 1German Research Centre for Geosciences (GFZ), Section 2.6: Seismic Hazard and Risk Dynamics, Telegrafenberg, 14467 Potsdam, Germany (laurens@gfz-potsdam.de)
  • 2LiveEO, Berlin, Germany

OpenStreetMap (OSM) is the largest crowd-sourced mapping effort to date, with an infrastructure network that is considered near-complete. The mapping activities started as any crowd-sourced information platform: the community expanded OSM anywhere there was a collective interest. Initial efforts were found around universities, hometowns of mappers and areas designated by organizations like the Humanitarian OSM Team (HOT). This resulted in a map that is of non-uniform completeness, with some areas having all building footprints in, while other areas remain incomplete or even untouched. Currently, with 530 million footprints, OSM identifies between a quarter and half of the total building footprints in the world, if we estimate that there are around 1-2 billion buildings in the world.

A global view on the completeness of buildings existing in OSM did not yet exist. Unlike other efforts, that only look at a subset of OSM building data (Biljecki & Ang 2020; Orden et al., 2020; Zhou et al., 2020), we have used the Global Human Settlement Layer (GHSL) to estimate completeness of the entire dataset. The remote sensing dataset is distributed onto a grid and in each tile of the grid, the built area of GHSL is compared to the total area of OSM building footprints. The computed ratio is measured against a completeness threshold that is calibrated using areas that were manually assessed.

Using information derived from remote sensing datasets can be problematic: GHSL does not only measure building footprints: it includes any human-built structures, including infrastructure and industrial areas. Next to that, due to circumstances like imperfect input data or failing algorithms, the dataset is not of the same quality as the crowd-sourced data in OSM in areas that are complete. False positives (i.e. rocky coasts) and false negatives (i.e. buildings missing in mountainous areas) exist in automatically generated data.

Even with these limitations, a comprehensive global completeness assessment is created. The assessment should not be used as ground truth, but rather as reflection on the OSM building dataset as is and as a guideline for priorities for the future. Statistics on regional completeness can be created and the quality of GHSL could be assessed on countries that are considered to be complete, such as France or the Netherlands.

How to cite: Oostwegel, L. J. N., Garcia Ospina, N., Evaz Zadeh, T., Shinde, S., and Schorlemmer, D.: Automatic global building completeness assessment of OpenStreetMap using remote sensing data, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-13160, https://doi.org/10.5194/egusphere-egu23-13160, 2023.