Multi-source data and machine learning supporting high-resolution building exposure extraction

Wenyu Nie; Xiwei Fan; Jing Wang; Lin wang; Yuanmeng Qi; Min Liu; Fucun Lu; Laurens Oostwegel; Danijel Schorlemmer

doi:https://doi.org/10.5194/egusphere-egu26-9671

[Back] [Session NH6.7]

EGU26-9671, updated on 14 Mar 2026

https://doi.org/10.5194/egusphere-egu26-9671

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Multi-source data and machine learning supporting high-resolution building exposure extraction

Wenyu Nie^1,2,3, Xiwei Fan^1,2, Jing Wang^1,2, Lin wang^1,2, Yuanmeng Qi^1,2, Min Liu^1,2, Fucun Lu^1,2, Laurens Oostwegel^3,4, and Danijel Schorlemmer^3,5

Wenyu Nie et al.

¹Key Laboratory of Seismic and Volcanic Hazards, China Earthquake Administration, 100029 Beijing, China
²Institute of Geology, China Earthquake Administration, 100029 Beijing, China
³GFZ Helmholtz Centre for Geosciences, Telegrafenberg, 14473 Potsdam, Germany
⁴ISTerre, Université Grenoble Alpes, Université Savoie Mont-Blanc, CNRS, IRD, Université Gustave Eiffel, CS40700 38058 Grenoble cedex 9, 1381 Rue de la Piscine, 38610 Gières, France
⁵Swiss Seismological Service, ETH Zurich, 8092 Zurich, Switzerland

Recent urban earthquakes and rapid urbanization have intensified the demand for fine-scale building exposure information in disaster risk assessment. However, existing approaches for high-resolution building exposure extraction often suffer from limited data completeness, insufficient semantic detail, and weak update capability, particularly at detailed spatial scales. Moreover, traditional methods relying on homogeneous data sources and static classifications struggle to represent the heterogeneity of urban building exposure.

To address these limitations, we propose a multi-source data-driven framework combined with machine learning to extract high-resolution building exposure information, focusing on building function and building height. Building function types are inferred by integrating OpenStreetMap building footprints with time-series mobile signaling data, exploiting differences in population activity patterns across day-night and workday-non-workday periods. Machine learning techniques are then applied to identify clusters of buildings with similar population dynamic characteristics, enabling the inference of building function types. Building height is extracted from bi-temporal Sentinel-2 imagery by capturing variations in image brightness induced by seasonal differences in building shadow length, and a random forest model is employed to learn the nonlinear relationship between image features and building height, thereby reducing reliance on very high-resolution imagery and manual interpretation.

Case studies in representative Chinese cities indicate that the integration of multi-source data and machine learning enables more effective use of data for different building exposure attributes, resulting in improvements in spatial detail, attribute completeness, and data timeliness. Population-dynamic-based building function identification provides an activity-oriented characterization of building use, while building height estimation based on freely available Sentinel-2 imagery offers a cost-efficient and scalable approach. Overall, these findings suggest that multi-source data integration and machine learning can support large-scale, high-resolution urban building exposure mapping.

How to cite: Nie, W., Fan, X., Wang, J., wang, L., Qi, Y., Liu, M., Lu, F., Oostwegel, L., and Schorlemmer, D.: Multi-source data and machine learning supporting high-resolution building exposure extraction, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-9671, https://doi.org/10.5194/egusphere-egu26-9671, 2026.

OSPP voting tool

This contribution takes part in the OSPP contest. Please log in to see the relevant judging section.

Supplementary materials

Supplementary material file

Comments on the supplementary material

AC: Author Comment | CC: Community Comment | Report abuse

supplementary materials version 1 – uploaded on 04 May 2026, no comments