EGU26-17577, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-17577
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Wednesday, 06 May, 11:35–11:45 (CEST)
 
Room 1.15/16
Validating Crowdsourced Building Data: A Statistical and Expert Approach
Maria Teresa Artese1, Elisa Varini1, Gianluigi Ciocca2, Antonella Peresan3, Flavio Piccoli2, Claudio Rota2, Rajesh Kumar2, and Chiara Scaini3
Maria Teresa Artese et al.
  • 1National Research Council of Italy, Institute for Applied Mathematics and Information Technologies, MILANO, Italy (teresa@mi.imati.cnr.it)
  • 2University of Milano-Bicocca
  • 3National Institute of Oceanography and Experimental Geophysics

The SMILE project explores the use of machine learning to generate updated building exposure layers by integrating remote sensing imagery, census data, and validated crowdsourced information. Crowdsourced data are collected through targeted initiatives involving trained students and citizens. To facilitate these activities, a web-based multimedia platform (https://smile.mi.imati.cnr.it) was developed to guide users through data collection, manage workflows, and store georeferenced information and images in a structured database, which currently includes survey forms on 4,100 buildings, mostly located in Northestern Italy.

A key goal of the study is to validate the collected data and assess their potential use to enhance existing building exposure datasets. Approximately the forms for about 400 buildings located in Udine, filled in by high school students via the platform, were reviewed by experts. Comparing expert-validated and student-collected data enabled identification of potential issues in survey design and allowed for a statistical assessment of data quality and reliability. The validation approach integrates rigorous statistical techniques, including summary statistics, cross-correlation analyses, and dissimilarity measures, with visualization methods to support the interpretation and communication of complex datasets.

We acknowledge the PRIN 2022 PNRR project SMILE “Statistical Machine Learning for Exposure development” (code P202247PK9, CUP B53D23029430001) within the European Union-NextGenerationEU program.

How to cite: Artese, M. T., Varini, E., Ciocca, G., Peresan, A., Piccoli, F., Rota, C., Kumar, R., and Scaini, C.: Validating Crowdsourced Building Data: A Statistical and Expert Approach, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-17577, https://doi.org/10.5194/egusphere-egu26-17577, 2026.