EGU26-2651, updated on 13 Mar 2026
https://doi.org/10.5194/egusphere-egu26-2651
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Friday, 08 May, 08:30–10:15 (CEST), Display time Friday, 08 May, 08:30–12:30
 
Hall X5, X5.139
Using an Interpretable Machine Learning Framework for Managing Fragmented and Unbalanced Mobile Monitoring Datasets in Spatiotemporal Modeling
Jia Xu1, Bin Han1, Xiaoqian Li1, Xiaobo Li2, Hong Hou1, and Zhipeng Bai1
Jia Xu et al.
  • 1Chinese Research Academy of Environmental Sciences, State Key Laboratory of Environmental Criteria and Risk Assessment, Beijing, China
  • 2Beijing Key Laboratory of Environmental Toxicology, School of Public Health, Capital Medical University, Beijing, China

Particulate black carbon (BC) is an important air pollutant linked to adverse health effects in susceptible populations. This study developed a spatiotemporal exposure model for BC for a pregnancy cohort in Beijing, China, addressing the challenge of a temporally and spatially fragmented and unbalanced dataset obtained from a mobile monitoring campaign. Ambient particulate BC was measured using a mobile vehicle platform from 2023 through 2024 in urban residential areas. The resulting dataset, characterized by high spatial but limited temporal coverage at an hourly resolution, was processed using an interpretable tree-based machine learning model (XGBoost) with SHapley Additive exPlanations (SHAP) to generate a temporally continuous daily dataset. This dataset was subsequently integrated into a hierarchical geostatistical model incorporating multiple geographic covariates. Ten-fold cross-validation demonstrated good predictive performance, with an R²ₘₛₑ of 0.90 for the final spatiotemporal exposure model. The quartile misclassification of the BC exposure assessments, as estimated by different exposure models, was evaluated. By integrating an interpretable machine learning approach, this work provides a robust framework for managing a fragmented and unbalanced mobile monitoring dataset and supports high-resolution spatiotemporal exposure assessments for a pregnant cohort study.

How to cite: Xu, J., Han, B., Li, X., Li, X., Hou, H., and Bai, Z.: Using an Interpretable Machine Learning Framework for Managing Fragmented and Unbalanced Mobile Monitoring Datasets in Spatiotemporal Modeling, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-2651, https://doi.org/10.5194/egusphere-egu26-2651, 2026.