EGU26-9404, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-9404
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Tuesday, 05 May, 16:15–18:00 (CEST), Display time Tuesday, 05 May, 14:00–18:00
 
Hall X5, X5.44
Machine Learning Prediction of Long-term Variations in Cloud Condensation Nuclei in the upper boundary layer of North China
Can Cui, Yujiao Zhu, Jiangshan Mu, Yuqiang Zhang, and Likun Xue
Can Cui et al.
  • Shandong University, Environmental Research Institute, China (202520594@mail.sdu.edu.cn)

The scarcity of field observations of cloud condensation nuclei (CCN) limits effective constraints on aerosol–cloud interactions. While a small number of recent studies have explored machine learning approaches based on aerosol chemical and optical characteristics, even fewer have explicitly included particle number size distributions (PNSDs). Here, we developed an observation-driven model based on XGBoost to predict CCN number concentrations (NCCN) by incorporating PNSDs and auxiliary variables. The model exhibits robust performance on the test dataset at supersaturations (SS) of 0.2%, 0.4%, and 1.0% (R2 = 0.91–0.92; RMSE = 235–381 ppbv), demonstrating excellent capability in capturing the temporal variability of NCCN. PNSDs are identified as the most influential features for NCCN prediction using the SHapely Additive exPlanation (SHAP) approach, with the dominant size range shifting from 100–150 nm at SS ≤ 0.4% to 50–100 nm at 1.0% SS. The XGBoost model was further employed to reconstruct the long-term variations of NCCN in the upper boundary layer over North China during 2007–2025. Our results show that NCCN predominantly ranges from 866 to 2104 cm-3, with higher values in spring and winter but enhanced activation ratios in summer and autumn. Interannual variability beyond seasonal influences indicates that NCCN exhibits pronounced interannual fluctuations, largely driven by changes in highly oxidized particle sources. In contrast, overall aerosol hygroscopicity and activation ratio exhibit a gradual decline. The proposed XGBoost framework not only extends long-term NCCN records but also provides new mechanistic insights into CCN activation, thereby reducing uncertainties in the assessments of aerosol-cloud interactions.

How to cite: Cui, C., Zhu, Y., Mu, J., Zhang, Y., and Xue, L.: Machine Learning Prediction of Long-term Variations in Cloud Condensation Nuclei in the upper boundary layer of North China, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-9404, https://doi.org/10.5194/egusphere-egu26-9404, 2026.