- 1University of Leeds, School of Earth and Environmental Sciences, Leeds, United Kingdom of Great Britain – England, Scotland, Wales (s.s.dhomse@leeds.ac.uk)
- 2University of Leeds, National Centre of Earth Observation, Leeds, United Kingdom of Great Britain – England, Scotland, Wales
Understanding long-term trends in stratospheric species is vital for evaluating the success of the Montreal Protocol and its amendments. However, reliable trend estimation remains challenging due to the sparse spatial and temporal coverage of high-quality observations, such as those from the Atmospheric Chemistry Experiment–Fourier Transform Spectrometer (ACE-FTS).
To overcome this limitation, we present an innovative machine learning framework that fuses ACE-FTS observations with the continuous output of the TOMCAT global Chemical Transport Model (CTM). Using XGBoost regression, we constrain TOMCAT tracers against co-located ACE-FTS measurements, generating the TCOM (TOMCAT CTM and occultation-measurement-based) stratospheric profile datasets for key species: CFC-11, CFC-12, HCl, HF, HNO3, O3, CH4, N2O, and H2O.
The latest TCOM release (version 2.0) provides gap-free, global daily vertical profiles from 2000 to 2024. Validation demonstrates substantial improvements over TOMCAT, including the removal of systematic low biases in simulated CFC concentrations. Interpretable machine learning analysis reveals that XGBoost primarily acts as a “transport corrector,” with dynamical features such as Age-of-Air, temperature, and long-lived tracers exerting the greatest influence. This finding highlights that circulation biases dominate TOMCAT’s baseline errors.
TCOM datasets are publicly available and offer an observationally constrained benchmark for refining chemical models, improving stratospheric transport representation, and reducing uncertainties in ozone-depleting substance (ODS) trend analyses.
Dataset links:
- CFC-11 v2: https://doi.org/10.5281/zenodo.18145730
- CFC-12 v2: https://doi.org/10.5281/zenodo.18147392
- CH4 v2: https://doi.org/10.5281/zenodo.18197333
- N2O v2: https://doi.org/10.5281/zenodo.18197444
- HCl v2: https://doi.org/10.5281/zenodo.18184430
- HF v2: https://doi.org/10.5281/zenodo.18184779
- HNO3 v2: https://doi.org/10.5281/zenodo.18199002
- O3 v2: https://doi.org/10.5281/zenodo.18199586
- H2O v2: https://doi.org/10.5281/zenodo.18199962
- COF2 v2: https://doi.org/10.5281/zenodo.18201786
How to cite: Dhomse, S. and Chipperfield, M.: Machine Learning For Atmospheric Chemistry: Creating Global, Gap-Free Stratospheric Datasets for Montreal Protocol Assessments, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-4514, https://doi.org/10.5194/egusphere-egu26-4514, 2026.