EGU General Assembly 2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

Machine-Learning-Based Satellite-Corrected Global Stratospheric Ozone Profile Dataset from a Chemical Transport Model

Sandip Dhomse and Martyn Chipperfield
Sandip Dhomse and Martyn Chipperfield
  • School of Earth & Environment, Leeds, United Kingdom of Great Britain – England, Scotland, Wales (

High quality global ozone profile datasets are necessary to monitor changes in stratospheric ozone. Hence, various methodologies have been used to merge and homogenise different satellite datasets in order to create long-term observation-based ozone profile datasets with minimal data gaps. However, individual satellite instruments use different measurement methods and retrieval algorithms that complicate the merging of these different datasets. Furthermore, although atmospheric chemical models are able to simulate chemically consistent long-term datasets, they are prone to the deficiencies associated with the computationally expensive processes that are generally represented by simplified parameterisations or uncertain parameters.

Here, we use chemically consistent output from a 3-D Chemical Transport Model (CTM, TOMCAT) and an ensemble of three machine learning (ML) algorithms (Adaboost, GradBoost, Random Forest), to create a 42-year (1979-2020) stratospheric ozone profile dataset. The ML algorithms are primarily trained using the Stratospheric Water and OzOne Satellite Homogenized (SWOOSH) dataset by selecting the UARS-MLS (1992-1998) and AURA-MLS (2005-2019) time periods. This ML-corrected version of monthly mean zonal mean TOMCAT (hereafter ML-TOMCAT) ozone profile data is available at both pressure (1000 hPa - 1 hPa) and geometric height (surface to 50 km) levels at about 2.5 degree horizontal resolution.

We will present a detailed evaluation of ML-TOMCAT profiles against range of merged satellite datasets (e.g. GOZCARDS, SAGE-CCI-OMPS, and BVertOzone) as well high quality solar occultation observations (e.g. SAGE-II v7.0 (1984-2005), HALOE v19 (1991-2005) and ACE v4.1 (2004-2020). Overall, ML-TOMCAT shows good agreement with the evaluation datasets but with poorer agreement at low latitudes. We also show that, as in different merged satellite data sets, ML-algorithms show larger spread in the tropical middle stratosphere. Finally, we will present a trend analysis from TOMCAT and ML-TOMCAT profiles for the post-1998 ozone recovery phase.

How to cite: Dhomse, S. and Chipperfield, M.: Machine-Learning-Based Satellite-Corrected Global Stratospheric Ozone Profile Dataset from a Chemical Transport Model, EGU General Assembly 2021, online, 19–30 Apr 2021, EGU21-9613,, 2021.


Display file

Comments on the display

to access the discussion