EGU General Assembly 2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

Towards data-driven estimates of the transient climate response to cumulative CO2 emissions using interpretable statistical learning methods

Katarzyna Tokarska, Sebastian Sippel, and Reto Knutti
Katarzyna Tokarska et al.
  • ETH Zurich, Institute for Atmospheric and Climate Science, Zurich, Switzerland (

CO2-induced warming is approximately proportional to the total amount of CO2 emitted. This emergent property of the climate system, known as the Transient Climate Response to cumulative CO2 Emissions (TCRE), gave rise to the concept of a remaining carbon budget that specifies a cap on global CO2 emissions in line with reaching a given temperature target, such as those in the Paris Agreement (e.g., Matthews et al. 2020). However, estimating the policy-relevant TCRE metric directly from the observation-based data products remains challenging due to non-CO2 forcing and land-use change emissions present in the real-world climate conditions.

Here, we present preliminary results for applying and comparing different statistical learning methods to determine TCRE (and later, remaining carbon budgets) from: (i) climate models’ output and (ii) the observational data products. First, we make use of a ‘perfect-model’ setting, i.e. using output from physics-based climate models (CMIP5 and CMIP6) under historical forcing (treated as pseudo-observations). This output is used to train different statistical-learning models, and to make predictions of TCRE (which are known from climate model simulations under CO2-only forcing, per experimental design). Next, we use such trained statistical learning models to make TCRE predictions directly from the observation-based data products.

We also explore interpretability of the applied techniques, to determine where the statistical models are learning from, what the regions of importance are, and the key input features and weights. Explainable AI methods (e.g., McGovern et al. 2019; Molnar 2019; Samek et al. 2019) present a promising way forward in linking data-driven statistical and machine learning methods with traditional physical climate sciences, while leveraging from the large amount of data from the observational data products to provide more robust estimates of, often policy relevant, climate metrics.


Matthews et al. (2020). Opportunities and challenges in using carbon budgets to guide climate policy. Nature Geoscience, 13, 769–779.

McGovern et al. (2019). Making the Black Box More Transparent: Understanding the Physical Implications of Machine Learning, B. Am. Meteorol. Soc., 100, 2175–2199,

Molnar, C. (2019) Interpretable Machine Learning -A Guide for Making Black Box Models Explainable.

Samek, W. et al. (2019) Explainable AI: Interpreting, explaining and visualizing deep learning.

How to cite: Tokarska, K., Sippel, S., and Knutti, R.: Towards data-driven estimates of the transient climate response to cumulative CO2 emissions using interpretable statistical learning methods, EGU General Assembly 2021, online, 19–30 Apr 2021, EGU21-2451,, 2021.

Display materials

Display file

Comments on the display material

to access the discussion