- 1LSCE, ISPL, Paris, France
- 2Celest Science, Paris, France
- 3Columbia University, New York, USA
In the context of increasing interest in machine learning-based emulators to overcome the computational cost and limited scenario coverage of Earth System Models (ESMs), some key challenges remain, such as capturing internal variability, handling non-stationarity, and realistically representing compound and extreme events — especially at high spatial and temporal resolution.
We present a probabilistic multivariate emulator of climate variables based on a flow matching model trained on the CESM2 Large Ensemble (Danabasoglu et al., 2020) under the SSP3-7.0 scenario. Our approach leverages a flow matching framework (Lipman et al., 2022) to reproduce the spatiotemporal variability of temperature and precipitations on a monthly timescale. The model is conditioned on greenhouse gas concentrations and we evaluate the capability of the model to generate physically consistent climate fields and to capture the full ensemble spread of the original ESM, including tail behavior and potential extreme events. To ensure a better reproduction of observed climatological variability, the flow matching model is fine-tuned on ERA5 reanalyses (Hersbach et al., 2020). This should enable the emulator to act as a stochastic weather generator of plausible climate states under GHG forcing trajectories, accounting for the non-stationarity introduced by anthropogenic climate change, and allowing for the assessment of rare or compound extreme events within the generated ensemble. Our results assess the model’s ability to reproduce ensemble-scale statistics, cross-variable dependencies, and evolving climate distributions across time.
How to cite: Launeau, V., Vrac, M., Lemordant, L., and Gentine, P.: Emulating climate variability and extremes with a multivariate flow matching model trained on CESM2 and finetuned on ERA5, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-16454, https://doi.org/10.5194/egusphere-egu26-16454, 2026.