Reconstructing Atmospheric CO2 with Flow Matching Models

Jonathan Groß; Vitus Benson; Maurício Lima; Alexander Winkler; Christian Reimers

doi:https://doi.org/10.5194/egusphere-egu26-15228

[Back] [Session ITS1.10/BG10.6]

EGU26-15228, updated on 14 Mar 2026

https://doi.org/10.5194/egusphere-egu26-15228

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Reconstructing Atmospheric CO₂ with Flow Matching Models

Jonathan Groß, Vitus Benson, Maurício Lima, Alexander Winkler, and Christian Reimers

Jonathan Groß et al.

Max Planck Institute for Biogeochemistry, Biogeochemical Integration, Germany (jgross@bgc-jena.mpg.de)

Accurate estimates of the spatiotemporal distribution of atmospheric carbon dioxide (CO₂) are essential to evaluate and enforce international climate agreements as well as to infer fluxes of the greenhouse gas. However, current observations are spatially sparse, with satellite and in-situ measurements providing only partial coverage of the Earth’s surface and atmosphere. Atmospheric transport models are often used to infer CO₂ concentrations across unobserved regions by simulating how gases move and mix in the atmosphere. While physically grounded, these models are computationally intensive and notoriously difficult to calibrate with observational data, due to the complexity of atmospheric dynamics and the sparsity of available measurements.

This study investigates the use of generative machine learning for inpainting of CO₂. More specifically, we apply flow matching, an approach that generates samples from an unknown target distribution by iteratively transforming samples from a simple known noise distribution with a deep neural network. In a first step, we train a flow matching model on assimilation data from CarbonTracker (CT2022). This trains the model to respect the physical patterns of atmospheric CO₂ fields, turning it into an effective prior for data assimilation. In a second step, we test the trained flow matching model on conditional generation that is, reconstruction of atmospheric CO₂ from partial observations. For this, we artificially mask parts of the CT2022’s CO₂ in a way that emulates the availability of satellite measurements. In a third step, we infer global CO₂by conditioning on the total column average CO₂ (XCO₂) measurements from NASA’s Orbiting Carbon Observatory-2 (OCO-2), comparable to other inversions from the OCO-2 v11 MIP, but using a novel approach.

Extensive evaluation against independent and held-out test-sets from in-situ and satellite measurements show physical consistency and decent agreement of the reconstructed global CO₂ fields from OCO-2 measurements. However, challenges remain: specifically, future research needs to alleviate spurious artifacts from the employed posterior conditioning method in both the artificial mask and particularly the conditioning on XCO₂ before the approach can become operational.

Our presented flow matching approach opens up new avenues of research. The prior parameterized by the flow matching model can be investigated itself. For instance, it is possible to perform feature extraction inside the latent space and hence purposefully explore counterfactual scenarios of CO₂ distributions by carefully tracing out paths in the noise distribution and analyzing the corresponding generated CO₂ samples.

How to cite: Groß, J., Benson, V., Lima, M., Winkler, A., and Reimers, C.: Reconstructing Atmospheric CO2 with Flow Matching Models, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-15228, https://doi.org/10.5194/egusphere-egu26-15228, 2026.

Supplementary materials

Supplementary material file

Comments on the supplementary material

AC: Author Comment | CC: Community Comment | Report abuse

supplementary materials version 2 – uploaded on 05 May 2026, no comments