EGU General Assembly 2020
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Optimal model complexity for terrestrial carbon cycle prediction using data assimilation

Caroline A. Famiglietti1, T. Luke Smallman2,3, Sophie Flack-Prain2, Rong Ge2, Victoria Meyer5, Nicholas C. Parazoo4, Gregory R. Quetin1, Andrew Revill2, Stephanie G. Stettz4, Yan Yang4, Yuan Zhao2, Penghui Zhu2, A. Anthony Bloom4, Mathew Williams2,3, and Alexandra G. Konings1
Caroline A. Famiglietti et al.
  • 1Department of Earth System Science, Stanford University, Stanford, USA
  • 2School of GeoSciences, University of Edinburgh, Edinburgh, UK
  • 3National Centre for Earth Observation, University of Edinburgh, Edinburgh, UK
  • 4Jet Propulsion Laboratory, California Institute of Technology, Pasadena, USA
  • 5School of the Art Institute of Chicago, Chicago, USA

The future role of the terrestrial biosphere in the global carbon cycle is highly uncertain. Modeling and predicting the terrestrial net carbon balance is difficult due to the numerous processes driving variability of gross fluxes. Many approaches to reducing this model uncertainty have focused on model structure, namely by adding additional processes (e.g., nutrient dynamics or vegetation demography) and thus increasing complexity. While these developments seek to achieve greater structural realism by mirroring the complexity of the natural world, they often rely, by necessity, on poorly-determined or over-generalized parameters. Furthermore, increased structural complexity may increase the risk that parameters with compensating errors are found during model development, thereby reducing model accuracy in prediction. It is not clear whether or to what extent carbon cycle predictability scales with structural complexity, or whether an intermediate, optimum level of complexity exists that may balance the costs of a low (more biased) or high (more variant) complexity model. Here, we explore and define the relationship between carbon cycle model complexity and prediction accuracy. To do so, we leverage the CARbon Data MOdel fraMework (CARDAMOM), a Bayesian data assimilation system that retrieves terrestrial carbon cycle variables (including pools, fluxes, and static parameters) by combining multiple observations with a relatively simple ecosystem carbon balance model. CARDAMOM includes several ecological and dynamical constraints that can prevent ecologically unrealistic parameter combinations and reduce compensating errors between parameters (also known as equifinality). Furthermore, it is a flexible framework to which process representations, parameters, and constraints can easily be added and removed. We used CARDAMOM to develop a suite of model versions spanning a broad range of structural complexity, including the number of carbon pools and the allocation of carbon to the canopy. We assessed a model’s complexity based on its inherent dimensionality, determined via a principal component analysis that reduces the parameter space to its principal components. We tested and compared the training and forecast accuracies of net ecosystem exchange predictions using 14 increasingly complex versions of CARDAMOM, each with 48 different experimental designs (i.e., combinations of data constraints and error assumptions) at 5 globally-distributed eddy covariance sites representing a range of biomes and vegetation types across a total of 70 site-years. We also compared the model performance values against a range of machine learning approaches, which are assumed to represent the limit of infinite model complexity due to their large number of underlying parameters. In this presentation, we use this population to demonstrate and explain patterns in the mapping of model complexity and other assimilation choices to prediction accuracy, offering theoretical and empirical insights into the optimal structure of a carbon cycle model.

How to cite: Famiglietti, C. A., Smallman, T. L., Flack-Prain, S., Ge, R., Meyer, V., Parazoo, N. C., Quetin, G. R., Revill, A., Stettz, S. G., Yang, Y., Zhao, Y., Zhu, P., Bloom, A. A., Williams, M., and Konings, A. G.: Optimal model complexity for terrestrial carbon cycle prediction using data assimilation, EGU General Assembly 2020, Online, 4–8 May 2020, EGU2020-10722,, 2020.


Display file

Comments on the display

AC: Author Comment | CC: Community Comment | Report abuse

displays version 1 – uploaded on 30 Apr 2020, no comments