EGU26-8161, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-8161
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Monday, 04 May, 14:55–15:05 (CEST)
 
Room 0.11/12
Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems
Paul Collart1,2, Jürgen Gall3,4, Andrea Schnepf1,2, Lars Doorenbos3,4, and Holger Pagel1,2
Paul Collart et al.
  • 1Agrosphere (IBG-3), Forschungszentrum Jülich GmbH, Germany
  • 2Institute of Crop Science and Resource Conservation, University of Bonn
  • 3Institute of Computer Science, University of Bonn
  • 4Lamarr Institute for Machine Learning and Artificial Intelligence

Soil microorganisms control organic matter cycling and, to a large extent, determine how soil systems can cope with and mitigate climate change and environmental threats. Integrating them explicitly into process-based soil models is critical for predicting how soil carbon (C) flows and stocks change in ecosystems with time. Models are critical tools for integrating datasets with theory. However, integrating information from modern omics-based datasets is a challenge due to the nonlinear relationship between genomes and the actual function microbes express in their local environment. Functional traits can be defined and inferred from these genomic datasets to better leverage their information and better understand the complexity of the soil microbiome. Integrating trait information with process-based microbially explicit models provides an opportunity leverage genomic data for an improved soil carbon prediction.

We present a hybrid modeling framework that uses a data-driven neural network approach to derive microbial parameters of process-based models from metagenome inferred functional traits, leveraging information from metagenomic and DNA sequencing datasets. We combine a neural network (multi-layer perceptron) with a process-based soil model to set up a hybrid model. The neural network uses genomic trait data as the input and predicts biokinetic parameters of the process-based model. We trained the hybrid model with synthetic genomic trait datasets of varying complexity and time series of state variables of the process-based model (e.g. carbon dioxide production) to demonstrate the approach. Using trait inference from genomes, the model can learn several biokinetic parameters such as growth rates, dormancy rates, affinities to organic matter, growth yields or decay rates. The training uses a complex constraint-based loss function, informing the model from ecological theory and literature data, ensuring the realistic behavior of every non observed state variable during training such as active and dormant microbial pools. Compared to a ‘naïve’ hybrid model, the use of a more complex loss function reduces model equifinality and ensure realistic behavior of the non-observed state variables. Naïve loss function cannot efficiently learn the behavior of non-observed state variables and fail to predict realistic microbial dynamics. We present i) the concept of the hybrid soil modelling framework, ii) the constraint-based loss function approach, iii) the performance of constrained versus naïve hybrid models after training with different synthetic datasets.

How to cite: Collart, P., Gall, J., Schnepf, A., Doorenbos, L., and Pagel, H.: Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-8161, https://doi.org/10.5194/egusphere-egu26-8161, 2026.