Reconciling process-based modelling and machine learning in biogeochemistry
Recent developments aim at bridging the gaps between “processes and data” and to mitigate these issues. Notably, Interpretable Machine Learning is a developing field, embracing efforts to render model predictions comprehensible. To tackle the second issue, a framework to implement physical constraints or other relevant domain knowledge into ML models is emerging, provisionally coined physics-aware ML modelling. Finally, the data-driven causal inference and causal network construction are promising avenues to constrain predictor selection and connect to process knowledge.
For this session, we invite contributions from the biogeosciences dealing with one or more of the following topics:
- Application of interpretable ML methods to ecosystem datasets
- Combinations of process-based models and ML methods
- Implementation of physical constraints into ML approaches, e.g. physics-aware regression
- Identification of drivers using causal inferences / causal networks
Knowledge domains relevant for the session include, but are not limited to, carbon and nitrogen cycles in terrestrial ecosystems, hydrology, soil processes, remote sensing, paleobiogeosciences, or extreme events.