EGU25-10915, updated on 15 Mar 2025
https://doi.org/10.5194/egusphere-egu25-10915
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Wednesday, 30 Apr, 16:15–18:00 (CEST), Display time Wednesday, 30 Apr, 14:00–18:00
 
Hall X5, X5.134
Assessing the Geological Plausibility of Machine Learning Borehole Interpretations: A Case Study in the Roer Valley Graben
Sebastián Garzón1, Willem Dabekaussen2, Eva De Boever2, Freek Busschers2, Siamak Mehrkanoon3, and Derek Karssenberg1
Sebastián Garzón et al.
  • 1Department of Physical Geography, Faculty of Geosciences, Utrecht University, Utrecht, The Netherlands
  • 2TNO – Geological Survey of the Netherlands, Utrecht, The Netherlands
  • 3Department of Information and Computing Sciences, Faculty of Science, Utrecht University, Utrecht, The Netherlands

Expert interpretation of borehole data is a critical component of geological modelling, offering essential insights into the spatial distribution of geological units within the subsurface. For large-scale regional mapping efforts, expert interpretation of all available data is impractical due to the sheer volume of boreholes. Therefore, many 3D geological subsurface models rely only on a small portion of all available data. Machine learning (ML) models can be used to automate borehole data interpretation, increasing data density. However, these automated interpretations must adhere to strict spatial and stratigraphical relationships to be consistent with the established geological knowledge of the area. Using a dataset of 1,400 boreholes with expert interpretations from the Roer Valley Graben (Southeast Netherlands), we explore how ML models can be integrated into geological modelling workflows, highlighting the challenge of ensuring compatibility with geological principles and known spatial relationships. We evaluate the model performance using traditional metrics such as accuracy, Cohen's kappa and F1 Score and newly proposed geology-inspired metrics to quantify the ability of Random Forest and Neural Network models to interpret borehole data into lithostratigraphic units while preserving key geological relationships. Our results demonstrate that while many models achieve accuracy values of 75% to 80%, Neural Networks perform significantly better in capturing the expected sequential relationships between geological units, achieving up to 96% of geological transitions between geological units that are plausible, compared to 65% for the best-performing Random Forest model selected based on traditional metrics. This study underscores the need for domain-specific metrics in evaluating model performance and the potential for ML to increase the volume of data incorporated in subsurface models.

How to cite: Garzón, S., Dabekaussen, W., De Boever, E., Busschers, F., Mehrkanoon, S., and Karssenberg, D.: Assessing the Geological Plausibility of Machine Learning Borehole Interpretations: A Case Study in the Roer Valley Graben, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-10915, https://doi.org/10.5194/egusphere-egu25-10915, 2025.