EGU23-10228
https://doi.org/10.5194/egusphere-egu23-10228
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Interpreting Machine Learning Models for Geochemistry Data Classification using Decision Boundary Maps

Yu Wang1,2, Kunfeng Qiu1, Alexandru Telea2, Zhaoliang Hou3, and Haocheng Yu1
Yu Wang et al.
  • 1School of Earth Sciences and Resources, China University of Geosciences, Beijing, China
  • 2Department of Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
  • 3Department of Geology, University of Vienna, Vienna, Austria

Machine learning has been shown to be a highly effective method for classifying geochemistry data, such as mineral forming environments and rock tectonics. However, it can be difficult to understand the decision-making processes of these models. To address this issue, we propose the use of Decision Boundary Maps (DBMs) as a visualization tool for interpreting machine learning models. These maps project high-dimensional geochemistry data onto a 2D plane and depict the decision boundaries in the projected space, providing a visual representation of the algorithm's decision-making processes. In addition, DBMs can reveal trends, correlations, and outliers in the data, helping to interpret the results obtained from machine learning-based geochemistry data classification. Seeing the positions of data points, rather than just class labels, is especially valuable because samples in geological categories often follow a sequence, such as a magmatic to hydrothermal transition. Observing the positions of data points allows for the identification of trends from one class to an adjacent class.

How to cite: Wang, Y., Qiu, K., Telea, A., Hou, Z., and Yu, H.: Interpreting Machine Learning Models for Geochemistry Data Classification using Decision Boundary Maps, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-10228, https://doi.org/10.5194/egusphere-egu23-10228, 2023.