EGU26-5192, updated on 13 Mar 2026
https://doi.org/10.5194/egusphere-egu26-5192
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Monday, 04 May, 16:15–18:00 (CEST), Display time Monday, 04 May, 14:00–18:00
 
Hall X4, X4.121
Geochemistry π: Machine Learning for Geochemists Who Don’t Want to Code
J.Zhou ZhangZhou
J.Zhou ZhangZhou
  • Zhejiang University, Earth Sciences, Hangzhou, China (zhangzhou333@zju.edu.cn)

Geochemistry π is an open-source automated machine learning Python framework. Geochemists need only provide tabulated data (e.g. excel spreadsheet) and select the desired options to clean data and run machine learning algorithms. The process operates in a question-and-answering format, and thus does not require that users have coding experience. Version 0.7.0 includes machine learning algorithms for regression, classification, clustering, dimension reduction and anomaly detection. After either automatic or manual parameter tuning, the automated Python framework provides users with performance and prediction results for the trained machine learning model. Based on the scikit-learn library, Geochemistry π has established a customized automated process for implementing machine learning. The Python framework enables extensibility and portability by constructing a hierarchical pipeline architecture that separates data transmission from algorithm application. The AutoML module is constructed using the Cost-Frugal Optimization and Blended Search Strategy hyperparameter search methods from the A Fast and Lightweight AutoML Library, and the model parameter optimization process is accelerated by the Ray distributed computing framework. The MLflow library is integrated into machine learning lifecycle management, which allows users to compare multiple trained models at different scales and manage the data and diagrams generated. In addition, the front-end and back-end frameworks are separated to build the web portal, which demonstrates the machine learning model and data science workflow through a user-friendly web interface. In summary, Geochemistry π provides a Python framework for users and developers to accelerate their data mining efficiency with both online and offline operation options. All source code is available on GitHub  (https://github.com/ZJUEarthData/geochemistrypi), with a detailed operational manual catering to both users and developers (https://geochemistrypi.readthedocs.io/en/latest/).

How to cite: ZhangZhou, J. Z.: Geochemistry π: Machine Learning for Geochemists Who Don’t Want to Code, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-5192, https://doi.org/10.5194/egusphere-egu26-5192, 2026.