EGU24-5044, updated on 08 Mar 2024
https://doi.org/10.5194/egusphere-egu24-5044
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

Event-based features: An improved feature extraction approach to enrich machine learning based labquake forecasting

Sadegh Karimpouli1, Grzegorz Kwiatek1, Patricia Martínez-Garzón1, Georg Dresen1,2, and Marco Bohnhoff1,3
Sadegh Karimpouli et al.
  • 1Helmholtz Centre Potsdam, GFZ German Research Centre for Geosciences, Potsdam, Germany. (sadegh.karimpouli@gfz-potsdam.de)
  • 2Institute of Earth and Environmental Sciences, Universität Potsdam, Potsdam, Germany.
  • 3Department of Earth Sciences, Free University Berlin, Berlin, Germany.

Earthquake forecasting is a highly complex and challenging task in seismology ultimately aiming to save human lives and infrastructures. In recent years, Machine Learning (ML) methods have demonstrated progressive achievements in earthquake processing and even labquake forecasting. Developing a more general and accurate ML model for more complex and/or limited datasets is obtained by refining the ‘ML models’ and/or enriching the ‘input data’. In this study, we present an event-based approach to enrich the input data by extracting spatio-temporal seismo-mechanical features that are dependent on the origin time and location of each event. Accordingly, we define and analyze a variety of features such as: (a) immediate features, defined as the features which benefit from very short characteristics of the considered event in time and space, (b) time-space features, based on the subsets of acoustic emission (AE) catalog constrained by time and space distance from the considered event, and (c) family features, extracted from topological characteristics of the clustered (family) events extracted from clustering analysis in different time windows. We use AE catalogs recorded by tri-axial stick-slip experiments on rough fault samples to compute event-based features. Then, a random forest classifier is applied to forecast the occurrence of a large magnitude event (MAE>3.5) in the next time window. Results show that to obtain a more accurate forecasting model, one needs to separate background and clustered activities. Based on our results, the classification accuracy when the entire catalog data is used reaches 73.2%, however, it shows a remarkable improvement for separated background and clustered populations with an accuracy of 82.1% and 89.0%, respectively. Feature importance analysis reveals that not only AE-rate, seismic energy and b-value are important, but also family features developed from a topological tree decomposition play a crucial role for labquake forecasting.

How to cite: Karimpouli, S., Kwiatek, G., Martínez-Garzón, P., Dresen, G., and Bohnhoff, M.: Event-based features: An improved feature extraction approach to enrich machine learning based labquake forecasting, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-5044, https://doi.org/10.5194/egusphere-egu24-5044, 2024.