EGU26-10617, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-10617
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Thursday, 07 May, 14:00–15:45 (CEST), Display time Thursday, 07 May, 14:00–18:00
 
Hall X1, X1.75
Earthquake phase picking in out-of-distribution data conditions: Can synthetic data from earthquake simulations help?
Peifan Jiang1,2, Janneke I. de Laat2, Xuben Wang1, and Islam Fadel2
Peifan Jiang et al.
  • 1College of Geophysics, Chengdu University of Technology, Chengdu, China (jpeifan@qq.com, wxb@cdut.edu.cn)
  • 2Faculty of Geo-lnformation Science and Earth Observation, University of Twente, Enschede, Netherlands (j.i.delaat@utwente.nl, i.e.a.m.fadel@utwente.nl)

Deep learning has improved automated seismic phase picking in recent years. However, many pickers are trained on a single dataset and often fail to generalize when deployed on out-of-distribution (OOD) data. Variations in earthquake magnitude, propagation distance, sensor instrumentation, and ambient noise between training and target domains lead to significant performance degradation under OOD conditions. To address this challenge, we propose a new framework to reduce performance degradation under OOD conditions based on a pipeline of seismic simulation -- phase labeling -- site-conditions simulation -- transfer learning. First, we use the seismic simulation tool AxiSEM3D to establish a 3-D waveform simulation, covering local, regional, and teleseismic scales. Next, we obtain precise P- and S-phase labels by computing theoretical arrival times from velocity models and refining these onsets with traditional automatic picking algorithms, ensuring high-fidelity phase annotations. Then, based on actual site conditions, we simulate instrument responses and synthesize ambient noise constrained by PPSD analysis. This gives us station-specific, noisy waveforms that closely match real observational conditions. Finally, we employ transfer learning to fine-tune a phase picker on this specific synthetic dataset, thereby enhancing the picker's performance in new conditions. The proposed framework aims to improve the ability of deep learning models to pick phases under OOD conditions. It enables reliable performance across regional variability and instrumentation differences without large-scale manual relabeling. It also reduces the amount of real data needed for training, making it useful for small datasets and leaving more data to analyse. Overall, this work introduces a transferable methodology for seismic phase picking under distribution shift and shows that physics-informed data augmentation combined with targeted transfer learning can effectively decrease OOD performance degradation, thereby increasing the applicability of deep-learning-based phase picking.

How to cite: Jiang, P., de Laat, J. I., Wang, X., and Fadel, I.: Earthquake phase picking in out-of-distribution data conditions: Can synthetic data from earthquake simulations help?, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-10617, https://doi.org/10.5194/egusphere-egu26-10617, 2026.