VAE4OBS: Denoising ocean bottom seismograms using variational autoencoders

Maria Tsekhmistrenko; Ana Ferreira; Kasra Hosseini; Thomas Kitching

doi:https://doi.org/10.5194/egusphere-egu22-12351

[Back] [Session NP4.1]

EGU22-12351

https://doi.org/10.5194/egusphere-egu22-12351

EGU General Assembly 2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

VAE4OBS: Denoising ocean bottom seismograms using variational autoencoders

Maria Tsekhmistrenko¹, Ana Ferreira¹, Kasra Hosseini², and Thomas Kitching³

Maria Tsekhmistrenko et al.

¹Department of Earth Sciences, University College London, London, UK
²The Alan Turing Institute, The British Library, London, UK
³Mullard Space Science Laboratory, University College London, London, UK

Data from ocean-bottom seismometers (OBS) are inherently more challenging than their land counterpart because of their noisy environment. Primary and secondary microseismic noises corrupt the recorded time series. Additionally, anthropogenic (e.g., ships) and animal noise (e.g., Whales) contribute to a complex noise that can make it challenging to use traditional filtering methods (e.g., broadband or Gabor filters) to clean and extract information from these seismograms.

OBS deployments are laborious, expensive, and time-consuming. The data of these deployments are crucial in investigating and covering the "blind spots" where there is a lack of station coverage. It, therefore, becomes vital to remove the noise and retrieve earthquake signals recorded on these seismograms.

We propose analysing and processing such unique and challenging data with Machine Learning (ML), particularly Deep Learning (DL) techniques, where conventional methods fail. We present a variational autoencoder (VAE) architecture to denoise seismic waveforms with the aim to extract more information than previously possible. We argue that, compared to other fields, seismology is well-posed to use ML and DL techniques thanks to massive datasets recorded by seismograms.

In the first step, we use synthetic seismograms (generated with Instaseis) and white noise to train a deep neural network. We vary the signal-to-noise ratio during training. Such synthetic datasets have two advantages. First, we know the signal and noise (as we have injected the noise ourselves). Second, we can generate large training and validation datasets, one of the prerequisites for high-quality DL models.

Next, we increased the complexity of input data by adding real noise sampled from land and OBS to the synthetic seismograms. Finally, we apply the trained model to real OBS data recorded during the RHUM-RUM experiment.

We present the workflow, the neural network architecture, our training strategy, and the usefulness of our trained models compared to traditional methods.

How to cite: Tsekhmistrenko, M., Ferreira, A., Hosseini, K., and Kitching, T.: VAE4OBS: Denoising ocean bottom seismograms using variational autoencoders, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-12351, https://doi.org/10.5194/egusphere-egu22-12351, 2022.