EGU22-4039
https://doi.org/10.5194/egusphere-egu22-4039
EGU General Assembly 2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Apply noise filters for better forecast performance in Machine Learning

Nhung Le1,2,3, Benjamin Männel1, Randa Natras4, Pierre Sakic1,5, Zhiguo Deng1, and Harald Schuh ‬‬‬‬‬‬‬‬‬‬‬‬‬1,2
Nhung Le et al.
  • 1GFZ German Research Centre for Geosciences (Deutsches GeoForschungsZentrum), Department 1: Geodesy, Potsdam, Germany (lenhung.hunre@gmail.com)
  • 2Technische Universität Berlin, Germany
  • 3Hanoi University of Natural Resources and Environment, Vietnam
  • 4Deutsches Geodätisches Forschungsinstitut der Technischen Universität München (DGFI-TUM), Technical University of Munich, Germany
  • 5Institut de physique du globe de Paris, Université de Paris, Paris, France

Abstract:

In Machine Learning (ML), one of the crucial tasks is understanding data characteristics to be able to extract exactly relevant information, while noise contained in data can cause misleading estimations and decrease the generalizability of ML-based prediction models. So far, only few previous studies have applied noise filtering techniques when building forecast models. Hence, their efficiency on ML-based forecasts has not yet been comprehensively demonstrated. Therefore, we aim to determine optimal noise filters to enhance the forecast performance of Total Electron Contents (TEC), crustal motion, and Earth’s polar motion. We investigate six noise filtering algorithms (Moving Mean, Moving Median, Lowess, Loess, and Savitzky Golay) on forecast models to select the best-suited filters. Five ML algorithms are applied to train forecast models, that are Support Vector Machine (SVM), Regression Trees, Linear Regression (LR), Ensembles of Trees, and Gaussian Process Regression (GPR). The findings show that the Savitzky Golay algorithm is the most effective on the ML-based forecast models, followed by Loess and Gaussian filters, while Moving Mean is the least sensitive. Noise filters are more sensitive for forecast models based on SVM and LR than Ensembles of Trees and GPR. Applying the Savitzky Golay filter for SVM and LR optimal models can enhance the prediction accuracy up to 14.0 %, 16.1 % and 89.5 % corresponding to forecasting TEC, crustal motion, and Earth's polar motion, respectively; while that for Ensembles and GPR are only from approximate 3.0 to 27.0 %. Overall, using noise filters is one of the practical solutions to improve prediction performance. They can also be used to smoothen time series with variable characteristics and to generalize high-rate data.

Keywords:

Machine Learning, Noise filters, Savitzky Golay filter, TEC forecast, Crustal motion, Earth’s polar motion.

How to cite: Le, N., Männel, B., Natras, R., Sakic, P., Deng, Z., and Schuh ‬‬‬‬‬‬‬‬‬‬‬‬‬, H.: Apply noise filters for better forecast performance in Machine Learning, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-4039, https://doi.org/10.5194/egusphere-egu22-4039, 2022.

Displays

Display file