EGU23-14585
https://doi.org/10.5194/egusphere-egu23-14585
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Correlation Analysis of GNSS Data Quality Indicators and Position Time Series using Machine-Learning Algorithms

Fikri Bamahry, Juliette Legrand, Carine Bruyninx, Eric Pottiaux, and Andras Fabian
Fikri Bamahry et al.
  • Royal Observatory of Belgium, Reference systems and Planetology, Brussels, Belgium (fikri.bamahry@oma.be)

The EUREF Permanent GNSS Network Central Bureau (EPN CB, www.epncb.eu [1]) monitors the quality of the daily GNSS observations of the EPN stations covering the period 1996-today. The associated data quality indicators (the number of observed versus expected observations in dual frequency, the lowest elevation cut-off observed, the number of missing epochs, the number of satellites, the number of maximum observations, and the number of cycle slips) are used to assess EPN stations' performance and as input for outlier detection in the daily position time series of the 400+ GNSS reference stations in Europe. Due to the increasing number of GNSS stations, the development of an automated algorithm to identify coordinate outliers caused by degraded GNSS data quality would allow to reduce the effort of human interpretation of the data quality indicators. We investigate the correlations between daily GNSS data quality metrics and daily position estimates to achieve this.

This study assesses several machine-learning classification algorithms to find a suitable data-driven model based on the correlation between degraded GNSS data quality metrics and quality degradation in position time series. Based on this investigation, the Random Forest algorithm proved to be the most precise algorithm, with an Area Under the Receiver Operating Characteristic Curve (AUC) equal to 0.95, and a correct classification of almost 90% of the test dataset. The GNSS data quality indicators ‘maximum observations‘ and ‘cycle slips’ are also shown as the most important parameters driving this model, which is in accordance with the data quality criteria for GNSS stations that IGS has determined [2]. Our model will be implemented in our EPN CB operation routine to develop a new monitoring system that can detect quality degradation on the GNSS station that will enable to improve the reliability of the EPN reference frame product by detecting position outliers due to degraded GNSS data quality. Here, we will present the current development of this automated algorithm, the challenges we faced, and the preliminary results of this work.

References:

[1] C. Bruyninx, J. Legrand , D. Mesmaker, A. Moyaert, A. Fabian (2022): EUREF Permanent Network Central Bureau (EPN CB) Information System, https://doi.org/10.24414/ROB-EUREF-EPNCB

[2] IGS (2019) Questions about the data quality graphs. https://kb.igs.org/hc/en-us/articles/204229743-Questions-about-the-data-quality-graphs

How to cite: Bamahry, F., Legrand, J., Bruyninx, C., Pottiaux, E., and Fabian, A.: Correlation Analysis of GNSS Data Quality Indicators and Position Time Series using Machine-Learning Algorithms, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-14585, https://doi.org/10.5194/egusphere-egu23-14585, 2023.