EGU25-16190, updated on 15 Mar 2025
https://doi.org/10.5194/egusphere-egu25-16190
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Wednesday, 30 Apr, 10:45–12:30 (CEST), Display time Wednesday, 30 Apr, 08:30–12:30
 
Hall X4, X4.90
Manual data review and quality control – An add-on to SaQC
Nicole Büttner1, Benjamin Louisot2, Christof Lorenz2, David Schäfer3, and Romy Fösig1
Nicole Büttner et al.
  • 1Institute of Meteorology and Climate Research Atmospheric Aerosol Research, Karlsruhe Institute of Technology, Karlsruhe, Germany (nicole.buettner@kit.edu)
  • 2Institute of Meteorology and Climate Research Atmospheric Environmental Research, Karlsruhe Institute of Technology, Garmisch-Partenkirchen, Germany
  • 3Research Data Management Team, Helmholtz Centre for Environmental Research GmbH – UFZ, Leipzig, Germany

The growing volume of high-resolution time-series data in Earth system science requires the implementation of standardised and reproducible quality control workflows to ensure compliance with the FAIR data standards. Automated tools such as SaQC1 address this need, but lack the capacity for manual data review and flagging. We are therefore planning to develop a Python-based tool with an intuitive graphical user interface (GUI) for local machines, thereby enhancing the functionality of SaQC. It is anticipated that the tool will be user-friendly, even for those with limited experience of Python. The GUI will be capable of interactively visualising the time-series data, highlighting the data that has already been automatically flagged. The selection of data points may be accomplished by clicking on them or via box-selection, and a flag may be assigned via a dropdown menu. An optional comment field can be used to record supplementary information, such as details of pollution events. Moreover, the option to unflag data that has failed the automated quality control process, but which is considered valid by the scientist, will be available.

The manual flagging tool will be based on SaQC, thereby facilitating a future integration into this software package. Consequently, integration into an existing SaQC workflow will be straightforward. It should be noted, however, that this is not exclusive to SaQC users; it can be easily applied to data created by another tool for automatic quality control. A simple conversion of the data via the pandas library will be sufficient for utilisation of the manual flagging tool. The flagging schemes can either be adopted from SaQC or user-specific schemes can be integrated. Once the flagging process is completed, the user is able to decide how to export the data set.

The manual flagging tool represents a valuable addition to existing toolkits for all scientists handling time-series datasets, effectively completing the data quality control process. From a scientific perspective, the benefits of this tool include increased efficiency and traceability in the data flow, as well as improved data quality through the fine-tuning of automatic controls based on experience and contextual knowledge.

 

1 Schäfer, David, Palm, Bert, Lünenschloß, Peter, Schmidt, Lennart, & Bumberger, Jan. (2023). System for automated Quality Control - SaQC (2.3.0). Zenodo. https://doi.org/10.5281/zenodo.5888547

How to cite: Büttner, N., Louisot, B., Lorenz, C., Schäfer, D., and Fösig, R.: Manual data review and quality control – An add-on to SaQC, EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-16190, https://doi.org/10.5194/egusphere-egu25-16190, 2025.