Model Output Statistics (MOS) and Machine Learning applied to CAMS O3 forecasts: trade-offs between continuous and categorical skill scores

Hervé Petetin; Dene Bowdalo; Pierre-Antoine Bretonnière; Marc Guevara; Oriol Jorba; Jan Mateu armengol; Margarida Samso Cabre; Kim Serradell; Albert Soret; Carlos Pérez García-Pando

doi:https://doi.org/10.5194/egusphere-egu22-5746

[Back] [Session ITS2.6/AS5.1]

EGU22-5746, updated on 20 Apr 2023

https://doi.org/10.5194/egusphere-egu22-5746

EGU General Assembly 2022

© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Model Output Statistics (MOS) and Machine Learning applied to CAMS O₃ forecasts: trade-offs between continuous and categorical skill scores

Hervé Petetin

¹, Dene Bowdalo¹, Pierre-Antoine Bretonnière

¹, Marc Guevara

¹, Oriol Jorba

¹, Jan Mateu armengol

¹, Margarida Samso Cabre¹, Kim Serradell

¹, Albert Soret

¹, and Carlos Pérez García-Pando

^1,2

Hervé Petetin et al.

¹Barcelona Supercomputing Center, Barcelona, Spain (herve.petetin@bsc.es)
²ICREA, Catalan Institution for Research and Advanced Studies, Barcelona, Spain

Air quality (AQ) forecasting systems are usually built upon physics-based numerical models that are affected by a number of uncertainty sources. In order to reduce forecast errors, first and foremost the bias, they are often coupled with Model Output Statistics (MOS) modules. MOS methods are statistical techniques used to correct raw forecasts at surface monitoring station locations, where AQ observations are available. In this study, we investigate to what extent AQ forecasts can be improved using a variety of MOS methods, including persistence (PERS), moving average (MA), quantile mapping (QM), Kalman Filter (KF), analogs (AN), and gradient boosting machine (GBM). We apply our analysis to the Copernicus Atmospheric Monitoring Service (CAMS) regional ensemble median O₃ forecasts over the Iberian Peninsula during 2018–2019. A key aspect of our study is the evaluation, which is performed using a very comprehensive set of continuous and categorical metrics at various time scales (hourly to daily), along different lead times (1 to 4 days), and using different meteorological input data (forecast vs reanalyzed).

Our results show that O₃ forecasts can be substantially improved using such MOS corrections and that this improvement goes much beyond the correction of the systematic bias. Although it typically affects all lead times, some MOS methods appear more adversely impacted by the lead time. When considering MOS methods relying on meteorological information and comparing the results obtained with IFS forecasts and ERA5 reanalysis, the relative deterioration brought by the use of IFS is minor, which paves the way for their use in operational MOS applications. Importantly, our results also clearly show the trade-offs between continuous and categorical skills and their dependencies on the MOS method. The most sophisticated MOS methods better reproduce O₃ mixing ratios overall, with lowest errors and highest correlations. However, they are not necessarily the best in predicting the highest O₃ episodes, for which simpler MOS methods can give better results. Although the complex impact of MOS methods on the distribution and variability of raw forecasts can only be comprehended through an extended set of complementary statistical metrics, our study shows that optimally implementing MOS in AQ forecast systems crucially requires selecting the appropriate skill score to be optimized for the forecast application of interest.

Petetin, H., Bowdalo, D., Bretonnière, P.-A., Guevara, M., Jorba, O., Armengol, J. M., Samso Cabre, M., Serradell, K., Soret, A., and Pérez Garcia-Pando, C.: Model Output Statistics (MOS) applied to CAMS O₃ forecasts: trade-offs between continuous and categorical skill scores, Atmos. Chem. Phys. Discuss. [preprint], https://doi.org/10.5194/acp-2021-864, in review, 2021.

How to cite: Petetin, H., Bowdalo, D., Bretonnière, P.-A., Guevara, M., Jorba, O., Mateu armengol, J., Samso Cabre, M., Serradell, K., Soret, A., and Pérez García-Pando, C.: Model Output Statistics (MOS) and Machine Learning applied to CAMS O3 forecasts: trade-offs between continuous and categorical skill scores, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-5746, https://doi.org/10.5194/egusphere-egu22-5746, 2022.