EGU22-11689
https://doi.org/10.5194/egusphere-egu22-11689
EGU General Assembly 2022
© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Graphical Model Assessment of Probabilistic Forecasts

Moritz N. Lang1, Reto Stauffer1,2, and Achim Zeileis1
Moritz N. Lang et al.
  • 1Department of Statistics, Faculty of Economics and Statistics, Universität Innsbruck, Innsbruck, Austria (moritz.lang@uibk.ac.at)
  • 2Digital Science Center, Universität Innsbruck, Innsbruck, Austria

As a consequence of the growing importance of probabilistic predictions in various application fields due to a necessary functional risk management and strategy, there is an increasing demand for appropriate probabilistic model evaluation. Besides proper scoring rules, which can evaluate not only the expectation but the entire predictive distribution, graphical assessment methods are particularly advantageous to diagnose possible model misspecifications.

Probabilistic forecasts are often based on distributional regression models, whereby the computation of predictive distributions, probabilities, and quantiles is generally dependent on the software (package) being used. Therefore, routines to graphically evaluate probabilistic models are not always available and if so then only for specific types of models and distributions provided by the corresponding package. An easy to use unified infrastructure to graphical assess and compare different probabilistic model types does not yet exist. Trying to fill that gap, we present a common conceptual framework accompanied by a flexible and object-oriented software implementation in the R package topmodels (https://topmodels.R-Forge.R-project.org/).  

The package includes visualizations for PIT (probability integral transform) histograms, Q-Q (quantile-quantile) plots of (randomized) quantile residuals, rootograms, reliability diagrams, and worm plots. All displays can be rendered in base R as well as in ggplot2 and provide different options for, e.g., computing confidence intervals, scaling or setting graphical parameters. Using examples of post-processing precipitation ensemble forecasts, we further discuss how all theses types of graphics can be compared to each other and which types of displays are particularly useful for bringing out which types of model deficiencies.

How to cite: Lang, M. N., Stauffer, R., and Zeileis, A.: Graphical Model Assessment of Probabilistic Forecasts, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-11689, https://doi.org/10.5194/egusphere-egu22-11689, 2022.

Displays

Display file