Where Is Weather Predictable and Which Models Get It Right? Global Assessment of Conventional and AI-Based ForecastModels

Muhsin Puthiyaveettil; Hylke Beck; Jun Ma

doi:https://doi.org/10.5194/egusphere-egu26-7617

[Back] [Session AS1.1]

EGU26-7617, updated on 14 Mar 2026

https://doi.org/10.5194/egusphere-egu26-7617

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Where Is Weather Predictable and Which Models Get It Right? Global Assessment of Conventional and AI-Based ForecastModels

Muhsin Puthiyaveettil¹, Hylke Beck², and Jun Ma³

Muhsin Puthiyaveettil et al.

¹King Abdullah University of Science and Technology, Saudi Arabia (muhsin.puthiyaveettil@kaust.edu.sa)
²King Abdullah University of Science and Technology, Saudi Arabia (hylke.beck@kaust.edu.sa)
³King Abdullah University of Science and Technology, Saudi Arabia (jun.ma.1@kaust.edu.sa)

Accurate medium-range weather guidance is essential, yet end-users lack clear, location-specific evidence on where forecasts are predictable and which freely available global systems perform best. We present a global evaluation of three conventional numerical weather prediction (NWP) models (ICON, IFS, GEFS) and an AI forecast model (AIFS) for 1-10-day lead times, focusing on four near-surface variables (2-m air temperature, precipitation, 10-m wind speed, and 2-m relative humidity). Using 00 UTC cycles over 1 September 2024 to 30 November 2025, we resampled forecasts to a 1° grid and assessed day-to-day variability (correlation and root mean square error), mean bias, variance bias, and lead-time dependence (drift) against multiple references (primarily JRA-3Q, with additional evaluation against ERA5, station data, and additionally IMERG-Late for precipitation). AIFS achieves the highest skill for temperature, precipitation, and wind at all lead times (relative humidity is unavailable from AIFS); at 3-day lead it explains, on average, 53\% more variance in daily precipitation globally than the next-best model (ICON), and 232\% more variance than GEFS in the tropics. Among the conventional systems, ICON is generally most skillful, while GEFS ranks lowest overall. Mean-bias drift is negligible across models, but variance drift is evident for several variables, most notably increasingly attenuated AIFS precipitation variability with lead time. Model correlation rankings are robust across reference datasets, although precipitation and humidity show greater reference sensitivity than temperature. We also map global predictability using 3-day lead daily temporal correlation of the locally best-performing model, showing highest predictability for temperature and wind in mid-to-high latitudes and markedly lower predictability for precipitation in the tropics. Our study provides actionable guidance on where global forecasts can be trusted and establishes a baseline for future AI and NWP model assessments.

How to cite: Puthiyaveettil, M., Beck, H., and Ma, J.: Where Is Weather Predictable and Which Models Get It Right? Global Assessment of Conventional and AI-Based ForecastModels, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-7617, https://doi.org/10.5194/egusphere-egu26-7617, 2026.