OSA1.2 | Data Assimilation and Ensemble Forecasting (short, medium, extended range): traditional versus machine learning
Data Assimilation and Ensemble Forecasting (short, medium, extended range): traditional versus machine learning
Convener: Andrea Montani | Co-conveners: Zahra Parsakhoo, Fernando Prates
Orals Tue3
| Tue, 09 Sep, 14:00–16:00 (CEST)
 
Kosovel Hall
Posters P-Tue
| Attendance Tue, 09 Sep, 16:00–17:15 (CEST) | Display Mon, 08 Sep, 08:00–Tue, 09 Sep, 18:00
 
Grand Hall, P16–19
Tue, 14:00
Tue, 16:00
The session will focus on the latest developments in data assimilation and ensemble forecasting techniques, ranging from links with nowcasting to the ability to produce and deliver skillful and reliable forecasts of high-impact extreme events to the medium and extended range.
We welcome any methods and ideas, both traditional and machine learning-based, on how to assimilate data, but also on approaches to create and use an ensemble forecast, and how these techniques can vary with the forecast lead-time. Of particular interest will be the perspective of forecasters and the use of ensembles in forecasting extreme weather events.
The conveners invite papers on various issues associated with Data Assimilation and Ensemble Forecasting for weather prediction, such as:
- intercomparison and study of the complementarity between different assimilation techniques: Kalman filtering, variational assimilation, nudging techniques for frequent analysis cycles, etc;
- variational techniques with longer assimilation windows and weak constraint methods to allow for the inclusion of model error estimates;
- ensemble data assimilation systems and flow dependent estimation of background and on-the-fly error statistics;
- representation of uncertainties in initial conditions, model and boundary coupling in Global and Limited-Area Ensemble Prediction Systems;
- verification and calibration methods of Ensemble Prediction Systems;
- use of TIGGE database;
- application of ensemble forecast in different sectors, including energy, health, transport, agriculture, insurance, finance, etc.

Orals: Tue, 9 Sep, 14:00–16:00 | Kosovel Hall

Chairpersons: Zahra Parsakhoo, Andrea Montani
14:00–14:15
|
EMS2025-28
|
Onsite presentation
Marco Stefanelli, Ziga Zaplotnik, and Gregor Skok

Forecasting convective storms remains a significant challenge in Numerical Weather Prediction (NWP). Data Assimilation (DA) plays a crucial role by improving the initial conditions for forecasts through the integration of observational data with previous model outputs (background). Among observational platforms, weather radar is particularly valuable due to its high spatial and temporal resolution, offering detailed information for storm monitoring. Therefore, assimilating radar data into Numerical Weather Prediction models has the potential to substantially enhance the accuracy of storm forecasts. However, studies, such as Fabry and Meunier (2020), have shown that short-term precipitation forecasts produced through radar-based extrapolation methods (nowcasting) often outperform model-based forecasts with Data Assimilation. This is largely because radar primarily provides information on precipitation structures and intensities within the storm-affected region but lacks direct insights into broader environmental conditions like temperature, wind fields, and humidity. These atmospheric variables, both within and outside the storm system, are critical for accurate storm evolution forecasts. A promising approach to address this limitation is the application of machine learning (ML) to develop a more advanced observation operator for Data Assimilation. Specifically, an encoder-decoder neural network can be trained to relate Numerical Weather Prediction model variables (such as temperature, wind components, and relative humidity) to corresponding radar reflectivity observations. This ML-based observation operator captures complex non-linear relationships between model variables and radar data observations, while its Jacobian can propagate radar-derived information to other atmospheric variables in the Data Assimilation system, potentially leading to improved representation of storm dynamics and enhancing convective storm forecasting capabilities.

How to cite: Stefanelli, M., Zaplotnik, Z., and Skok, G.: A neural network-based observation operator for weather radar data assimilation, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-28, https://doi.org/10.5194/ems2025-28, 2025.

Show EMS2025-28 recording (10min) recording
14:15–14:30
|
EMS2025-109
|
Onsite presentation
Boštjan Melinc, Uroš Perkan, and Žiga Zaplotnik

Data assimilation of atmospheric observations traditionally relies on variational and Kalman filter methods. Several ideas of merging data assimilation with machine learning have emerged in recent years. One possibility would be to perform variational data assimilation in the latent space of an autoencoder (AE). In our approach, we define and minimise the three-dimensional variational (3D-Var) data assimilation cost function there to determine the analysis that optimally fuses simulated observations and the encoded 24-hour neural network forecast (background), accounting for their errors. Similar to several previous studies, the latent space has Gaussian properties, which are favourable for variational data assimilation, and the climatology-based background-error covariance (B) matrix measured and represented in the latent space turns out to be quasi-diagonal, which, together with the cost function manipulation, leads to a vast computation process speed-up.  

The main focus of this presentation is to study the physical feasibility of the analysis increments that we obtain using a multilevel multivariate AE and the U-net type neural network forward model (NNfwd) in this kind of framework. The analysis increments after observing geopotential in the midlatitudes give a multivariate geostrophic pattern, obey the thermal wind balance, and resemble some distinctive local features, such as orography and land-sea distribution. Also, the magnitude of the analysis increment at the observation location and its standard deviation closely match their theoretical values, and the impact of the observation reasonably fades with distance in both horizontal and vertical directions. On the other hand, assimilating increased moisture in the tropics leads to increments that align with the physics of the emerging tropical convective cloud systems despite the clouds and precipitation not being explicitly forecasted by the AE and NNfwd. Also, the increments after assimilating the observations in the tropics seem to distinguish between the local horizontal length scales. Finally, we study the plausibility of the model response after running the forecast with NNfwd using the analysis as the initial condition. 

How to cite: Melinc, B., Perkan, U., and Zaplotnik, Ž.: Exploring a neural network-based background-error covariance model , EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-109, https://doi.org/10.5194/ems2025-109, 2025.

Show EMS2025-109 recording (12min) recording
14:30–14:45
|
EMS2025-691
|
Onsite presentation
Ziga Zaplotnik, Josef Schroettle, Jorge Bandeiras, Benoit Vanniere, Emiliano Orlandi, Michael Maier-Gerber, Elias Holm, and Massimo Bonavita

Enhancing the spatial resolution of global analyses and forecasts enables weather prediction systems to more accurately capture rapidly evolving mesoscale and convective-scale atmospheric phenomena. The Integrated Forecast System (IFS) of ECMWF consists of an Earth-system model coupled with an incremental 4D-Var data assimilation (DA) system, which has recently been experimentally upgraded to higher spatial resolution.

This upgrade involved higher resolution (4.4 km) 4D-Var trajectories and higher resolution (20 km) 4D-Var minimizations using the tangent linear model (TLM) and its adjoint (ADM). This configuration requires less observation thinning and resulted in greater use of high-resolution observations, such as those from multispectral imagers aboard Himawari and Meteosat satellites. Shorter time steps in TLM and ADM further enabled the use of shorter observation time slots (400 seconds), allowing for more accurate comparison between observations and their model equivalents.

We demonstrate that this setup leads to significant improvements in the accuracy of initial conditions and enhances tropospheric forecast skill, extending medium-range predictability by 6 to 12 hours. We argue that the improved forecast skill can be attributed to a better representation of large-scale features in the stratosphere. Higher resolution DA and forecasts also lead to improved forecasting of extreme precipitation events, as well as the track and intensity of tropical cyclones. Furthermore, the improved initial conditions significantly increase the predictability machine-learning-based forecasts.

The benefits of the higher resolution DA system are demonstrated using the case of Tropical Cyclone Otis, which made landfall as a Category 5 cyclone, but was predicted by most global and regional NWP models to peak only at tropical storm intensity – a major forecasting bust. By employing higher resolution 4D-Var DA, we were able to replicate the observed rapid intensification of TC Otis. 

How to cite: Zaplotnik, Z., Schroettle, J., Bandeiras, J., Vanniere, B., Orlandi, E., Maier-Gerber, M., Holm, E., and Bonavita, M.: Towards higher resolution data assimilation in ECMWF IFS, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-691, https://doi.org/10.5194/ems2025-691, 2025.

Show EMS2025-691 recording (13min) recording
14:45–15:00
|
EMS2025-520
|
Onsite presentation
Stefano Federico, Eugenio Realini, Rosa Claudia Torcasio, Claudio Transerici, Giovanna Venuti, Xiangyang Song, and Mattia Crespi

The ICREN and NEW-ARGENT projects aim at studying the impact of GNSS data assimilation on the precipitation forecast over Italy. Specifically, the assimilation of GNSS-ZTD together with lightning was studied in the ICREN project, while NEW-ARGENT considered the assimilation of GNSS along slant paths. 

The approach used in both projects is the VSF (Very Short-term Forecast) in which a 6h data assimilation phase is followed by a 6h forecast. In addition, both projects used the WRF model and the 3DVar data assimilation system of CNR-ISAC (Federico, 2013; Torcasio et al., 2024). 

The ICREN project focuses over northern Italy, where intense convective events occur since late spring to early fall. More than 100 case studies were considered and GNSS-ZTD was assimilated with and without lightning data. Different strategy of simulations were considered and results show the important role of the VSF forecast and data assimilation in predicting convective rainfall. Specifically, the assimilation of lightning was very useful for the first hours of forecast improving substantially the prediction of intense precipitation (higher than 30 mm/3h). In any case, assimilating lightning was useful also at lower precipitation thresholds.  The GNSS-ZTD data assimilation had a lower impact on the precipitation forecast, nevertheless it improved the forecast for all precipitation thresholds. However, the combination of both data had the largest impact on the precipitation forecast, especially for the forecast period from 1h to 4h after the ending of the assimilation phase. Finally, the ICREN project showed a significant impact of the GNSS-ZTD and lightning data assimilation up to 6h forecast.

The NEW-ARGENT project considered four months (May, June and October 2023, and September 2022) and focused on the assimilation of GNSS along slant paths for three regions: Lazio, Lombardy and Sicily. This problem was firstly solved assimilating GNSS gradient that partially recover the anisotropy of GNSS observations. Specifically, we compared the precipitation forecast at the short-range in four different experiments set-up: CTRL (control), without GNSS data assimilation, GNSS-ZTD, with the assimilation of GNSS zenith delay, GNSS-GRA, in which the gradients are assimilated, and GNSS-ZTD-GRA, in which both the gradients and the zenith total delay are assimilated.Results show that the assimilation of the gradients, both alone and with the GNSS-ZTD, is beneficial for the improvement of precipitation forecast of convective events over Italy.

 

References

Federico, S.: doi:10.5194/amt-6-3563-2013, 2013.

Torcasio, R.C.; et al.: https://doi.org/10.3390/rs16101769, 2023.

 

Acknowledgments 

This work has been realized in the projects PRIN-PNRR NEW-ARGENT (MUR contract- P20228LMA2) and ICREN-PRIN project (MUR- CUP: D53D23004770006). 

How to cite: Federico, S., Realini, E., Torcasio, R. C., Transerici, C., Venuti, G., Song, X., and Crespi, M.: Assimilation of GNSS data over Italy: the results of ICREN and NEW-ARGENT projects, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-520, https://doi.org/10.5194/ems2025-520, 2025.

Show EMS2025-520 recording (14min) recording
15:00–15:15
|
EMS2025-302
|
Onsite presentation
Zahra Parsakhoo, Chiara Marsigli, Christoph Gebhardt, Axel Seifert, Daniel Reinert, Thorsten Steinert, and Jan Keller

Within the context of the “Global-to-Regional ICON digital twin” (GLORI) project, a convection-permitting ensemble forecasting is established in order to study the predictability of high-impact weather events with high-resolution modeling (up to 500 m) and the influence of the land-surface—atmosphere coupling mechanisms.

At the DWD, ICON-D2-EPS is the limited-area high-resolution component of the ICON modeling system, running as an ensemble of 20 members at 2 km horizontal resolution over Germany and surrounding areas. The perturbed initial conditions are provided by the km-scale ensemble data assimilation system KENDA, run at the same resolution, assimilating a wide range of observations, including radar-derived radar volumes. Boundary conditions are provided by ICON-EPS, the global ensemble with a refinement at 13 km over Europe and is refreshed every 3 hours.

In this work, we employ a nested domain with horizontal resolution of 1-km in the southern region of the ICON-D2, encompassing the Alps mountains. We run a 24-hour forecast simulation starting at 00UTC on the 21st of June 2022 and the 20th of June 2024, with 20 ensemble members. The choice of the date are crucial as they correspond to a day when the DWD recorded instances of heavy rain and hail in southern Germany.

In our study, we perform all experiments using a two-moment microphysics scheme. Additionally, we incorporate the standard operational model perturbations and subsequently analyze the influence of various convection schemes on the predictability of processes that lead to convection development. Specifically, we examine the behavior of the convection scheme in two configurations: shallow convection only and deep convection parameterization in the so-called gray-zone-tuning version. By selectively enabling and disabling these schemes, our goal is to evaluate their individual contributions to predictability.

Following this, we implement a tailored variant of the stochastically perturbed parameterization scheme (SPP) in ICON in order to delve into the influence of some uncertain parameters within either microphysics, convection or turbulent parameterization, further advancing our understanding of its effects on the model performance.

How to cite: Parsakhoo, Z., Marsigli, C., Gebhardt, C., Seifert, A., Reinert, D., Steinert, T., and Keller, J.: Studies of Convection-Permitting Ensemble Forecasting for ICON-D2 with a 1km Nest over the Alps, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-302, https://doi.org/10.5194/ems2025-302, 2025.

Show EMS2025-302 recording (13min) recording
15:15–15:30
|
EMS2025-47
|
Onsite presentation
Jian Tang, Yuejian Zhu, and Kan Dai

In recent years, data-driven weather models have advanced rapidly, leveraging deep learning and vast reanalysis datasets to generate high-resolution forecasts with remarkable speed. Models such as Pangu-Weather, GraphCast and FourCastNet have demonstrated competitive or even superior accuracy compared to traditional numerical weather prediction (NWP) models. These AI-based models eliminate the need for explicit physical equations, relying instead on learned patterns from historical data. However, challenges remain, including limited physical interpretability, difficulty in handling extreme events, and the need for robust uncertainty quantification. Despite these challenges, the rapid progress of data-driven weather models suggests they will play an increasingly important role in future forecasting systems, potentially complementing or even transforming traditional NWP methods.

Pangu-Weather comprises four AI-driven models, each optimized for different forecast lead times (1-hour, 3-hour, 6-hour, and 24-hour). These models exhibit distinct error growth rates and capture different weather phenomena, ranging from rapidly evolving atmospheric features to large-scale global weather patterns. To leverage these characteristics, this study explores a stochastic combination approach for generating both initial perturbations and model perturbations:

  • Initial perturbations are created by randomly combining differences between the 48-hour and 24-hour forecasts from the 1-hour, 3-hour, 6-hour and 24-hour models to form 15 pairs (30 members).
  • Model perturbations are introduced through the stochastic combination of forecasts from the 3-hour, 6-hour and 24-hour models from 0-120 hours.

This multi-model ensemble strategy with 30 members enhances the representation of forecast uncertainties for extratropical areas of synoptic scale phenomena by incorporating the strengths of each model while mitigating their individual weaknesses. As a result, it provides a more robust and probabilistic forecasting framework with limited costs, improving forecast skills for extended range of atmospheric prediction. However, the data driven model is biased which could reduce forecast reliability; and it is still challenged for tropical uncertainty representation due to limited capability to assimilate small scale error growth.

How to cite: Tang, J., Zhu, Y., and Dai, K.: Fully Data Driven “Multi-Model” Ensemble, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-47, https://doi.org/10.5194/ems2025-47, 2025.

Show EMS2025-47 recording (13min) recording
15:30–15:45
|
EMS2025-510
|
Online presentation
Nigel Roberts, Timothy Hewson, and Anca Brookshaw

Modern weather forecasting is moving towards ensemble prediction by default alongside the increased use of Machine Learning (ML) models and ensembles over a range of resolutions. From the user perspective, this will mean having to deal with a very large number of forecasts from different sources, often with different biases. There could potentially be many hundreds of ensemble members made available, with increased time frequency. At the European Centre for Medium Range Weather Forecasts (ECMWF) there are already 152 members available if the medium-range and sub-seasonal ensembles are used together out to 15 days, and with the addition of a ML ensemble the number increases to over 200.

For downstream applications that have limited resources, the problem becomes one of picking out the most salient and representative information from all those forecasts to extract the most important forecast messages. There is likely to be an increasing need for forms of “storylines” or “representative forecasts” that provide context, enable simpler communication and can be used effectively in downstream models. Clearly, there is still a vital place for probabilistic outputs, but on their own they limit the full extent of what could be usefully obtained.

Here we present highlights from work that investigates combining ensembles, member extraction and verification, making use of the different ensembles available at ECMWF with a focus on the prediction of synoptic weather patterns and regimes over a range of scales. The findings make use of a new diagnostic called CURV that categorises the degree of cyclonic or anticyclonic curvature from Mean Sea Level Pressure or Geopotential Height fields.

How to cite: Roberts, N., Hewson, T., and Brookshaw, A.: Obtaining salient information from many forecasts, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-510, https://doi.org/10.5194/ems2025-510, 2025.

Show EMS2025-510 recording (14min) recording
15:45–16:00

Posters: Tue, 9 Sep, 16:00–17:15 | Grand Hall

Display time: Mon, 8 Sep, 08:00–Tue, 9 Sep, 18:00
Chairpersons: Zahra Parsakhoo, Andrea Montani
P16
|
EMS2025-365
youngsoon jo, ji-hyun ha, and youghee lee

The initial condition is crucial for the accuracy of the Numerical Weather Prediction (NWP) model. In the data assimilation, the quality and characteristics of the observations play an important role of the NWP predictability. As the number of observations is gradually increased, many studies investigating the influence of observations on numerical weather prediction models are actively being  conducted. In addition, the research on tools for evaluating the impact of observations on numerical weather forecasting is also emerging.
Currently, several methods to carry out the examination of the impact of observational data used in the data assimilation: OSEs (Observing System Experiences), FSO (Forecast Sensitivity to Observations), and EFSO (Ensemble Forecast Sensitivity to Observations) using an ensemble predictions.
The Korean Meteorological Administration (KMA) has been operating the KIM (Korean Integrated Model)-Global model since April 2020, which adopts a Hybrid 4DEnVar. In the year of 2023, we developed an EFSO system using an ensemble forecast field and tested on the analysis the observations sensitivity of the KIM-Global model. When the EFSO applied for one month in July 2022 based on the low-resolution global analysis system of the KIM-Global model, the influence of radiosonde data and GNSS-RO (Global Navigation Satellite Systems Radio Occultation) data were 24.3% and 38.5%, respectively, which were the most influential observation data. 
Meanwhile KMA has been conducting an IOP (Intensive Observation Program) since 2020, to monitor and improve predictability of severe weather phenomena that occur in the West Sea and Gyeonggi Bay in summer and affect the Seoul metropolitan area. 
In this study we have investigated impact of radiosonde and dropsonde data observed in the IOP on the KIM-Global model’s forecast by using the EFSO. This results will be discussed in the presentation.

How to cite: jo, Y., ha, J., and lee, Y.: Ensemble based Forecast Sensitivity Observation Impact in the Hybrid 4DEnsemble Variational Data Assimilation System of the KIM-Global model, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-365, https://doi.org/10.5194/ems2025-365, 2025.

P17
|
EMS2025-674
Haseeb Ur Rehman

Compared to alluvial floods, flash or pluvial floods are difficult to predict because they result from intense and brief periods of extreme precipitation. Luxembourg has a history of being impacted by floods, with notable occurrences in January 2011, May 2016, December 2017, January 2018, February 2019, and February 2020. However, July 2021 stands out as the most severe flood year on record in the region. Study evaluates the Weather Research and Forecasting (WRF) model, with and without WRFDA 3D-VAR data assimilation, for precipitation and temperature forecasts in Luxembourg and the Greater Region during June–July 2021, a period marked by severe flooding. Conventional meteorological observations and GNSS Zenith Total Delay (ZTD) data were assimilated into WRF, using Global Forecast System (GFS) data for initial conditions. Precipitation forecasts were validated against NASA’s GPM IMERG data and against regional station measurements. Results demonstrate that data assimilation enhances the WRF model’s ability to replicate the spatial distribution and intensity of precipitation, with visual comparisons (e.g., July 14, 2021) showing improved alignment with satellite observations post-assimilation. Quantitatively, data assimilation reduces bias in precipitation and temperature forecasts at most stations, with mean absolute error (MAE) and symmetric mean absolute percentage error (SMAPE) often improving, though root mean square error (RMSE) exhibits mixed outcomes. These findings underscore the potential of integrating GNSS‑ZTD and conventional observations into high‑resolution numerical weather prediction to enhance flash flood forecasts in greater region.

Keywords: NWP; WRF; Flash flood; Weather forecast; High‑Resolution; GNSS; ZTD; Data assimilation; 3D‑VAR; GPM IMERG; Precipitation forecasting; Temperature forecasting; Convective processes; Flood modeling; Regional NWP; Global Forecast System; Luxembourg; Greater Region

How to cite: Rehman, H. U.: Evaluating the Performance of Numerical Weather Prediction Models for Precipitation and Temperature in Luxembourg and Greater Region: Insights from WRF and WRFDA 3D-Var, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-674, https://doi.org/10.5194/ems2025-674, 2025.

P18
|
EMS2025-234
Jens Pruschke, Annika Schomburg, Jana Mendrok, Klaus Stephan, Ulrich Görsdorf, Moritz Löffler, and Christine Knist

To improve the forecast quality of numerical weather prediction (NWP), the German Meteorological Service (Deutscher Wetterdienst, DWD) has initiated a project aimed at assessing data quality and assimilation of observations from ground-based remote sensing instruments that have not yet been exploited operationally.

The objective of this initiative is to fill the observational gap in the atmospheric boundary layer, especially with respect to short time scales, by providing continuous, high-temporal-resolution profiles of thermodynamic variables, wind, and cloud properties. These observations are expected to be especially beneficial for nowcasting and (short-term) forecasting applications. The DWD is evaluating various remote sensing systems with regard to the continuous data supply, their operational use and their impact on NWP.  

In this contribution, we present the integration and assimilation of two such data sources into the kilometer-scale ensemble data assimilation system (KENDA): radar reflectivity from a cloud radar and water vapour mixing ratio from a Differential Absorption Lidar (DIAL). The complex forward operator EMVORADO (Efficient Modular Volume scan Radar Operator), originally developed and previously used only for precipitation radars, has been adapted for cloud radar data. In contrast, the DIAL observations do not require a complex forward operator, and only minor adjustments have been made to the data assimilation code environment of the DWD.

Observation minus first guess statistics, as well as first single observation data assimilation experiments have been shown to produce promising results. To assess the overall impact, dedicated data assimilation experiments were conducted and compared to reference experiments without these additional observations. First results indicate a positive impact of the DIAL data on first guess humidity and temperature fields in the analysis cycle, while the impact of cloud radar data appears neutral at this stage. These findings suggest that these ground-based remote sensing data can provide valuable additional information for convective-scale data assimilation and form a sound basis for further impact studies in the context of NWP.

How to cite: Pruschke, J., Schomburg, A., Mendrok, J., Stephan, K., Görsdorf, U., Löffler, M., and Knist, C.: First Steps Towards Data Assimilation of Cloud Radar and Differential Absorption Lidar Data, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-234, https://doi.org/10.5194/ems2025-234, 2025.

P19
|
EMS2025-376
Seo-ha Park and Hyo-jong Song

Numerical weather prediction models inevitably produce forecast errors due to structural errors from discretization and limitations in physical parameterization processes. Accurately representing and incorporating these forecast errors is a key element for improving data assimilation performance. The background error covariance is operated in a hybrid form, combining a static climatological component with a flow-dependent ensemble component. The appropriate combination of these two components, and how well they reproduce the actual error characteristics, serves as a critical prerequisite for enhancing forecast accuracy.

This study aims to improve the hybrid data assimilation system of the Korea Integrated Model KIM by advancing ensemble-based background error diagnostics and enhancing its application to data assimilation performance. To this end, the study seeks to overcome the limitations of static climatological background error covariance and to develop diagnostic and adjustment techniques that can effectively reflect the spatiotemporal variability and uncertainty of forecast errors. In particular, ensemble error diagnostic information will be utilized to analyze error structure characteristics, adjust weighting factors by region and altitude, and conduct observation sensitivity analysis, thereby enabling adaptive optimization of the data assimilation system. Ultimately, this study aims to establish a practical technological foundation that can improve the efficiency of observation utilization in operational environments and contribute to enhancing the accuracy of short-and medium-range forecasts.

The forecast error characteristics and ensemble spread of the Korea Integrated Model KIM and the ECMWF operational model IFS were analyzed through intercomparison experiments. The root-mean-square error RMSE and ensemble spread were calculated based on the differences between the forecast fields and observations, and these results were used to diagnose the background error covariance (B).
 The background error covariance was constructed by separately calculating the static component (Static B) based on climatological statistics and the flow-dependent component (Ensemble B) derived from ensemble forecasts. These two components were combined using optimal weighting factors to generate the hybrid background error covariance Hybrid B. In this study, a series of experiments were conducted using the KIM-based hybrid data assimilation system, including sensitivity tests on the weighting factors, error diagnostics.

 

Key Word : Hybrid data assimilation, Background error covariance, Ensemble spread

How to cite: Park, S. and Song, H.: Ensemble error diagnosticsof hybrid data assimilation in the Korea integrated model, EMS Annual Meeting 2025, Ljubljana, Slovenia, 7–12 Sep 2025, EMS2025-376, https://doi.org/10.5194/ems2025-376, 2025.