EGU24-8062, updated on 08 Mar 2024
https://doi.org/10.5194/egusphere-egu24-8062
EGU General Assembly 2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

One-pass algorithms for streamed climate data

Katherine Grayson (1), Aleks Lacima-Nadolnik (1), Francesc Roura Adserias (1), Ehsan Sharifi (2), Stephan Thober (2), and Francisco Doblas-Reyes (1)
  • (1) Earth Sciences, Barcelona Supercomputing Center, Barcelona, Spain (katherine.grayson@bsc.es)
  • (2) Department of Computational Hydrosystems, Helmholtz Centre for Environmental Research GmbH - UFZ, Leipzig, Germany (stephan.thober@ufz.de)

Projections from global climate models (GCMs) are regularly used to inform climate adaptation policies and socio-economic decisions. As demand grows for more accurate projections, GCMs are being run at increasingly fine spatiotemporal resolution to better resolve physical processes and thereby reduce the uncertainty associated with parametrizations (Iles et al., 2020; Palmer, 2014). Yet the resulting growth in output volume makes the current state-of-the-art archiving approach (e.g., CORDEX, CMIP) infeasible. Moreover, the current archival method has left some data consumers without the data they need, because only a limited number of variables are stored, often at low frequency (e.g., monthly means). Initiatives like Destination Earth are investigating the novel method of data streaming, in which user applications run as soon as the climate model produces the required data. Data streaming gives users access to climate data at the highest available frequency (e.g., hourly) and at native resolution, in near-real model run time. This provides an unprecedented reduction in the time needed to access climate data compared with the current simulation paradigm, and opens up variables and frequencies that were not previously available.

Yet the advent of data streaming in the climate community poses its own challenges. Users often require climate data spanning long periods. For example, many hydrological impact models require daily, monthly or annual maximum precipitation values (Teutschbein and Seibert, 2012), while the wind energy sector relies on accurate distributions of wind speed over long periods (Lledo et al., 2019). When the period of interest is longer than the time for which the climate model output remains accessible, statistics can no longer be obtained with traditional algorithms that need the full time series at once. This introduces the one-pass problem: how can summaries, diagnostics or derived quantities be computed by algorithms that see each data point only once (i.e., pass through the data a single time)?
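
As an illustration of the one-pass constraint for the two examples above, the following minimal Python sketch keeps a running maximum and a fixed-bin histogram while discarding each value after it has been seen. It is not taken from the work described here: the class names, the bin edges and the sample values are all hypothetical, chosen only to make the idea concrete.

```python
from bisect import bisect_right


class RunningMax:
    """Keeps only the largest value seen so far (hypothetical helper)."""

    def __init__(self):
        self.value = float("-inf")

    def update(self, x):
        if x > self.value:
            self.value = x


class RunningHistogram:
    """Fixed-bin histogram; bin i covers [edges[i], edges[i + 1])."""

    def __init__(self, edges):
        self.edges = list(edges)
        self.counts = [0] * (len(self.edges) - 1)

    def update(self, x):
        i = bisect_right(self.edges, x) - 1
        # Values outside the chosen bin range are ignored in this sketch.
        if 0 <= i < len(self.counts):
            self.counts[i] += 1


# Made-up hourly values standing in for streamed model output.
hourly_precip = [0.0, 1.2, 4.8, 0.3]   # illustrative precipitation values
hourly_wind = [3.1, 7.9, 12.4, 7.2]    # illustrative wind speeds

precip_max = RunningMax()
wind_hist = RunningHistogram([0, 5, 10, 15])  # assumed bin edges

for p, w in zip(hourly_precip, hourly_wind):
    precip_max.update(p)   # each value is used once, then dropped
    wind_hist.update(w)

print(precip_max.value)    # 4.8
print(wind_hist.counts)    # [1, 2, 1]
```

The memory used depends only on the number of bins, not on the length of the stream. The cost is that the bin edges must be fixed in advance, so the distribution is known only to within the bin width.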

We present a detailed analysis of the use of one-pass algorithms to compute statistics on streamed climate data. Unlike traditional two-pass methods, one-pass algorithms do not have access to the full time series needed to estimate the statistic; instead, they update their estimate incrementally each time the model outputs new time steps. While these algorithms have been adopted in other fields such as online trading and machine learning, they have yet to find a foothold in climate science, mainly because they have not been necessary until now. We show how one-pass algorithms can be harnessed in Earth system digital twins to generate the statistics users require with minimal loss in accuracy while avoiding infeasible storage requirements.
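
To make the contrast with two-pass methods concrete, the sketch below uses Welford's well-known incremental update for the mean and variance. The abstract does not state which algorithms the authors use, so this is an assumed, generic example; the class name and the sample values are hypothetical.

```python
import statistics


class RunningMeanVar:
    """Welford's one-pass update for mean and sample variance."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    @property
    def variance(self):
        # Sample variance (n - 1 denominator); undefined for n < 2.
        return self.m2 / (self.n - 1) if self.n > 1 else float("nan")


# Made-up values standing in for one grid cell's streamed time series.
stream = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

stats = RunningMeanVar()
for value in stream:
    stats.update(value)  # only n, mean and m2 are retained

# Two-pass reference, possible here only because the toy series fits in memory.
reference_mean = statistics.mean(stream)
reference_var = statistics.variance(stream)

print(stats.mean, stats.variance)
print(reference_mean, reference_var)
```

For the mean and variance the one-pass result agrees with the two-pass reference up to floating-point rounding. Statistics such as quantiles generally need approximate one-pass estimators, which is one place where a small loss in accuracy can arise.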

Iles, C.E., Vautard, R., Strachan, J., Joussaume, S., Eggen, B.R., Hewitt, C.D., 2020. The benefits of increasing resolution in global and regional climate simulations for European climate extremes. Geoscientific Model Development 13.

Lledo, L., et al., 2019. Seasonal forecasts of wind power generation. Renewable Energy 143.

Palmer, T., 2014. Climate forecasting: Build high-resolution global climate models. Nature 515.

Teutschbein, C., Seibert, J., 2012. Bias correction of regional climate model simulations for hydrological climate-change impact studies: Review and evaluation of different methods. Journal of Hydrology 456-457.

How to cite: Grayson, K., Lacima-Nadolnik, A., Roura Adserias, F., Sharifi, E., Thober, S., and Doblas-Reyes, F.: One-pass algorithms for streamed climate data, EGU General Assembly 2024, Vienna, Austria, 14–19 Apr 2024, EGU24-8062, https://doi.org/10.5194/egusphere-egu24-8062, 2024.

Supplementary materials

supplementary materials version 1 – uploaded on 11 Apr 2024, no comments