EMS Annual Meeting Abstracts
Vol. 20, EMS2023-501, 2023, updated on 06 Jul 2023
EMS Annual Meeting 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Data-driven s2s forecasting with s2spy & lilio

Yang Liu1, Bart Schilperoort1, Peter Kalverla1, Sem Vijverberg2, Jannes van Ingen2, Sarah Alidoost1, Stefen Verhoeven1, and Dim Coumou2
Yang Liu et al.
  • 1Netherlands eScience Center, Environment and Sustainability, Amsterdam, Netherlands (y.liu@esciencecenter.nl)
  • 2Water and Climate Risk, Faculty of Science, Vrije Universiteit Amsterdam (sem.vijverberg@vu.nl )

Reliable S2S forecasts remain a huge scientific challenge. Only for specific ‘windows of predictability’, skillful forecasts are possible, in an otherwise largely unpredictable future. Due to a number of successes in S2S forecasting, the interest in machine learning (ML) is growing fast. However, we argue there is a need for more standardization, consensus on best practices, higher efficiency, and higher reproducibility. Typical S2S ML use-cases, such as (1) pure statistical forecasting based on observations, (2) transfer learning, and (3) post-processing of dynamical model ensembles, require a large coding and preprocessing effort. Such experiments are not trivial to set up, and without sufficient experience and expertise there is a large risk of improper cross-validation and/or improper and non-standard verification. 

Driven by the need for a reliable tool to integrate expert knowledge and artificial intelligence, we are developing two python packages to tackle the scientific challenge of (sub) seasonal (S2S) forecasting. s2spy is a high level python package designed to handle and optimize the entire data-driven forecasting workflow. Lilio is an advanced calendar system for resampling timeseries into training and target data for machine learning. Our aim is to make ML workflows more transparent and easier to build, and to facilitate standardization and collaboration across the S2S community. This also contributes to higher reproducibility and works towards a wider acceptance of standards and best practices. We will present our vision and the capabilities of our package, show-casing that we can build a model from raw climate data up to verification in only a few lines of code. 

How to cite: Liu, Y., Schilperoort, B., Kalverla, P., Vijverberg, S., van Ingen, J., Alidoost, S., Verhoeven, S., and Coumou, D.: Data-driven s2s forecasting with s2spy & lilio, EMS Annual Meeting 2023, Bratislava, Slovakia, 4–8 Sep 2023, EMS2023-501, https://doi.org/10.5194/ems2023-501, 2023.