EGU26-18188, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-18188
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Monday, 04 May, 14:45–14:55 (CEST)
 
Room -2.92
Probabilistic Solar Flare Forecasting via Weakly Supervised Contrastive Refinement of VAE Latent Spaces
Ekatarina Dineva, Jasmina Magdalenic, George Miloshevich, Panagiotis Gonidakis, Francesco Carella, and Stefaan Poedts
Ekatarina Dineva et al.
  • KU Leuven, Centre for mathematical Plasma-Astrophysics, Leuven, Belgium (ekaterina.dineva@kuleuven.be)

Reliable solar flare forecasting is limited by two forms of class imbalance in active region time series: (i) the overwhelming dominance of the non-flaring, quiet state over the eruptive state, and (ii) the insufficient separability between common, physically similar event classes (e.g. C-class versus M-class flares). Although empirical parameters derived from the photospheric vector magnetic field (VMF), such as those provided by SDO/HMI SHARP products, capture aspects of active region complexity and free energy buildup, they often evolve smoothly and overlap across flare classes. Consequently, while many models can distinguish between flares and no-flares reasonably well, they struggle to distinguish flare magnitude and association with eruptive phenomena (e.g. CMEs) using photospheric information alone. This suggests that improved flare-class separation requires (a) the explicit definition of what constitutes 'similarity' between pre-flare states, and (b) parametrization that emphasizes flare-relevant structure over common active region features.

We investigate a representation learning strategy that combines the parametrization of SDO/HMI SHARP VMF cutouts using a Variational Autoencoder (VAE) with a contrastive stage to reshape the resulting embedding geometry. First, a VAE is trained to encode SHARP cutouts into compact latent vectors that capture active region morphology. These vectors are then refined using a Siamese-like objective constructed from weak supervision, which uses event labels and empirical SHARP parameters as proxies for elevated flare likelihood. The contrastive stage then uses this weak supervision to encourage a latent geometry that better reflects flare-relevant evolution. This study emphasizes latent-space structure, i.e. neighborhood consistency and class-conditional clustering, and evaluates whether these properties facilitate improved probabilistic prediction across multiple forecast horizons, by training lightweight downstream models on (i) empirical parameters, (ii) VAE latents and (iii) their combined representations.

How to cite: Dineva, E., Magdalenic, J., Miloshevich, G., Gonidakis, P., Carella, F., and Poedts, S.: Probabilistic Solar Flare Forecasting via Weakly Supervised Contrastive Refinement of VAE Latent Spaces, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-18188, https://doi.org/10.5194/egusphere-egu26-18188, 2026.