A versatile exploration method for simulated data based on Self Organizing Maps
- 1Institut für Theoretische Physik, Fakultät für Physik und Astronomie, Ruhr-Universität Bochum, Bochum, Germany
- 2Centre for mathematical Plasma Astrophysics, Department of Mathematics, KULeuven, Leuven, Belgium
- 3Institute for the Study of Earth, Oceans and Space, University of New Hampshire, Durham, NH, USA
The large amount of data produced by measurements and simulations of space plasmas has made it fertile ground for the application of classification methods, that can support the scientist in preliminary data analysis. Among the different classification methods available, Self Organizing Maps, SOMs [Kohonen, 1982] offer the distinct advantage of producing an ordered, lower-dimensional representation of the input data that preserves their topographical relations. The 2D map obtained after training can then be explored to gather knowledge on the data it represents. The distance between nodes reflects the distance between the input data: one can then further cluster the map nodes to identify large scale regions in the data where plasma properties are expected to be similar.
In this work, we train SOMs using data from different simulations of different aspects of the heliospheric environment: a global magnetospheric simulation done with the OpenGGCM-CTIM-RCM code, a Particle In Cell simulation of plasmoid instability done with the semi-implicit code ECSIM, a fully kinetic simulation of single X point reconnection done with the Vlasov code implemented in MuPhy2.
We examine the SOM feature maps, unified distance matrix and SOM node weights to unlock information on the input data. We then classify the nodes of the different SOMs into a lower and automatically selected number of clusters, and we obtain, in all three cases, clusters that map well to our a priori knowledge on the three systems. Results for the magnetospheric simulations are described in Innocenti et al, 2021.
This classification strategy then emerges as a useful, relatively cheap and versatile technique for the analysis of simulation, and possibly observational, plasma physics data.
Innocenti, M. E., Amaya, J., Raeder, J., Dupuis, R., Ferdousi, B., & Lapenta, G. (2021). Unsupervised classification of simulated magnetospheric regions. Annales Geophysicae Discussions, 1-28.
https://angeo.copernicus.org/articles/39/861/2021/angeo-39-861-2021.pdf
How to cite: Innocenti, M. E., Köhne, S., Hornisch, S., Grauer, R., Amaya, J., Raeder, J., Ferdousi, B., Edmond, J. "., and Lapenta, G.: A versatile exploration method for simulated data based on Self Organizing Maps, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-12480, https://doi.org/10.5194/egusphere-egu22-12480, 2022.