EGU25-6762, updated on 14 Mar 2025
https://doi.org/10.5194/egusphere-egu25-6762
EGU General Assembly 2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
PICO | Thursday, 01 May, 08:53–08:55 (CEST)
 
PICO spot 2, PICO2.10
Automated disaster event extraction to understand lessons learned: A large-scale text analysis on the scientific literature of floods, droughts, and landslides. 
Lina Stein1, Birgit M. Pfitzmann2, S. Karthik Mukkavilli3, Ugur Ozturk1,5, Peter W. J. Staar4, Cesar Berrospi4, Thomas Brunschwiler4, and Thorsten Wagener1
Lina Stein et al.
  • 1University of Potsdam, Institute for Environmental Science and Geography, Faculty of Science, Potsdam, Germany (lina.stein@uni-potsdam.de)
  • 2Smart City & ERZ Zurich, Zurich, Switzerland
  • 3Mercuria, Geneva, Switzerland
  • 4IBM Research - Europe, Zurich, Switzerland
  • 5Helmholtz Centre Potsdam - GFZ German Research Centre for Geosciences

A natural hazard event that highly impacted a society might trigger a wave of post-disaster research analysis, which looks into the cause of the disaster, the types of impact, or any lessons learned to prevent similar events in the future. In short, post-disaster research contains valuable knowledge that should be utilized in disaster risk management. However, in the past 70 years, the scientific community published around 600,000 articles on hydro-hazards, such as floods, droughts, and landslides. Finding articles that describe specific disaster events and synthesizing their knowledge is not humanly possible anymore due to near exponentially increasing numbers of publications. However, recent advancements in large language models allow the analysis and extraction of described disaster events in the scientific literature.

Here we make use of the Wealth over Woe scientific abstract dataset (Stein et al. 2024), with abstracts that were automatically annotated for hydro-hazards and geolocation.  It allows us to track publication trends and to identify disaster events that triggered a wave of new research. We additionally make use of the large language model Llama 70B to extract specific hazard events mentioned in each abstract (e.g. 2003 summer drought in Europe, Pakistan flood in 2010, 2002 Elbe flood, etc.) as well as other described details surrounding the event.

While we know that hydro-hazard research is biased against low-income countries, exceptional disaster events can shift research priorities for several years. The additional funding can support valuable local post-disaster research. The named event recognition can therefore help us answer questions such as: What kind of hydro-hazards are studied in detail and where? What are the key research foci for post-disaster analysis? And are there regional differences to these answers?

How to cite: Stein, L., Pfitzmann, B. M., Mukkavilli, S. K., Ozturk, U., Staar, P. W. J., Berrospi, C., Brunschwiler, T., and Wagener, T.: Automated disaster event extraction to understand lessons learned: A large-scale text analysis on the scientific literature of floods, droughts, and landslides. , EGU General Assembly 2025, Vienna, Austria, 27 Apr–2 May 2025, EGU25-6762, https://doi.org/10.5194/egusphere-egu25-6762, 2025.