EGU26-8330, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-8330
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Oral | Tuesday, 05 May, 14:45–14:55 (CEST)
 
Room B
 Determination of Spatial and Temporal Drought Patterns Over CONUS Using Unsupervised Machine Learning Clustering Algorithms 
Olivier Prat1, Iype Eldho1, David Coates1, Brian Nelson2, Michael Shaw3, and Steve Ansari4
Olivier Prat et al.
  • 1North Carolina State University (NCSU), Cooperative Institute for Satellite Earth System Studies (CISESS), Asheville, NC, USA (opprat@ncsu.edu)
  • 2NOAA/NCEI/Center for Weather and Climate (CWC), Asheville, NC, USA
  • 3ISciences, L.L.C., National Centers for Environmental Information (NCEI), Asheville, NC, USA
  • 4NOAA/NCEI/National Integrated Drought Information System (NIDIS), Asheville, NC, USA

The Standardized Precipitation Index (SPI) is computed over CONUS using daily precipitation estimates from the NOAA Daily U.S. Climate Gridded Dataset (NClimGrid-Daily). From the NClimGrid-SPI (1951-present; 0.05°x0.05°), we derive historical hydro-climatological conditions and drought information from the Drought Severity and Coverage Index (DSCI) which combines drought levels into a single areal value (from 0 to 500). One of our objective is to better understand drought dynamic and particularly how drought episodes evolve from short term rainfall deficit (i.e., less than three months) to persistent drought condition (i.e., beyond nine months). To investigate how those cascading effect work, we use a Machine Learning (ML) approach to identify spatio-temporal patterns of drought episodes over CONUS. Several unsupervised ML clustering algorithms are tested using an ensemble of features including drought duration, rainfall accumulation, drought severity (maximum DSCI, time of maximum DSCI), seasonality (drought beginning and end dates), location (latitude, longitude). Results show that the most severe drought events (i.e., DSCI > 350) are those that have the longest durations and for which drought relief is associated with higher rainfall accumulation regardless of the location considered. Furthermore, there is an apparent consistency across accumulation scales and the number of parameters selected with an optimum number of clusters around four. The Euclidian distance ML models tested seems to be able to define spatiotemporal areas of similar drought patterns. Differences between models are observed in terms of spatial definitions and predominance  of a cluster at a given location. The strongest prevalence of a given cluster has allowed to isolate areas of coherence such as the Pacific Northwest, the PNW, the Eastern Seaboard and the Southeast, and the Southwest area along the MX-US border. Domain delineations are weaker for areas such as the Rockies, the Midwest, and the Great Plains. While the SPI algorithm assumes a Gamma (McKee et al., 1993) or a Pearson III (Guttman, 1998) distribution for monthly rainfall accumulation periods, results show that this assumption might not be optimal depending on the domain considered and the accumulation period when applied to daily drought monitoring.

How to cite: Prat, O., Eldho, I., Coates, D., Nelson, B., Shaw, M., and Ansari, S.:  Determination of Spatial and Temporal Drought Patterns Over CONUS Using Unsupervised Machine Learning Clustering Algorithms , EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-8330, https://doi.org/10.5194/egusphere-egu26-8330, 2026.