- 1School of Hydraulic Engineering, Dalian University of Technology, 116024 Dalian, Liaoning, China
- 2Institute of Photogrammetry and Remote Sensing, TUD Dresden University of Technology, 01062 Dresden, Germany
Characterized by high spatial heterogeneity and short response times, urban flood research has long been constrained by the scarcity of precise, high-spatiotemporal resolution observational data required to capture the complex dynamics. Without such data, it remains challenging to distinguish whether model simulation errors stem from uncertain parameters or fundamental structural deficiencies, obscuring the model's reliable physical representation. Consequently, the physical fidelity of urban hydrodynamic models in reproducing complex, spatiotemporal flood dynamics needs to be further validated. To address this, we constructed the first large-scale, 1-minute resolution urban surface inundation dataset derived from camera videos using a Large Multimodal Model (GPT-5). Using the observations, we audited the physical driving mechanisms of 1D-2D coupled hydrodynamic models by examining the spatial stratified heterogeneity of flood responses.
Focusing on 5 diverse rainfall-flood events recorded by 226 traffic surveillance cameras in Dalian, China, we utilized GPT-5 to automatically extract real-time waterlogging levels. The extracted data underwent manual verification by experts to strictly correct errors, resulting in a high-fidelity urban surface inundation dataset at a 1-minute resolution. Subsequently, we constructed and calibrated a 1D-2D coupled urban flood numerical model to obtain simulation results for the corresponding events. The Geodetector model was then employed to quantify and compare the spatial stratified heterogeneity of waterlogging derived from observations versus simulations, attributing them to 9 potential drivers including rainfall, topography, and drainage infrastructure.
Results indicate that GPT-5 achieved satisfactory extraction performance, with an average accuracy of 77%. Comparative Geodetector analysis of observations versus simulations revealed critical discrepancies. The factor detector showed low individual explanatory power (q<0.1) but distinct rankings, with simulations underestimating rainfall's role. The interaction detector revealed stronger observed synergy, where the dominant control shifted from the observed "rainfall-imperviousness" coupling (16.6%) to a simulated "pipe-imperviousness" one (14%). While the risk detector confirmed consistent trend patterns, it highlighted significant "peak-shaving" effects, with simulated depths averaging 20 cm lower. Finally, the ecological detector verified that these structural discrepancies are statistically significant rather than random errors.
Observations confirm that urban flood distribution is governed by the non-linear synergy of multiple factors, reflecting high system complexity. The model reveals a systematic structural defect: it erroneously shifts the dominant control from a dynamic rainfall-surface coupling to a static boundary condition. This bias causes the model to be insensitive to dynamic meteorological forcing and to underestimate severe localized inundation caused by micro-environments. Future improvements must move beyond parameter calibration to focus on enhancing sensitivity to rainfall fluctuations and micro-environmental representation.
How to cite: Zhou, S., Eltner, A., Zamboni, P., Lyu, H., and Zhang, C.: Auditing the Physical Fidelity of Urban Flood Model with Large Multimodal Model-Derived High-Resolution Observational Data, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-14345, https://doi.org/10.5194/egusphere-egu26-14345, 2026.