Toward trustworthy AI in systematic reviews: a statistically validated AI-augmented framework for analysing knowledge transfer strategies in urban water management

Chen Wang; Gerald Corzo; Clarine Van Oel; Chris Zevenbergen

doi:https://doi.org/10.5194/egusphere-egu26-20948

[Back] [Session ESSI1.2]

EGU26-20948, updated on 14 Mar 2026

https://doi.org/10.5194/egusphere-egu26-20948

EGU General Assembly 2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Toward trustworthy AI in systematic reviews: a statistically validated AI-augmented framework for analysing knowledge transfer strategies in urban water management

Chen Wang¹, Gerald Corzo¹, Clarine Van Oel², and Chris Zevenbergen¹

Chen Wang et al.

¹IHE Delft Institute for Water Education, Coastal and Urban Risk & Resilience, Netherlands
²Delft University of Technology, Faculty of Architecture and the Built Environment, Netherlands

The exponential expansion of academic literature across complex environmental domains has created a gap where the volume of research outpaces human capacity for effective integration. While Large Language Models (LLMs) offer a transformative solution to bridge this gap, their deployment in rigorous scientific inquiry is frequently compromised by model stochasticity, potential for hallucination, and the opacity of automated reasoning. Addressing the critical imperative for dependable and reproducible AI, this study presents a robust workflow designed to ensure methodological rigor and evidential integrity in the rapid and reliable synthesis of large-scale scientific literature.

We operationalised this framework within the domain of urban water management, specifically to analyse complex Knowledge Transfer (KT) strategies from a corpus of over 1,500 unstructured articles. To mitigate the risks inherent in generative AI, we developed a multi-layered validation protocol. First, we deployed an AI-assisted screening mechanism to filter the initial corpus down to 115 highly relevant articles, ensuring data relevance. Second, we implemented a Human-in-the-Loop design to iteratively synthesise a comprehensive analytical framework. By refining LLM-generated insights against domain expertise, we consolidated 24 operational attributes that specifically characterise the operational mechanisms of learning strategies from the corpus, preventing ungrounded inference while capturing emerging learning dynamics. Third, we addressed model variability through iterative Multi-LLM Triangulation (utilising Gemini, ChatGPT, and Deepseek). By repeatedly coding the 115 articles with the framework, we quantified qualitative insights to analyse how distinct learning strategies manifest their operational mechanisms. Finally, we employed Multiple Correspondence Analysis (MCA) and Hierarchical Clustering (HAC) to analyse the quantified results, categorising the eight identified learning strategies into three distinct clusters based on their functions and usage contexts, thereby effectively harnessing the LLM-generated insights.

Beyond this specific application, this research contributes a methodological blueprint for responsible AI integration in scientific inquiry. It demonstrates that combining theory-driven constraints with statistical verification is essential to elevate LLM-generated insights to the standard of reproducible scientific evidence.

How to cite: Wang, C., Corzo, G., Van Oel, C., and Zevenbergen, C.: Toward trustworthy AI in systematic reviews: a statistically validated AI-augmented framework for analysing knowledge transfer strategies in urban water management, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-20948, https://doi.org/10.5194/egusphere-egu26-20948, 2026.