EGU26-11928, updated on 14 Mar 2026
https://doi.org/10.5194/egusphere-egu26-11928
EGU General Assembly 2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
Poster | Thursday, 07 May, 10:45–12:30 (CEST), Display time Thursday, 07 May, 08:30–12:30
 
Hall A, A.42
A Disjunctive Interpretation Approach to Missing Data Based on Clustering Quality
hamza khyari1 and Salem Benferhat2
hamza khyari and Salem Benferhat
  • 1USMBA, FTSF, Computer Sciecne, Morocco (hamza.khyari@usmba.ac.ma)
  • 2CRIL, CNRS UMR 8188, Université d'Artois, France

Data completion is a major challenge in many applications, particularly in Geographic Information Systems (GIS) for water networks. Numerous approaches have been proposed to address this problem, ranging from classical statistical methods to artificial intelligence-based techniques.

In this presentation, we address the problem of missing or imprecise data in water network GIS by proposing a clustering-based data completion approach. For a given attribute with missing or uncertain values, each possible value in the attribute domain is considered as a candidate for completion. Each candidate is evaluated by analyzing its impact on the clustering of the entire dataset: inserting a candidate value induces a specific global clustering, whose quality is assessed using appropriate clustering validity criteria. The value that yields the highest-quality clustering, namely the one that best captures the intrinsic structure of the data, is selected as the final completion value.

To cope with the combinatorial explosion resulting from multiple attributes with missing values and large domains, several strategies are employed to reduce the number of candidate completions, including aggregation mechanisms, while maintaining both the effectiveness and efficiency of the proposed approach.

How to cite: khyari, H. and Benferhat, S.: A Disjunctive Interpretation Approach to Missing Data Based on Clustering Quality, EGU General Assembly 2026, Vienna, Austria, 3–8 May 2026, EGU26-11928, https://doi.org/10.5194/egusphere-egu26-11928, 2026.