An a contrario approach to hierarchical clustering validity assessment - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2004

An a contrario approach to hierarchical clustering validity assessment

Résumé

In this paper we present a method to detect natural groups in a data set, based on hierarchical clustering. A measure of the meaningfulness of clusters, derived from a background model assuming no class structure in the data, provides a way to compare clusters, and leads to a cluster validity criterion. This criterion is applied to every cluster in the nested structure. While all clusters passing the validity test are meaningful in themselves, the set of all of them will probably provide a redundant data representation. By selecting a subset of the meaningful clusters, a good data representation, which also discards outliers, can be achieved. The strategy we propose combines a new merging criterion (also derived from the background model) with a selection of local maxima of the meaningfulness with respect to inclusion, in the nested hierarchical structure.
Fichier principal
Vignette du fichier
RR-5318.pdf (667.28 Ko) Télécharger le fichier
Loading...

Dates et versions

inria-00070682 , version 1 (19-05-2006)

Identifiants

  • HAL Id : inria-00070682 , version 1

Citer

Frédéric Cao, Julie Delon, Agnès Desolneux, Pablo Musé, Frédéric Sur. An a contrario approach to hierarchical clustering validity assessment. [Research Report] RR-5318, INRIA. 2004, pp.15. ⟨inria-00070682⟩
152 Consultations
313 Téléchargements

Partager

Gmail Facebook X LinkedIn More