Many indexes heve been introduced in the literature in order to compare two partitions determined by different clustering procedures applied to the same set of individuals (clustering procedures are considered different either when they are based on different algorithms, or when they use different data collected on the same set of individuals). Most of them are based on the number of pairs placed in the same or in different groups. Perhaps the most popular one is the Rand index. In this paper, a generalisation of some of these indexes is proposed in order to take into account the estimates of membership probabilities provided by model-based classifications. Correction for chance will also be discussed.
A class of indexes for comparing model-based clustering solutions
PASTORE, Andrea;TONELLATO, Stefano Federico
2009-01-01
Abstract
Many indexes heve been introduced in the literature in order to compare two partitions determined by different clustering procedures applied to the same set of individuals (clustering procedures are considered different either when they are based on different algorithms, or when they use different data collected on the same set of individuals). Most of them are based on the number of pairs placed in the same or in different groups. Perhaps the most popular one is the Rand index. In this paper, a generalisation of some of these indexes is proposed in order to take into account the estimates of membership probabilities provided by model-based classifications. Correction for chance will also be discussed.File | Dimensione | Formato | |
---|---|---|---|
cladag09.pdf
non disponibili
Tipologia:
Documento in Post-print
Licenza:
Accesso chiuso-personale
Dimensione
2.3 MB
Formato
Adobe PDF
|
2.3 MB | Adobe PDF | Visualizza/Apri |
I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.