In this paper we introduce new similarity indeces for variables with multiple categories. The proposed measures are conceptually simple and straightforward to compute. In contrast to traditionally used similarity indeces, they also consider the frequency of the modalities of each attribute in the sample. This feature is useful when dealing with rare categories, since it makes sense to differently evaluate the pairwise presence of a rare category from the pairwise presence of a widespread one. Moreover, this feature helps finding under-represented groups in cluster analysis. There are two versions of the weighted index: one for independent categorical variables and one for dependent variables. The suitability of the proposed indeces is shown in this paper using both simulated and real world data sets.

A New Class of Weighted Similarity Indices Using Polytomous Variables / Morlini, Isabella; Zani, Sergio. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 29, n. 2:(2012), pp. 199-226.

A New Class of Weighted Similarity Indices Using Polytomous Variables

MORLINI, Isabella;ZANI, Sergio
2012-01-01

Abstract

In this paper we introduce new similarity indeces for variables with multiple categories. The proposed measures are conceptually simple and straightforward to compute. In contrast to traditionally used similarity indeces, they also consider the frequency of the modalities of each attribute in the sample. This feature is useful when dealing with rare categories, since it makes sense to differently evaluate the pairwise presence of a rare category from the pairwise presence of a widespread one. Moreover, this feature helps finding under-represented groups in cluster analysis. There are two versions of the weighted index: one for independent categorical variables and one for dependent variables. The suitability of the proposed indeces is shown in this paper using both simulated and real world data sets.
2012
A New Class of Weighted Similarity Indices Using Polytomous Variables / Morlini, Isabella; Zani, Sergio. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 29, n. 2:(2012), pp. 199-226.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/2434843
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact