In this article, extensions to the recently introduced concept of pairwise overlap between mixture components are proposed. The notion of overlap is useful for studying the systematic performance of clustering algorithms. Existing methods can be used for simulating elliptical data according to pre-specified overlap characteristics. First, an approach to simulating skewed clusters with a desired overlap is proposed. Next, an extension to measuring overlap in cluster-weighted models is considered. Thus, this article provides important extensions to the existing methods for simulating heterogeneous data for studying the systematic performance of clustering algorithms. Supplementary materials for this article are available online.

On Simulating Skewed and Cluster-Weighted Data for Studying Performance of Clustering Algorithms / Melnykov, Volodymyr; Wang, Yang; Melnykov, Yana; Torti, Francesca; Perrotta, Domenico; Riani, Marco. - In: JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS. - ISSN 1061-8600. - 33:1(2023), pp. 303-309. [10.1080/10618600.2023.2210338]

On Simulating Skewed and Cluster-Weighted Data for Studying Performance of Clustering Algorithms

Torti, Francesca;Riani, Marco
2023-01-01

Abstract

In this article, extensions to the recently introduced concept of pairwise overlap between mixture components are proposed. The notion of overlap is useful for studying the systematic performance of clustering algorithms. Existing methods can be used for simulating elliptical data according to pre-specified overlap characteristics. First, an approach to simulating skewed clusters with a desired overlap is proposed. Next, an extension to measuring overlap in cluster-weighted models is considered. Thus, this article provides important extensions to the existing methods for simulating heterogeneous data for studying the systematic performance of clustering algorithms. Supplementary materials for this article are available online.
2023
On Simulating Skewed and Cluster-Weighted Data for Studying Performance of Clustering Algorithms / Melnykov, Volodymyr; Wang, Yang; Melnykov, Yana; Torti, Francesca; Perrotta, Domenico; Riani, Marco. - In: JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS. - ISSN 1061-8600. - 33:1(2023), pp. 303-309. [10.1080/10618600.2023.2210338]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/3034722
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 2
social impact