Multivariate outlier detection requires computation of robust distances to be compared with appropriate cut-off points. In this paper we propose a new calibration method for obtaining reliable cut-off points of distances derived from the MCD estimator of scatter. These cut-off points are based on a more accurate estimate of the extreme tail of the distribution of robust distances. We show that our procedure gives reliable tests of outlyingness in almost all situations of practical interest, provided that the sample size is not much smaller than 50. Therefore, it is a considerable improvement over all the available MCD procedures, which are unable to provide good control over the size of multiple outlier tests for the data structures considered in this paper.

Controlling the size of multivariate outlier tests with the MCD estimator of scatter / Cerioli, Andrea; Riani, Marco; Atkinson, A. C.. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 19:(2009), pp. 341-353. [10.1007/s11222-008-9096-5]

Controlling the size of multivariate outlier tests with the MCD estimator of scatter

CERIOLI, Andrea;RIANI, Marco;
2009-01-01

Abstract

Multivariate outlier detection requires computation of robust distances to be compared with appropriate cut-off points. In this paper we propose a new calibration method for obtaining reliable cut-off points of distances derived from the MCD estimator of scatter. These cut-off points are based on a more accurate estimate of the extreme tail of the distribution of robust distances. We show that our procedure gives reliable tests of outlyingness in almost all situations of practical interest, provided that the sample size is not much smaller than 50. Therefore, it is a considerable improvement over all the available MCD procedures, which are unable to provide good control over the size of multiple outlier tests for the data structures considered in this paper.
2009
Controlling the size of multivariate outlier tests with the MCD estimator of scatter / Cerioli, Andrea; Riani, Marco; Atkinson, A. C.. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 19:(2009), pp. 341-353. [10.1007/s11222-008-9096-5]
File in questo prodotto:
File Dimensione Formato  
CRA_STCO09.pdf

non disponibili

Tipologia: Documento in Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 501.8 kB
Formato Adobe PDF
501.8 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/2282546
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 26
  • ???jsp.display-item.citation.isi??? 24
social impact