Controlling the size of multivariate outlier tests with the MCD estimator of scatter

Cerioli, Andrea; Riani, Marco; Atkinson, A. C.

doi:10.1007/s11222-008-9096-5

Multivariate outlier detection requires computation of robust distances to be compared with appropriate cut-off points. In this paper we propose a new calibration method for obtaining reliable cut-off points of distances derived from the MCD estimator of scatter. These cut-off points are based on a more accurate estimate of the extreme tail of the distribution of robust distances. We show that our procedure gives reliable tests of outlyingness in almost all situations of practical interest, provided that the sample size is not much smaller than 50. Therefore, it is a considerable improvement over all the available MCD procedures, which are unable to provide good control over the size of multiple outlier tests for the data structures considered in this paper.

Controlling the size of multivariate outlier tests with the MCD estimator of scatter / Cerioli, Andrea; Riani, Marco; Atkinson, A. C.. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 19:(2009), pp. 341-353. [10.1007/s11222-008-9096-5]