Motivated by the requirement of controlling the number of false discoveries that arises in several application fields, we study the behaviour of diagnostic procedures obtained from popular high-breakdown regression estimators when no outlier is present in the data. We find that the empirical error rates for many of the available techniques are surprisingly far from the prescribed nominal level. Therefore, we propose a simulation-based approach to correct the liberal diagnostics and reach reliable inferences. We provide evidence that our approach performs well in a wide range of settings of practical interest and for a variety of robust regression techniques, thus showing general appeal. We also evaluate the loss of power that can be expected from our corrections under different contamination schemes and show that this loss is often not dramatic. Finally, we detail some possible extensions that may further enhance the applicability of the method.

Reliable Robust Regression Diagnostics / Salini, Silvia; Cerioli, Andrea; Laurini, Fabrizio; Riani, Marco. - In: INTERNATIONAL STATISTICAL REVIEW. - ISSN 0306-7734. - 84:1(2016), pp. 99-127. [10.1111/insr.12103]

Reliable Robust Regression Diagnostics

CERIOLI, Andrea; LAURINI, Fabrizio; RIANI, Marco
2016-01-01

Files in this item:
File: INSR12103.pdf
Access: open access
Type: Post-print document
License: Creative Commons
Size: 1.55 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11381/2787651
Citations
  • PMC: n/a
  • Scopus: 8
  • ISI: 11