
Some Asymptotic Analysis of Resistant Rules For Outlier Labeling

Published online by Cambridge University Press: 27 July 2009

John E. Angus
Affiliation:
Hughes Aircraft Co. Fullerton, California 92634

Abstract

Previous studies have examined the behavior of outlier detection rules for symmetric distributions that label as “outside” any observations that fall outside the interval [F_L – k(F_U – F_L), F_U + k(F_U – F_L)], where F_L and F_U are functions of the order statistics estimating the 0.25 and 0.75 quantiles of the distribution underlying the i.i.d. sample. A measure of the performance of this type of rule is the “some-outside rate” (SOR) per sample, computed with respect to a given (usually Gaussian) null distribution. The SOR per sample is the probability that the sample will contain one or more observations labeled as “outside,” given that the null distribution is the true distribution. In this paper, asymptotic expansions of k = k_n as a function of the sample size n that guarantee an asymptotically constant, prespecified SOR are given for a variety of symmetric null distributions, including the Gaussian, double exponential, logistic, and Cauchy distributions. The main theorem also applies to the case of a nonsymmetric null distribution by slightly modifying the labeling rule.
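
The labeling rule and the some-outside rate described in the abstract can be illustrated with a short simulation. The sketch below is not taken from the paper: it assumes that the lower and upper quartile estimates are the sample quartiles returned by Python's `statistics.quantiles` with the "inclusive" method, that the null distribution is standard Gaussian, and that the SOR is approximated by Monte Carlo rather than by the asymptotic expansions the paper derives. The function names `outside_labels` and `estimate_sor` are hypothetical.

```python
import random
import statistics


def outside_labels(sample, k=1.5):
    """Return the observations labeled "outside" by the resistant rule.

    Assumption: the quartile estimates are the sample quartiles from
    statistics.quantiles (inclusive method); the paper allows other
    functions of the order statistics.
    """
    lower_q, _, upper_q = statistics.quantiles(sample, n=4, method="inclusive")
    spread = upper_q - lower_q
    lower_fence = lower_q - k * spread
    upper_fence = upper_q + k * spread
    return [x for x in sample if x < lower_fence or x > upper_fence]


def estimate_sor(n, k, trials=2000, seed=0):
    """Monte Carlo estimate of the some-outside rate per sample.

    Draws `trials` Gaussian samples of size n and returns the fraction
    that contain at least one observation labeled "outside".
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if outside_labels(sample, k):
            hits += 1
    return hits / trials


if __name__ == "__main__":
    print(outside_labels([1, 2, 3, 4, 5, 100]))
    for n in (10, 50, 200):
        print(n, estimate_sor(n, k=1.5))
```

With k held fixed, the estimated SOR under this sketch changes with n, which is the motivation for choosing k as a function of the sample size when a constant, prespecified SOR is wanted.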

Type
Articles
Copyright
Copyright © Cambridge University Press 1989

References

Galambos, J. (1978). The asymptotic theory of extreme order statistics. New York: John Wiley.
Hoaglin, D.C. & Iglewicz, B. (1987). Fine-tuning some resistant rules for outlier labeling. Journal of the American Statistical Association 82: 1147–1149.
Hoaglin, D.C., Iglewicz, B. & Tukey, J.W. (1986). Performance of some resistant rules for outlier labeling. Journal of the American Statistical Association 81: 991–999.
Serfling, R.J. (1980). Approximation theorems of mathematical statistics. New York: John Wiley.
Tukey, J.W. (1977). Exploratory data analysis. Reading, Massachusetts: Addison-Wesley.