Efficient Corrections for Standardized Person-Fit Statistics

Kylie Gorney; Sandip Sinharay; Carol Eckerly

doi:10.1007/s11336-024-09960-x

Efficient Corrections for Standardized Person-Fit Statistics

Published online by Cambridge University Press: 27 December 2024

Kylie Gorney

Sandip Sinharay and

Carol Eckerly

Show author details

Kylie Gorney*: Affiliation:
Michigan State University
Sandip Sinharay: Affiliation:
Educational Testing Service
Carol Eckerly: Affiliation:
Educational Testing Service
*: Correspondence should be made to Kylie Gorney, Department of Counseling, Educational Psychology, and Special Education, Michigan State University, 460 Erickson Hall, 620 Farm Lane, East Lansing, MI 48824, USA. Email: [email protected]

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Many popular person-fit statistics belong to the class of standardized person-fit statistics, T, and are assumed to have a standard normal null distribution. However, in practice, this assumption is incorrect since T is computed using (a) an estimated ability parameter and (b) a finite number of items. Snijders (Psychometrika 66(3):331–342, 2001) developed mean and variance corrections for T to account for the use of an estimated ability parameter. Bedrick (Psychometrika 62(2):191–199, 1997) and Molenaar and Hoijtink (Psychometrika 55(1):75–106, 1990) developed skewness corrections for T to account for the use of a finite number of items. In this paper, we combine these two lines of research and propose three new corrections for T that simultaneously account for the use of an estimated ability parameter and the use of a finite number of items. The new corrections are efficient in that they only require the analysis of the original data set and do not require the simulation or analysis of any additional data sets. We conducted a detailed simulation study and found that the new corrections are able to control the Type I error rate while also maintaining reasonable levels of power. A real data example is also included.

Keywords

Person fit item response theory aberrant behavior

Type: Theory & Methods
Information: Psychometrika , Volume 89 , Issue 2 , June 2024 , pp. 569 - 591

DOI: https://doi.org/10.1007/s11336-024-09960-x [Opens in a new window]
Copyright: Copyright © 2024 The Author(s), under exclusive licence to The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Albers, C. J., Meijer, R. R., Tendeiro, J. N.. (2016). Derivation and applicability of asymptotic results for multiple subtests person-fit statistics. Applied Psychological Measurement, 40(4), 274–288.CrossRef Google Scholar PubMed

Bedrick, E. J.. (1997). Approximating the conditional distribution of person fit indexes for checking the Rasch model. Psychometrika, 62(2), 191–199.CrossRef Google Scholar

Cheng, Y., Yuan, K.-H.. (2010). The impact of fallible item parameter estimates on latent trait recovery. Psychometrika, 75(2), 280–291.CrossRef Google Scholar PubMed

Cizek, G. J., & Wollack, J. A. (Eds.). (2017). Handbook of quantitative methods for detecting cheating on tests. Routledge.Google Scholar

de la Torre, J., Deng, W.. (2008). Improving person-fit assessment by correcting the ability estimate and its reference distribution. Journal of Educational Measurement, 45(2), 159–177.CrossRef Google Scholar

Drasgow, F., Levine, M. V., Williams, E. A.. (1985). Appropriateness measurement with polychotomous item response models and standardized indices. British Journal of Mathematical and Statistical Psychology, 38(1), 67–86.CrossRef Google Scholar

Glas, C. A. W., Meijer, R. R.. (2003). A Bayesian approach to person fit analysis in item response theory models. Applied Psychological Measurement, 27(3), 217–233.CrossRef Google Scholar

Gorney, K., Sinharay, S., Liu, X.. (2024). Using item scores and response times in person-fit assessment. British Journal of Mathematical and Statistical Psychology, 77(1), 151–168.CrossRef Google Scholar PubMed

Gorney, K., Wollack, J. A.. (2023). Using item scores and distractors in person-fit assessment. Journal of Educational Measurement, 60(1), 3–27.CrossRef Google Scholar

Hong, M., Lin, L., Cheng, Y.. (2021). Asymptotically corrected person fit statistics for multidimensional constructs with simple structure and mixed item types. Psychometrika, 86(2), 464–488.CrossRef Google Scholar PubMed

Li, M. F., Olejnik, S.. (1997). The power of Rasch person-fit statistics in detecting unusual response patterns. Applied Psychological Measurement, 21(3), 215–231.CrossRef Google Scholar

Magis, D., Béland, S., Raîche, G.. (2014). Snijders’s correction of the infit and outfit indices with estimated ability level: An analysis with the Rasch model. Journal of Applied Measurement, 15(1), 82–93.Google Scholar

Magis, D., Raîche, G., Béland, S.. (2012). A didactic presentation of Snijders’s

l_{z}^{*}

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$l_z^*$$\end{document}

index of person fit with emphasis on response model selection and ability estimation. Journal of Educational and Behavioral Statistics, 37(1), 57–81.CrossRef Google Scholar

Molenaar, I. W., Hoijtink, H.. (1990). The many null distributions of person fit indices. Psychometrika, 55(1), 75–106.CrossRef Google Scholar

Nering, M. L.. (1997). The distribution of indexes of person fit within the computerized adaptive testing environment. Applied Psychological Measurement, 21(2), 115–127.CrossRef Google Scholar

Noonan, B. W., Boss, M. W., Gessaroli, M. E.. (1992). The effect of test length and IRT model on the distribution and stability of three appropriateness indexes. Applied Psychological Measurement, 16(4), 345–352.CrossRef Google Scholar

Reise, S. P.. (1995). Scoring method and the detection of person misfit in a personality assessment context. Applied Psychological Measurement, 19(3), 213–229.CrossRef Google Scholar

Santos, K. C. P., de la Torre, J., von Davier, M.. (2020). Adjusting person fit index for skewness in cognitive diagnosis modeling. Journal of Classification, 37(2), 399–420.CrossRef Google Scholar

Sinharay, S. (2016a). Assessment of person fit using resampling-based approaches. Journal of Educational Measurement, 53(1), 63–85.CrossRef Google Scholar

Sinharay, S. (2016b). Asymptotic corrections of standardized extended caution indices. Applied Psychological Measurement, 40(6), 418–433.CrossRef Google Scholar

Sinharay, S. (2016c). Asymptotically correct standardization of person-fit statistics beyond dichotomous items. Psychometrika, 81(4), 992–1013.CrossRef Google Scholar

Sinharay, S. (2016d). The choice of the ability estimate with asymptotically correct standardized person-fit statistics. British Journal of Mathematical and Statistical Psychology, 69(2), 175–193.CrossRef Google Scholar

Snijders, T. A. B.. (2001). Asymptotic null distribution of person fit statistics with estimated person parameter. Psychometrika, 66(3), 331–342.CrossRef Google Scholar

Tatsuoka, K. K.. (1984). Caution indices based on item response theory. Psychometrika, 49(1), 95–110.CrossRef Google Scholar

van Krimpen-Stoop, E. M. L. A., Meijer, R. R.. (1999). The null distribution of person-fit statistics for conventional and adaptive tests. Applied Psychological Measurement, 23(4), 327–345.CrossRef Google Scholar

van Krimpen-Stoop, E. M. L. A., Meijer, R. R.. (2002). Detection of person misfit in computerized adaptive testing with polytomous items. Applied Psychological Measurement, 26(2), 164–180.CrossRef Google Scholar

von Davier, M., Molenaar, I. W.. (2003). A person-fit index for polytomous Rasch models, latent class models, and their mixture generalizations. Psychometrika, 68(2), 213–228.CrossRef Google Scholar

Warm, T. A.. (1989). Weighted likelihood estimation of ability in item response theory. Psychometrika, 54(3), 427–450.CrossRef Google Scholar

Article contents

Efficient Corrections for Standardized Person-Fit Statistics

Abstract

Keywords

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests