
Testing for Local Dependency in Dichotomous and Polytomous Item Response Models

Published online by Cambridge University Press:  01 January 2025

Edward Hak-sing Ip*
Affiliation:
Marshall School of Business, University of Southern California
*Request for reprints should be directed to the author at the Marshall School of Business, Information and Operations Management Department, University of Southern California, Los Angeles, CA 90089-1421. E-Mail: [email protected]

Abstract

Researchers studying item response models are often interested in how local dependency affects the validity of conclusions drawn from statistical inference. This paper focuses on the detection of local dependency. We provide a framework for viewing local dependency within dichotomous and polytomous items that are clustered by design, and present a testing procedure that allows researchers to identify the individual item pairs that exhibit local dependency while controlling the false positive rate. Simulation results indicate that the proposed method is effective. In addition, its relation to other existing methods is discussed.
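The abstract does not spell out the test statistic or the multiplicity adjustment, so the following is only a minimal sketch of what a pairwise screening of this kind could look like, not the paper's procedure. It assumes dichotomous items, a Mantel-Haenszel chi-square for each item pair stratified by the rest score (the total score excluding the two items), a large-sample chi-square reference distribution with one degree of freedom, and Holm's step-down rule to control the familywise error rate. The function names `mh_pvalue`, `holm_reject`, and `screen_pairs` are hypothetical.

```python
import math
from itertools import combinations


def mh_pvalue(responses, i, j):
    """Mantel-Haenszel p-value for association between dichotomous items
    i and j, stratified by rest score (assumed conditioning variable)."""
    strata = {}
    for row in responses:
        rest = sum(row) - row[i] - row[j]
        table = strata.setdefault(rest, [0, 0, 0, 0])
        # Cell index: 0 -> (0,0), 1 -> (0,1), 2 -> (1,0), 3 -> (1,1)
        table[2 * row[i] + row[j]] += 1

    num = 0.0  # sum over strata of observed minus expected (1,1) counts
    var = 0.0  # sum over strata of hypergeometric variances
    for n00, n01, n10, n11 in strata.values():
        n = n00 + n01 + n10 + n11
        if n < 2:
            continue  # a stratum with one examinee carries no information
        r1 = n10 + n11  # examinees answering item i correctly
        c1 = n01 + n11  # examinees answering item j correctly
        num += n11 - r1 * c1 / n
        var += r1 * (n - r1) * c1 * (n - c1) / (n * n * (n - 1))

    if var == 0.0:
        return 1.0  # no stratum is informative
    chi2 = num * num / var  # no continuity correction in this sketch
    # Upper tail of a chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2))
    return math.erfc(math.sqrt(chi2 / 2.0))


def holm_reject(pvalues, alpha=0.05):
    """Holm step-down rule: returns one rejection flag per p-value."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda k: pvalues[k])
    reject = [False] * m
    for rank, k in enumerate(order):
        if pvalues[k] <= alpha / (m - rank):
            reject[k] = True
        else:
            break  # stop at the first non-rejection
    return reject


def screen_pairs(responses, alpha=0.05):
    """Flag item pairs whose adjusted test rejects conditional independence."""
    pairs = list(combinations(range(len(responses[0])), 2))
    pvalues = [mh_pvalue(responses, i, j) for i, j in pairs]
    flags = holm_reject(pvalues, alpha)
    return [pair for pair, flag in zip(pairs, flags) if flag]
```

Calling `screen_pairs(responses)` on a list of 0/1 response vectors returns the index pairs flagged at the chosen level. Holm's rule is used here only because it is simple to state; a different adjustment, or a false discovery rate criterion, would change `holm_reject` but not the overall structure.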

Type
Articles
Copyright
Copyright © 2001 The Psychometric Society


Footnotes

The research was supported under the National Assessment of Educational Progress (Grant No. R902B990007) administered by the National Center of Education Statistics, U.S. Department of Education. This work was started when the author was at the Division of Statistics and Psychometrics at the Educational Testing Service. I thank Juliet Shaffer for her comments on the multiple testing procedure. I also thank three anonymous referees and the Associate Editor for suggestions that greatly improved the presentation of the manuscript.
