Hostname: page-component-745bb68f8f-b6zl4 Total loading time: 0 Render date: 2025-01-07T13:27:52.119Z Has data issue: false hasContentIssue false

Increasing Item Complexity: A Possible Cause of Scale Shrinkage for Unidimensional Item Response Theory

Published online by Cambridge University Press:  01 January 2025

Wendy M. Yen*
Affiliation:
CTB/McGraw-Hill
*
Requests for reprints should be sent to Wendy M. Yen, CTB/McGraw-HilI, 2500 Garden Road, Monterey, CA 93940.

Abstract

When the three-parameter logistic model is applied to tests covering a broad range of difficulty, there frequently is an increase in mean item discrimination and a decrease in variance of item difficulties and traits as the tests become more difficult. To examine the hypothesis that this unexpected scale shrinkage effect occurs because the items increase in complexity as they increase in difficulty, an approximate relationship is derived between the unidimensional model used in data analysis and a multidimensional model hypothesized to be generating the item responses. Scale shrinkage is successfully predicted for several sets of simulated data.

Type
Original Paper
Copyright
Copyright © 1985 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

The author is grateful to Robert Mislevy for kindly providing a copy of his computer program, RESOLVE.

References

CTB/McGraw-Hill (1981). Comprehensive tests of basic skills, forms U & V, Monterey, CA: Author.Google Scholar
CTB/McGraw-Hill (1982). Comprehensive tests of basic skills, forms U & V, test coordinator's handbook, Monterey, CA: Author.Google Scholar
CTB/McGraw-Hill (1984). Comprehensive tests of basic skills, forms U & V, Monterey, CA: Author.Google Scholar
Doody-Bogan, E., & Yen, W. M. (1983, April). Detecting multidimensionality and examining its effects on vertical equating with the three-parameter logistic model. Paper presented at the meeting of the American Educational Research Association, Montreal.Google Scholar
International Mathematical and Statistical Libraries (1979). IMSL Library 7th ed.,, Houston, TX: Author.Google Scholar
Lord, F. M. (1975). The “ability” scale in item characteristic curve theory. Psychometrika, 40, 205217.CrossRefGoogle Scholar
Lord, F. M. (1980). Applications of item response theory to practical testing problems, New York: Lawrence Erlbaum.Google Scholar
Lord, F. M. (1984). Conjunctive and disjunctive item response functions, Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
McKinley, R. L. (1983, April). A multidimensional extension of the two-parameter logistic latent trait model. Paper presented at the meeting of the National Council on Measurement in Education, Montreal.Google Scholar
Mislevy, R. J. (1984). Estimating latent distributions. Psychometrika, 49, 359381.CrossRefGoogle Scholar
Reckase, M. D. (1979). Unifactor latent trait models applied to multifactor tests: Results and implications. Journal of Educational Statistics, 4, 207230.CrossRefGoogle Scholar
Reckase, M. D., & McKinley, R. L. (1983, April). The definition of difficulty and discrimination for multidimensional item response theory models. Paper presented at the meeting of the American Educational Research Association, Montreal.Google Scholar
Wingersky, M. S., Barton, M. A., Lord, F. M. (1982). LOGIST User's Guide, Princeton, NJ: Educational Testing Service.Google Scholar
Yen, W. M. (1983). Tau equivalence and equipercentile equating. Psychometrika, 48, 353369.CrossRefGoogle Scholar
Yen, W. M. (1983). Use of the three-parameter logistic model in the development of a standardized achievement test. In Hambleton, R. K. (Eds.), Applications of Item Response Theory, British Columbia: Educational Research Institute of British Columbia.Google Scholar
Yen, W. M. (1984). Effects of local item dependence on the fit and equating performance of the three-parameter logistic model. Applied Psychological Measurement, 8, 125145.CrossRefGoogle Scholar