Hostname: page-component-745bb68f8f-l4dxg Total loading time: 0 Render date: 2025-01-08T10:23:24.869Z Has data issue: false hasContentIssue false

Variation in Test Validity with Variation in the Distribution of Item Difficulties, Number of Items, and Degree of their Intercorrelation

Published online by Cambridge University Press:  01 January 2025

Hubert E. Brogden*
Affiliation:
Personnel Research Section The Adjutant General's Office

Abstract

The relation between item difficulty distributions and the “validity” and reliability of tests is computed through use of normal correlation surfaces for varying numbers of items and varying degrees of item intercorrelations. Optimal or near optimal item difficulty distributions are thus identified for various possible item difficulty distributions. The results indicate that, if a test is of conventional length, is homogeneous as to content, and has a symmetrical distribution of item difficulties, correlation with a normally distributed perfect measure of the attribute common to the items does not vary appreciably with variation in the item difficulty distribution. Greater variation was evident in correlation with a second duplicate test (reliability). The general implications of these findings and their particular significance for evaluating techniques aimed at increasing reliability are considered.

Type
Original Paper
Copyright
Copyright © 1946 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

*

When expressed in terms of standard score scale values—not percentage correct

References

Ferguson, G. A. The factorial interpretation of test difficulty. Psychometrika, 1941, 6, 323329.CrossRefGoogle Scholar
Gulliksen, Harol. The relation of item difficulty and inter-item correlation to test variance and reliability. Psychometrika, 1945, 10, 7991.CrossRefGoogle Scholar
Kuder, G. F. and Richardson, M. W. The theory of the estimation of test reliability. Psychometrika, 1937, 2, 151160.CrossRefGoogle Scholar
Pearson, Kar. Tables for statisticians and biometricians. Cambridge: Cambridge Univ. Press. Pp. 143.Google Scholar
Richardson, M. W. The relation between the difficulty and the differential validity of a test. Psychometrika, 1936, 1, 3349.CrossRefGoogle Scholar
Symonds, P. M. Factors influencing test reliability. J. educ. Psychol., 1928, 19, 7387.CrossRefGoogle Scholar
Thurstone, T. G. The difficulty of a test and its diagnostic value. J. educ. Psychol., 1932, 23, 335343.CrossRefGoogle Scholar
Tucker, Ledyard R. Maximum validity of a test with equivalent items. Psychometrika, 1946, 11, 113.CrossRefGoogle ScholarPubMed
Wherry, Robert J. and Gaylord, Richard H. Factor pattern of test items and tests as a function of the correlation coefficient: content, difficulty and constant error factors. Psychometrika, 1944, 9, 237244.CrossRefGoogle Scholar
Wherry, Robert J. and Gaylord, Richard H. Test selection with integral gross score weights. Psychometrika, 1946, 11, 173183.CrossRefGoogle Scholar