Hostname: page-component-745bb68f8f-f46jp Total loading time: 0 Render date: 2025-01-08T07:26:11.446Z Has data issue: false hasContentIssue false

Methods for Evaluating Empirical Bayes Point Estimates of Latent Trait Scores

Published online by Cambridge University Press:  01 January 2025

Jack Kearns
Affiliation:
Educational Testing Service
William Meredith
Affiliation:
University of California, Berkeley

Abstract

Empirical Bayes point estimates of latent trait scores, derived under the assumptions of one of several test theory models, display a certain degree of instability unless the sample size is sufficiently large. A measure of this instability over repeated sampling is the distribution of the overall expected squared error loss which converges, both in probability and in the mean, to the minimum (Bayes) overall expected loss as sample size increases. An asymptotic distribution theory is developed, and the resulting large sample approximation is compared with results obtained from simulated data. Attention is also given to the effects of using a smoothing procedure.

Type
Original Paper
Copyright
Copyright © 1975 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

Research leading to this paper was partially supported by the National Science Foundation, Division of Biological and Medical Sciences, Program in Psychobiology, Grant No. NSF-GB-30779.

*

The authors are indebted to Thomas W. F. Stroud and Noel Cressie for their reviews of an earlier version of this paper and to Frederic M. Lord for helpful comments and suggestions.

*

The authors wish to thank Dr. John Bianchiai and Dr. John Helmick of Educational Testing Service for making these data available.

References

George, S. L.. Evaluation of empirical Bayes estimators for small numbers of past samples. Biometrika, 1971, 58, 244244.CrossRefGoogle Scholar
Horst, P.. Factor analysis of data matrices, 1965, New York: Holt, Rinehart and Winston.Google Scholar
Johnson, N. L., Nixon, E., Amos, D. E.. Table of percentage points of Pearson curves, for given √β 1, and β 2 expressed in standard measure. Biometrika, 1963, 50, 459498.Google Scholar
Kearns, J.. Empirical Bayes point estimates of true score using a compound binomial error model, 1974, Princeton, N.J.: Educational Testing Service.Google Scholar
Keats, J., Lord, F. M.. A theoretical distribution for mental test scores. Psychometrika, 1962, 27, 5972.CrossRefGoogle Scholar
Kendall, M. S., Stuart, A.. The advanced theory of statistics. Vol. 1, 1969, New York: Hafner.CrossRefGoogle Scholar
Kotz, S., Johnson, N. L., Boyd, D. W.. Series representations of distributions of quadratic ratic forms in normal variables. I. Central case. Annals of Mathematical Statistics, 1967, 38, 823837.CrossRefGoogle Scholar
Lord, F. M.. An approach to mental test theory. Psychometrika, 1959, 24, 283302.CrossRefGoogle Scholar
Lord, F. M.. A strong true-score theory with applications. Psychometrika, 1965, 30, 239270.CrossRefGoogle Scholar
Lord, F. M.. Estimating true-score distributions in psychological testing (an empirical Bayes estimation problem). Psychometrika, 1969, 34, 259299.CrossRefGoogle Scholar
Lord, F. M., Novick, M. R.. Statistical theories of mental test scores, 1968, Reading, Mass.: Addison-Wesley.Google Scholar
Lukacs, E., Laha, R. G.. Applications of characteristic functions, 1964, London: Griffin.Google Scholar
Maritz, J. S.. Empirical Bayes methods, 1970, London: Methuen.Google Scholar
Meredith, W. M.. Poisson distributions of error in mental test theory. British Journal of Mathematical and Statistical Psychology, 1971, 24, 4982.CrossRefGoogle Scholar
Meredith, W. M., Kearns, J.. Empirical Bayes point estimates of latent trait scores without knowledge of the trait distribution. Psychometrika, 1973, 38, 533554.CrossRefGoogle Scholar
Rao, C. R.. Linear statistical inference and its applications, 1965, New York: Wiley.Google Scholar
Rasch, G.. Probabilistic models for some intelligence and attainment tests, 1960, Copenhagen: Danmarks Paedagogiske Institut.Google Scholar
Rasch, G.. An item analysis which takes individual differences into account. British Journal of Mathematical and Statistical Psychology, 1966, 19, 4957.CrossRefGoogle ScholarPubMed
Robbins, H.. An empirical Bayes approach to statistics. Proceedings of the Third Berkeley Symposium on Mathematical and Statistical Probability, 1955, 1, 157164.Google Scholar
Robbins, H.. The empirical Bayes approach to statistical decision problems. Annals of Mathematical Statistics, 1964, 35, 120.CrossRefGoogle Scholar