Nominal Scale Agreement Among Observers

Hubert J. A. Schouten

doi:10.1007/BF02294066

Published online by Cambridge University Press: 01 January 2025

Hubert J. A. Schouten*
Affiliation: Institute of Biostatistics, Erasmus University Rotterdam
* Requests for reprints should be sent to Hubert J. A. Schouten, Department of Medical Informatics and Statistics, University of Limburg, PO Box 616, 6200 MD Maastricht, THE NETHERLANDS.

Abstract

Experiments are considered where each of a sample of subjects is assigned to one of C categories separately by each of a fixed or varying group of observers. Building on earlier publications, general procedures are proposed to analyze agreements and disagreements among observers. In the case of a varying group of observers, it is shown that it is not necessary to demand a constant number of observers per subject. In the case of a fixed group of observers, the problem of missing data is considered.

The procedures are illustrated within the context of two clinical diagnosis examples. In the first example it is investigated which categories are relatively hard to distinguish from one another; a new theorem is applied that shows a useful property of the statistic kappa. In the second example it is investigated if a subgroup of observers can be found with a significantly higher degree of interobserver agreement.
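As background for the statistic named in the abstract, here is a minimal sketch of Cohen's (1960) kappa for two observers. It is not the procedure proposed in this paper. The function name `cohen_kappa`, the use of `None` for a missing rating, and the choice to drop subjects not rated by both observers are all assumptions made for illustration, and the data are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's (1960) kappa for two observers; None marks a missing rating."""
    # Assumption of this sketch: keep only subjects rated by both observers.
    pairs = [(a, b) for a, b in zip(ratings_a, ratings_b)
             if a is not None and b is not None]
    n = len(pairs)
    if n == 0:
        raise ValueError("no subject was rated by both observers")

    # Observed proportion of agreement.
    p_o = sum(a == b for a, b in pairs) / n

    # Chance-expected agreement from each observer's marginal counts.
    count_a = Counter(a for a, _ in pairs)
    count_b = Counter(b for _, b in pairs)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_o - p_e) / (1 - p_e)


# Hypothetical ratings of six subjects into categories A, B, C.
obs1 = ["A", "A", "B", "B", "C", None]
obs2 = ["A", "B", "B", "B", "C", "C"]
kappa = cohen_kappa(obs1, obs2)
```

With these hypothetical ratings, five subjects are rated by both observers, observed agreement is 0.8, chance agreement is 0.36, and kappa is about 0.69. Dropping incompletely rated subjects is the simplest way to handle missing data and is used here only to keep the sketch short.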

Keywords

agreement, kappa, missing data

Type: Original Paper
Information: Psychometrika, Volume 51, Issue 3, September 1986, pp. 453–466

DOI: https://doi.org/10.1007/BF02294066
Copyright: © 1986 The Psychometric Society

Footnotes

The author gratefully acknowledges the valuable suggestions by W. Molenaar, R. van Strik, R. Popping and the referees.

References

Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37–46.

Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213–220.

Conger, A. J. (1980). Integration and generalization of kappas for multiple raters. Psychological Bulletin, 88, 322–328.

Efron, B. (1982). The jackknife, the bootstrap and other resampling plans. Philadelphia: SIAM.

Efron, B., Gong, G. (1983). A leisurely look at the bootstrap, the jackknife, and cross-validation. The American Statistician, 37, 36–48.

Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378–382.

Fleiss, J. L., Davies, M. (1982). Jackknifing functions of multinomial frequencies, with an application to a measure of concordance. American Journal of Epidemiology, 115, 841–845.

James, I. R. (1983). Analysis of nonagreements among multiple raters. Biometrics, 39, 651–657.

Kraemer, H. C. (1980). Extension of the kappa coefficient. Biometrics, 36, 207–216.

Parr, W. C., Tolley, H. D. (1982). Jackknifing in categorical data analysis. The Australian Journal of Statistics, 24, 67–79.

Schouten, H. J. A. (1980). Measuring pairwise agreement among many observers. Biometrical Journal, 22, 497–504.

Schouten, H. J. A. (1982). Measuring pairwise agreement among many observers, II: Some improvements and additions. Biometrical Journal, 24, 431–435.

Schouten, H. J. A. (1982). Measuring pairwise interobserver agreement when all subjects are judged by the same observers. Statistica Neerlandica, 36, 45–61.

Schouten, H. J. A. (1985). Statistical Measurement of Interobserver Agreement. Unpublished doctoral dissertation, Erasmus University Rotterdam.

Van den Berge, J. H., Schouten, H. J. A., Boomstra, S., van Drunen Littel, S., Braakman, R. (1979). Interobserver agreement in assessment of ocular signs in coma. Journal of Neurology, Neurosurgery and Psychiatry, 42, 1163–1168.
