The Paris 1976 Wine Tastings Revisited Once More: Comparing Ratings of Consistent and Inconsistent Tasters

Domenic V. Cicchetti

doi:10.1017/S193143610000016X

The Paris 1976 Wine Tastings Revisited Once More: Comparing Ratings of Consistent and Inconsistent Tasters

Published online by Cambridge University Press: 08 June 2012

Domenic V. Cicchetti

Show author details

Domenic V. Cicchetti: Affiliation:
Yale Home Office, 94 Linsley Lake Road, North Branford, CT 06471; e-mail address:[email protected].

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

In the author's earlier research, five quite reliable and six quite unreliable subsets of tasters were identified, from among the full sample of eleven wine tasters, at the heralded 1976 Paris blind Chardonnay and Bordeaux/Cabernets wine competitions. This study shows quite conclusively that the consistent tasters and the inconsistent ones provided quite different results when compared both to each other and to the results based upon the full sample of eleven tasters. Results demonstrate the following: one should be wary of findings based solely upon an omnibus approach (i.e., results based only on the full sample of 11 tasters); that a next logical step is not only to continue to identify consistent tasters, but to design future studies in which these reliable judges are used to teach neophyte imbibers to also achieve high levels of wine tasting consistency; and that in continuing to investigate other important empirically derived oenological information, we should not, in the process, lose sight of the sheer hedonic pleasure of the next glass of wine.

Type: Articles
Information: Journal of Wine Economics , Volume 1 , Issue 2 , Fall 2006 , pp. 125 - 140

DOI: https://doi.org/10.1017/S193143610000016X [Opens in a new window]
Copyright: Copyright © American Association of Wine Economists 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Ashenfelter, O. and Quandt, R. (1999). Analyzing a wine tasting statistically. Chance, 12, 16–20.CrossRef Google Scholar

Bartko, J.J. (1966). The intraclass correlation coefficient as a measure of reliability. Psychological Reports, 19, 3–11.CrossRef Google Scholar PubMed

Bartko, J.J. (1976). On various intraclass correlation reliability coefficients. Psychological Bulletin, 83, 762–765.CrossRef Google Scholar

Borenstein, M. (1998). The shift from significance testing to effect size estimation. In: Bellak, A.S. and Hersen, M. (Series Eds.) and Schooler, N. (Vol. Ed.), Research and Methods, Vol. 3, Comprehensive Clinical Psychology. New York, NY: Pergamon. 313–349.CrossRef Google Scholar

Borenstein, M., Rothstein, H., and Cohen, J. (2001). Power and Precision: A Computer Program for Statistical Power Analysis and Confidence Intervals. Englewood, NJ: Biostat, Inc.Google Scholar

Cicchetti, D.V. (1981). Testing the normal approximation and minimal sample size requirements of weighted kappa when the number of categories is large. Applied Psychological Measurement, 5, 101–104.CrossRef Google Scholar

Cicchetti, D.V. (2001). The precision of reliability and validity estimates re-visited: Distinguishing between clinical and statistical significance of sample size requirements. Journal of Clinical and Experimental Neuropsychology, 23, 695–700.CrossRef Google Scholar PubMed

Cicchetti, D.V. (2004a). Who won the 1976 blind tasting of French Bordeaux and US cabernets? Parametrics to the rescue. Journal of Wine Research, 15, 211–220.CrossRef Google Scholar

Cicchetti, D.V. (2004b). On designing experiments and analyzing data to assess the reliability and accuracy of blind wine tastings. Journal of Wine Research, 15, 221–226.CrossRef Google Scholar

Cicchetti, D.V. (2006). The 1976 blind wine tastings: On the consistency of tasters from chardonnays to cabernets. Bordeaux: Vineyard Data Quantification Society. Mimeo.Google Scholar

Cicchetti, D.V., Bronen, R., Spencer, S., Haut, S., Berg, A., Oliver, P., and Tyrer, P. (2006). Rating scales, scales of measurement, issues of reliability: Resolving some critical issues for clinicians and researchers. Journal of Nervous and Mental Disease, 194, 557–564.CrossRef Google Scholar PubMed

Cicchetti, D.V. and Fleiss, J.L. (1977). Comparison of the null distributions of weighted kappa and the C ordinal statistic. Applied Psychological Measurement, 1, 195–201.CrossRef Google Scholar

Cicchetti, D.V. and Showalter, D. (1997). A computer program for assessing interexaminer agreement when multiple ratings are made on a single subject. Psychiatry Research, 72, 65–68.CrossRef Google Scholar PubMed

Cicchetti, D.V., Showalter, D., and Rosenheck, R. (1997). A new method for assessing interexaminer agreement when multiple ratings are made on a single subject: Applications to the assessment of neuropsychiatric symptomatology. Psychiatry Research, 72, 51–63.CrossRef Google Scholar

Cicchetti, D.V. and Sparrow, S.S. (1981). Developing criteria for establishing interrater reliability of specific items: Applications to assessment of adaptive behavior. American Journal of Mental Deficiency, 86, 127–137.Google Scholar PubMed

Cicchetti, D.V., Volkmar, F., Klin, A., and Showalter, D. (1995). Diagnosing autism using ICD-10 criteria: A comparison of neural networks and standard multivariate procedures. Child Neuropsychology, 1, 26–37.CrossRef Google Scholar

Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 23, 37–46.CrossRef Google Scholar

Cohen, J. (1968). Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213–220.CrossRef Google Scholar PubMed

Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. Hillsdale, NJ: Lawrence Erlbaum Associates.Google Scholar

Everitt, B.S. (1968). Moments of the statistics kappa and weighted kappa. British Journal of Mathematical and Statistical Psychology, 21, 97–103.CrossRef Google Scholar

Fleiss, J.L. and Cicchetti, D.V. (1978). Inference about weighted kappa in the non-null case. Applied Psychological Measurement, 2, 113–117.CrossRef Google Scholar

Fleiss, J.L. and Cohen, J. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of agreement. Educational and Psychological Measurement, 33, 613–619.CrossRef Google Scholar

Fleiss, J.L., Cohen, J., and Everitt, B.S. (1969). Large sample standard errors of kappa and weighted kappa. Psychological Bulletin, 72, 323–327.CrossRef Google Scholar

Fleiss, J.L., Levin, B., and Paik, C. (2003). Statistical Methods for Rates and Proportions. 3rd edition. New York, NY: John Wiley and Sons.CrossRef Google Scholar

Fleiss, J.L., Nee, J.C.M., and Landis, J.R. (1979). Large sample variance in the case of different sets of raters. Psychological Bulletin, 86, 974–977.CrossRef Google Scholar

Landis, R.J. and Koch, G.G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33, 159–174.CrossRef Google Scholar PubMed

Leach, C. (1979). Introduction to Statistics: A Nonparametric Approach for the Social Sciences. New York, NY: John Wiley and Sons.Google Scholar

Lindley, D.V. (2006). Analysis of a wine tasting. Journal of Wine Economics, 1, 33–41.CrossRef Google Scholar

McCarthy, P.L., Cicchetti, D.V., Sznajderman, S.D., Forsyth, B.C., Baron, M.A., Fink, H.D., Czarkowski, N., Bauchner, H., and Lustman-Findling, K. (1991). Demographic, clinical and psychosocial predictors of the reliability of mothers' clinical judgments. Pediatrics, 88, 1041–1046.CrossRef Google Scholar PubMed

Parker, R.M., with Rovani, P.A. (2002). Parker's Wine Buyer's Guide. New York, NY: Simon and Schuster.Google Scholar

Rosenthal, R. (1991). Meta-Analytic Procedures for Social Research. Newbury Park, CA: Sage Publications (Revised Ed.).CrossRef Google Scholar

Rosenthal, R. and Rubin, D. (1979). A note on percent variance as a measure of the importance of effects. Journal of Applied Social Psychology, 9, 395–396.CrossRef Google Scholar

Rosenthal, R. and Rubin, D. (1982). A simple, general purpose display of magnitude of experimental effect. Journal of Educational Psychology, 74, 166–169.CrossRef Google Scholar

Stevens, S.S. (1951). Mathematics, measurement, and psychophysics. In: Stevens, S.S. (ed.). Handbook of Experimental Psychology. New York, NY: John Wiley and Sons. Ch. 1, 1–49.Google Scholar

Stevens, S.S. (1957). On the psychophysical law. Psychological Review, 14, 153–181.CrossRef Google Scholar

van Belle, G. (2002). Statistical Rules of Thumb. New York, NY: John Wiley and Sons.Google Scholar

Volkmar, F.R., Cicchetti, D.V., Dykens, E., Sparrow, S.S., Leckman, J.F., and Cohen, D.J. (1988). An evaluation of the autism behavior checklist. Journal of Autism and Developmental Disorders, 18, 81–97.CrossRef Google Scholar PubMed

von Wieser, F. (1893). Natural Value. New York, NY: Macmillan (English edition).Google Scholar

Article contents

The Paris 1976 Wine Tastings Revisited Once More: Comparing Ratings of Consistent and Inconsistent Tasters

Abstract

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests