Solving the Tower of Babel Problem for Patient-Reported Outcome Measures: Comments on: Linking Scores with Patient-Reported Health Outcome Instruments: A Validation Study and Comparison of Three Linking Methods

Jakob Bue Bjorner

doi:10.1007/s11336-021-09778-x


Published online by Cambridge University Press: 01 January 2025


Affiliations (Jakob Bue Bjorner*): QualityMetric Incorporated, LLC; University of Copenhagen; National Research Centre for the Working Environment
* Correspondence should be made to Jakob Bue Bjorner, QualityMetric Incorporated, LLC, 1301 Atwood Avenue, Suite 311N, Johnston, RI 02919, USA. Email: [email protected]


Abstract

The PROsetta Stone Project, summarized in this issue by Schalet et al. (Psychometrika 86, 2021), is a major step forward in enabling comparability between different patient-reported outcome measures. Schalet et al. clearly describe the psychometric methods used in the PROsetta Stone Project and other projects from the Patient-Reported Outcomes Measurement Information System (PROMIS): linking based on unidimensional item response theory (IRT), equipercentile linking, and calibrated projection based on multidimensional IRT. Analyses in a validation data set and simulation studies provide strong evidence that the linking methods are robust when basic assumptions are fulfilled. The links already established will be of great value to the field, and the methodology described by Schalet et al. will hopefully inspire the next series of linking studies. Potential improvements that new studies should consider include: (1) a thorough evaluation of the content of the measures to be linked, to better guide the evaluation of measurement assumptions, and (2) improvements in the design of linking studies, such as selecting the sample to provide data in the score ranges where linking precision is most critical and using counterbalanced designs to control for order effects. Finally, it may be useful to consider how the linking algorithms are used in subsequent data analyses. Analytic strategies based on plausible values or latent regression IRT models may be preferable to the simple transformation of scores one patient at a time.
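For readers less familiar with the methods named above, the following is a minimal sketch of equipercentile linking, one of the three approaches compared by Schalet et al. It is an illustration under simplifying assumptions, not the PROsetta Stone implementation: the function names (`percentile_rank`, `quantile`, `equipercentile_link`) are hypothetical, the scores are made-up toy data, and the smoothing steps that operational linking studies typically apply are omitted.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(scores, value):
    """Mid-percentile rank of `value` within `scores`, on a 0-1 scale."""
    ordered = sorted(scores)
    below = bisect_left(ordered, value)          # count of scores strictly below
    at = bisect_right(ordered, value) - below    # count of scores equal to value
    return (below + 0.5 * at) / len(ordered)


def quantile(scores, p):
    """Quantile of `scores` at proportion `p`, by linear interpolation."""
    ordered = sorted(scores)
    position = p * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def equipercentile_link(x_scores, y_scores, x_value):
    """Map `x_value` to the Y score that has the same percentile rank."""
    return quantile(y_scores, percentile_rank(x_scores, x_value))


# Toy data (assumed for illustration): the same 11 respondents answered
# instrument X (scores 0-10) and instrument Y (scores 20-40).
x_scores = list(range(11))
y_scores = [2 * x + 20 for x in x_scores]

linked = equipercentile_link(x_scores, y_scores, 5)
print(linked)
```

In this toy example both instruments are answered by the same respondents, as in a single-group linking design. The link maps a score on instrument X to the score on instrument Y with the same percentile rank; applying it one patient at a time is the "simple transformation" that the abstract contrasts with plausible-value and latent regression approaches.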

Keywords

linking; equating; item response theory; patient-reported outcomes; depression

Type: Application Reviews and Case Studies
Information: Psychometrika, Volume 86, Issue 3: Special Section: Advancing Methods to Assess Patient-Reported Outcomes: Lessons Learned from the Patient-Reported Outcomes Measurement Information System® (PROMIS®) Initiative, September 2021, pp. 747–753

DOI: https://doi.org/10.1007/s11336-021-09778-x
Copyright: © 2021 The Psychometric Society


References

Bjorner, J. B., Kosinski, M., & Ware, J. E., Jr. (2003). Using item response theory to calibrate the Headache Impact Test (HIT) to the metric of traditional headache scales. Quality of Life Research, 12(8), 981–1002.

Bjorner, J. B., Rose, M., Gandek, B., Stone, A. A., Junghaenel, D. U., & Ware, J. E., Jr. (2014). Method of administration of PROMIS scales did not significantly impact score level, reliability, or validity. Journal of Clinical Epidemiology, 67(1), 108–113.

Choi, S. W., Schalet, B., Cook, K. F., & Cella, D. (2014). Establishing a common metric for depressive symptoms: Linking the BDI-II, CES-D, and PHQ-9 to PROMIS depression. Psychological Assessment, 26(2), 513–527.

American Psychiatric Association. (2000). Diagnostic and statistical manual of mental disorders, fourth edition, text revision: DSM-IV-TR (4th ed., text rev.). Washington, DC: American Psychiatric Association.

Dorans, N. J. (2004). Equating, concordance, and expectation. Applied Psychological Measurement, 28(4), 227–246.

Fischer, H. F., & Rose, M. (2019). Scoring depression on a common metric: A comparison of EAP estimation, plausible value imputation, and full Bayesian IRT modeling. Multivariate Behavioral Research, 54(1), 85–99.

Holzapfel, N., Müller-Tasch, T., Wild, B., Jünger, J., Zugck, C., Remppis, A., & Löwe, B. (2008). Depression profile in patients with and without chronic heart failure. Journal of Affective Disorders, 105(1–3), 53–62.

Katzan, I. L., Fan, Y., Griffith, S. D., Crane, P. K., Thompson, N. R., & Cella, D. (2017). Scale linking to enable patient-reported outcome performance measures assessed with different patient-reported outcome measures. Value in Health, 20(8), 1143–1149.

Kim, J., Chung, H., Askew, R. L., Park, R., Jones, S. M. W., Cook, K. F., & Amtmann, D. (2017). Translating CESD-20 and PHQ-9 scores to PROMIS depression. Assessment, 24(3), 300–307.

Kolen, M. L., & Brennan, R. L. (2014). Test equating, scaling, and linking: Methods and practices (3rd ed.). New York, NY: Springer.

Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613.

Mislevy, R. J. (1991). Randomization-based inference about latent variables from complex samples. Psychometrika, 56, 177–196.

Mor, V., & Guadagnoli, E. (1988). Quality of life measurement: A psychometric tower of Babel. Journal of Clinical Epidemiology, 41(11), 1055–1058.

Orlando, M., Sherbourne, C. D., & Thissen, D. (2000). Summed-score linking using item response theory: Application to depression measurement. Psychological Assessment, 12(3), 354–359.

Pilkonis, P. A., Choi, S. W., Reise, S. P., Stover, A. M., Riley, W. T., Cella, D., & PROMIS Cooperative Group. (2011). Item banks for measuring emotional distress from the Patient-Reported Outcomes Measurement Information System (PROMIS®): Depression, anxiety, and anger. Assessment, 18(3), 263–283.

Schalet, B. D., Lim, S., Cella, D., & Choi, S. W. (2021). Linking scores with patient-reported health outcome instruments: A validation study and comparison of three linking methods. Psychometrika, 86.

van Knippenberg, F. C., & de Haes, J. C. (1988). Measuring the quality of life of cancer patients: Psychometric properties of instruments. Journal of Clinical Epidemiology, 41(11), 1043–1053.
