Validation of a tool to assess patient satisfaction, waiting times, healthcare utilization, and cost

Breda H. Eubank; Mark R. Lafave; Nicholas G. Mohtadi; David M. Sheps; J. Preston Wiley

doi:10.1017/S1463423619000094

Validation of a tool to assess patient satisfaction, waiting times, healthcare utilization, and cost

Published online by Cambridge University Press: 11 June 2019

Breda H. Eubank

Mark R. Lafave ,

Nicholas G. Mohtadi ,

David M. Sheps and

J. Preston Wiley

Show author details

Breda H. Eubank*: Affiliation:
Department of Health and Physical Education, Faculty of Health, Community, and Education, Mount Royal University, Calgary, AB, Canada
Mark R. Lafave: Affiliation:
Department of Health and Physical Education, Faculty of Health, Community, and Education, Mount Royal University, Calgary, AB, Canada
Nicholas G. Mohtadi: Affiliation:
Director Sport Medicine Centre, Faculty of Kinesiology, University of Calgary, Calgary, AB, Canada
David M. Sheps: Affiliation:
Division of Orthopaedics, Department of Surgery, University of Alberta, Edmonton, AB, Canada
J. Preston Wiley: Affiliation:
Sport Medicine Centre, Faculty of Kinesiology, University of Calgary, Calgary, AB, Canada
*: Author for correspondence: Breda H. Eubank, Assistant Professor, Department of Health and Physical Education, Faculty of Health, Community, and Education, Mount Royal University, 4825 Mount Royal Gate SW, Calgary, AB, Canada T3E 6K6. E-mail: [email protected]

Article contents

Abstract
Aim
Background
Methods
Findings
Background
Methods
Results
Discussion
Conclusion
Footnotes
References

Rights & Permissions

Abstract

Aim

Patients’ experience of the quality of care received throughout their continuum of care can be used to direct quality improvement efforts in areas where they are most needed. This study aims to establish validity and reliability of the Healthcare Access and Patient Satisfaction Questionnaire (HAPSQ) – a tool that collects patients’ experience that quantifies aspect of care used to make judgments about quality from the perspective of the Alberta Quality Matrix for Health (AQMH).

Background

The AQMH is a framework that can be used to assess and compare the quality of care in different healthcare settings. The AQMH provides a common language, understanding, and approach to assessing quality. The HAPSQ is one tool that is able to assess quality of care according to five of six AQMH’s dimensions.

Methods

This was a prospective methodologic study. Between March and October 2015, a convenience sample of patients presenting with chronic full-thickness rotator cuff tears was recruited prospectively from the University of Calgary Sport Medicine Centre in Calgary, Alberta, Canada. Reliability of the HAPSQ was assessed using test–retest reliability [interclass correlation coefficient (ICC)>0.70]. Validity was assessed through content validity (patient interviews, floor and ceiling effects), criterion validity (percent agreement >70%), and construct validity (hypothesis testing).

Findings

Reliability testing was completed on 70 patients; validity testing occurred on 96 patients. The mean duration of symptoms was three years (SD: 5.0, range: 0.1–29). Only out-of-pocket utilization possessed an ICC<0.70. Patients reported that items were relevant and appropriate to measuring quality of care. No floor or ceiling effects were present. Criterion validity was reached for all items assessed. A priori hypotheses were confirmed. The HAPSQ represents an inexpensive, reliable, and valid approach toward collecting clinical information across a patient’s continuum of care.

Keywords

patient satisfaction psychometrics quality of healthcare rotator cuff disease survey waiting times

Type: Research
Information: Primary Health Care Research & Development , Volume 20 , 2019 , e47

DOI: https://doi.org/10.1017/S1463423619000094 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © Cambridge University Press 2019

Background

Rotator cuff disorders (RCDs) ranks among the most prevalent of musculoskeletal disorders, yet treatment and management of these conditions are complex and a multitude of different treatment options exist (Yamaguchi, Reference Yamaguchi2011). RCDs include a broad spectrum of acute and chronic pathological conditions, including tendinopathy, calcific tendinopathy, and rotator cuff tears. Of the three RCD pathologies, rotator cuff tears represent a significant proportion of RCDs, and are likely the most expensive in terms of public healthcare expenditure because many patients receive surgery as a means of treatment. In particular, patients with chronic rotator cuff tears suffer lengthy waiting times; inefficient use of healthcare resources, and disjointed care (Chehade et al., Reference Chehade, Burgess and Bentley2011; Frank et al., Reference Frank, Marshall, Faris and Smith2011; Lau et al., Reference Lau, Lafave, Mohtadi and Butterwick2012; Mohtadi et al., Reference Mohtadi, Chan, Lau and Lafave2012; Marshall et al., Reference Marshall, Christiansen, Smith, Squire, Werle, Faris and Frank2015). Measuring and analyzing data on quality of care would help to identify gaps in the current clinical pathway and suggest ways of improving care for patients presenting to the healthcare system with chronic rotator cuff tears.

Assessing healthcare quality is the first step to improving care and service delivery. Healthcare is a complex system that is often inefficient, error-prone, and costly (Buchert and Butler, Reference Buchert and Butler2016). Pressured to improve healthcare quality and economic efficiencies, physicians are often criticized for being less connected to patient needs, values, and preferences (Schippits and Schippits, Reference Schippits and Schippits2013). Additionally, health decisions are becoming more complicated and patient care seems inconsistent with the availability of numerous clinical options (Schippits and Schippits, Reference Schippits and Schippits2013). Consequently, specific strategies are needed for quality improvements through healthcare reform, which can result in patient-centered care, considerable savings of resources, and expansion of services for the community (Peacock et al., Reference Peacock, Chan, Mangolini and Johansen2001). Measuring quality in healthcare is vital in evaluating patient outcomes and system performance before quality improvements can be achieved. Quality assessment can reveal the magnitude and nature of problems facing healthcare systems (Leatherman and Sutherland, Reference Leatherman and Sutherland2010), and offers one method for evaluating the impact of changes to the organization and financing of healthcare services (McGlynn, Reference McGlynn1997).

Measuring quality, however, is no simple task (McGlynn, Reference McGlynn1997). Often, there is little systematic information about the extent to which standard processes involved in healthcare – a key element of quality – are delivered (McGlynn et al., Reference McGlynn, Asch, Adams, Keesey, Hicks, DeCristofaro and Kerr2003). Furthermore, there exists a gap between what works and what is actually done (McGlynn et al., Reference McGlynn, Asch, Adams, Keesey, Hicks, DeCristofaro and Kerr2003). Given the complexity and diversity of the healthcare system, there is no simple solution. A key component of any solution to measuring and reporting on quality is the availability of reliable and valid information on performance at all levels (McGlynn et al., Reference McGlynn, Asch, Adams, Keesey, Hicks, DeCristofaro and Kerr2003). Before quality can be measured, however, it must first be defined (Donabedian, Reference Donabedian1988).

The Alberta Quality Matrix for Health (AQMH) is one approach to measuring, defining, and standardizing what quality healthcare means. It is an evidence-based approach that has been applied to other studies to assess and improve the quality of musculoskeletal care (Gooch et al., Reference Gooch, Smith, Wasylak, Faris, Marshall, Khong, Hibbert, Parker, Zernicke, Beaupre, Pearce, Johnston and Frank2009; Frank et al., Reference Frank, Marshall, Faris and Smith2011; Schull et al., Reference Schull, Guttmann, Leaver, Vermeulen, Hatcher, Rowe, Zwarenstein and Anderson2011). According to the AQMH, quality of care can be defined using six dimensions: accessibility, acceptability, efficiency, effectiveness, appropriateness, and safety (Health Quality Council of Alberta, Reference Health Quality Council2003). Accessible health services are defined as those ‘obtained in the most suitable setting in a reasonable time and distance’ (Health Quality Council of Alberta, Reference Health Quality Council2003). Acceptable health services are defined as being ‘respectful and responsive to user needs, preferences, and expectations’ (Health Quality Council of Alberta, Reference Health Quality Council2003). Efficient health services are defined as ‘resources that are optimally used in achieving desired outcomes’ (Health Quality Council of Alberta, Reference Health Quality Council2003). Effective health services are defined as being ‘based on scientific knowledge to achieve desired outcomes’ and refer to the efficacy of an intervention in providing the best outcome for the patient (Health Quality Council of Alberta, Reference Health Quality Council2003). Appropriate health services are defined as being ‘relevant to user needs and are based on accepted or evidence-based practice’ (Health Quality Council of Alberta, Reference Health Quality Council2003). In addition, safe health services are defined as being able to ‘mitigate risks to avoid unintended or harmful results’ (Health Quality Council of Alberta, Reference Health Quality Council2003).

The AQMH is only a theoretical framework that defines six quality dimensions. Therefore, it only offers a common language, understanding, and approach to assessing quality in healthcare (Health Quality Council of Alberta, Reference Health Quality Council2003). It is not a tool for gathering quantitative aspects of patients’ experience that can be used to evaluate care such as waiting times, patient satisfaction, health resource utilization, and care processes. Our group previously developed a tool for clinicians and healthcare teams to assess quality of care according to five of six AQMH’s dimensions – the Healthcare Access and Patient Satisfaction Questionnaire (HAPSQ) (Lau et al., Reference Lau, Lafave, Mohtadi and Butterwick2012; Mohtadi et al., Reference Mohtadi, Chan, Lau and Lafave2012). The HAPSQ does not assess effectiveness. To our knowledge, a search of the literature did not find any patient-report tools that were comparable and able to assess quality of care consistent with AQMH’s framework. Although the HAPSQ demonstrated good reliability and validity for patients with acute knee injuries (Lau et al., Reference Lau, Lafave, Mohtadi and Butterwick2012; Mohtadi et al., Reference Mohtadi, Chan, Lau and Lafave2012), reliability and validity are context and patient population-specific. Therefore, the psychometric properties of the HAPSQ were tested within the context of patients presenting to healthcare settings with chronic rotator cuff tears.

Methods

Early development

The HAPSQ was originally developed to measure the quality of care for patients presenting with acute knee injuries consistent with the AQMH (Lau, Reference Lau2009). The early development of the HAPSQ occurred between 2006 and 2008 using patients recruited from the University of Calgary Sport Medicine Centre in Calgary, Alberta, Canada (Lau, Reference Lau2009). The primary investigator (B.E.) initially generated a list of 39 fixed items. The initial item list was then circulated to a working group consisting of content experts and members of stakeholder groups. The working group examined the list for content validity and comprehensiveness, and modifications were made in response to the comments received. The revised list was then tested using expert focus groups and patient interviews. The HAPSQ experienced 19 iterations in which items were modified to improve clarity. HAPSQ (version 19) underwent reliability and validity testing (Lau, Reference Lau2009). Items that failed to meet test–retest reliability or possessed little variance were discarded resulting in 29 fixed items. The HAPSQ (version 20) has since been used to evaluate the quality of care in patients presenting with acute knee injuries (Lau et al., Reference Lau, Lafave, Mohtadi and Butterwick2012; Mohtadi et al., Reference Mohtadi, Chan, Lau and Lafave2012). The HAPSQ (version 20) is a web-based questionnaire. The web-based interface provided several advantages over traditional survey methods in terms of cost, speed, appearance, flexibility, functionality, and usability. The HAPSQ (version 20) was adapted to patients presenting with chronic rotator cuff tears by modifying questions with the word ‘knee’ to ‘shoulder.’ All other aspects of item organization and wording remained unchanged.

The HAPSQ

The HAPSQ is a self-administered, multipurpose web-based questionnaire that collects information related to healthcare utilization, access, and patient satisfaction. To ensure readability, the HAPSQ was designed at a 10th-grade reading level and the interface font was legible. A high-contrast design was created by using black text on a white background to ensure optimal legibility and esthetics (Hall and Hanna, Reference Hall and Hanna2004). Pop-up instructions and error messages eliminated nonresponse errors. Progressive indicators were also placed on the left-hand side of the screen to show respondents how far through the questionnaire they were. Individual items are grouped into sections of the questionnaire rather than scales: use of physician services (eg, GP/family physician, orthopedic surgeon) (four items); use of diagnostic investigations (three items); surgery (two items); use of complementary allied medical treatments (eg, physical therapy, massage therapy) (three items); out-of-pocket expenses (three items); lost wages (four items); patient satisfaction rating of care (two items); patient expectations around acceptable waiting times (one item); and demographic information (seven items). The HAPSQ has 29 fixed items, however, the total item count within several sections can vary depending on the quantity of services rendered or items purchased. For example, if the patient received care from two family physicians and one surgeon, then the number of items in that section would increase to 12 (three physicians × four items). If the patient purchased five out-of-pocket expenses, then the number of items in that section would increase to 15 (five expenses × three items). The HAPSQ is designed such that all items are required and must be answered. Therefore, patients are unable to submit their questionnaire until all required items have been answered. This reduces the potential for missing items.

The HAPSQ is a descriptive, health information tool. Therefore, it was not intended to provide one composite score. Instead, items from different sections are combined to provide health information, whereby results can be used to make a judgment about the accessibility, acceptability, efficiency, effectiveness, and safety of care. Table 1 summarizes items in the HAPSQ used to evaluate each of the dimensions. Items measuring waiting times and distance can be used to evaluate accessibility. Items measuring patient satisfaction can be used to evaluate acceptability. Items measuring healthcare consumption, direct costs, and indirect costs can be used to evaluate efficiency. Items relating to patient-suggested waiting times and utilization of healthcare resources can be used to assess appropriateness. Finally, safety can be evaluated by comparing actual clinical care pathways to ideal clinical care pathway algorithms (Eubank et al., Reference Eubank, Mohtadi, Lafave, Wiley, Bois, Boorman and Sheps2016). In this case, multiple items in the HAPSQ can be combined to map clinical pathways experienced by each patient. Clinical pathways detail steps in the care delivery process of each patient. Therefore, safety can be evaluated by comparing actual clinical pathways to ideal clinical pathway algorithms in order to identify unsafe practices.

Table 1 Items from the Healthcare Access and Patient Satisfaction Questionnaire (HAPSQ) mapped to Alberta Quality Matrix for Health’s quality dimensions

Five demographic questions are not mapped to a quality dimension, are for secondary analysis, and have not been included in Table 1. VAS scale: 100 mm visual analog scale from 0=‘extremely dissatisfied’ and 100=‘extremely satisfied’. Items are combined to map clinical pathways experienced by each patient, which are then used to evaluate the quality dimension: safety.

Design

Chronic rotator cuff tears were chosen because it ranks among the most prevalent of RCDs that present to the healthcare system (Kemp et al., Reference Kemp, Sheps, Luciak-Corea, Styles-Tripp, Buckingham and Beaupre2011; United States Bone and Joint Initiative, Reference United States Bone2014; Tashjian, Reference Tashjian2016; Jo et al., Reference Jo, Lee, Kim, Kim and Lee2017). Additionally, patients presenting with chronic rotator cuff tears are often treated using conservative, nonoperative management, or surgery (Bokor et al., Reference Bokor, Hawkins, Huckell, Angelo and Schickendantz1993; Kuhn et al., Reference Kuhn, Dunn, Sanders, An, Baumgarten, Bishop, Brophy, Carey, Holloway, Jones, Ma, Marx, McCarty, Poddar, Smith, Spencer, Vidal, Wolf and Wright2013; Boorman et al., Reference Boorman, More, Hollinshead, Wiley, Brett, Mohtadi, Nelson, Lo and Bryant2014; Kukkonen et al., Reference Kukkonen, Joukainen, Lehtinen, Mattila, Tuominen, Kauko and Aarimaa2015). Therefore, two groups of patients were targeted for this study to provide a representative sample of patients currently presenting to the healthcare system. The inclusion and exclusion criteria for this study are presented in Table 2. Between March and October 2015, a convenience sample of patients presenting with chronic rotator cuff tears was recruited prospectively from the University of Calgary Sport Medicine Centre. Patients were identified from new and follow-up referrals from primary care (eg, emergency room physicians, GPs/family physicians) and sport medicine physicians to three different orthopedic surgeons. Patients eligible for the study were recruited by the primary investigator (B.E.) during scheduled physician appointments. Group 1 included patients that did not require immediate surgical management and were treated conservatively with a nonoperative rehabilitation program (Boorman et al., Reference Boorman, More, Hollinshead, Wiley, Brett, Mohtadi, Nelson, Lo and Bryant2014). Group 2 consisted of surgically treated patients who had confirmed surgical dates or had already received surgical management for their shoulder problem. The goal was to first recruit 15 patients for pilot testing the HAPSQ. This sample size was suggested by Zukerberg et al. to be optimal for pilot testing (Zukerberg et al., Reference Zukerberg, Von Thurn and Moore1995). Once pilot testing was completed, recruitment of patients for reliability and validity testing began. The goal was to recruit patients until at least 35 pairs of questionnaires per group was obtained. According to Hertzog, who proposed that for a study to obtain a reliability estimate of at least 0.70, a sample size of at least 35 pairs of questionnaires should be analyzed (Hertzog, Reference Hertzog2008). This prospective methodologic study was approved by the Conjoint Health Research Ethics Board at the University of Calgary.

Table 2 Inclusion and exclusion criteria

Data analysis

Reliability of the HASPQ was assessed using test–retest reliability and the intraclass correlation coefficient (ICC) for continuous variables of the HAPSQ (13 items). Although there are six forms of ICCs, ICC (2,k) was chosen because it is a measure of agreement between two administrations where the raters are fixed (Shrout and Fleiss, Reference Shrout and Fleiss1979). An ICC of ⩾0.70 was deemed a good measure of reliability (Nunally and Bernstein, Reference Nunally and Bernstein1994). Patients were asked to complete two sets of questionnaires at least one week apart. Any patients with outstanding questionnaires were sent an email reminder at the three-week mark. This was deemed acceptable because the information collected by the HAPSQ was retrospective and thus stable.

Validity of the HAPSQ was assessed through content, criterion, and construct validity testing. Content validity was assessed using two methods. First, patients were interviewed about the relevance and comprehensiveness of the items in the questionnaire. Patients were asked to evaluate the HAPSQ for content, clarity, and readability. Second, content validity was assessed by calculating central tendency, distribution of scores, and floor and ceiling effects for each of the patient satisfaction rating of care items. Large floor and ceiling effects can be an indication that a scale is not valid (Mokkink et al., Reference Mokkink, Terwee, Knol, Stratford, Alonso, Patrick, Bouter and de Vet2010). Patient satisfaction surveys have often been criticized for possessing high ceiling effects (Cappelleri et al., Reference Cappelleri, Gerber, Kourides and Gelfand2000). Therefore, 30% was used as the cut-off for acceptable floor and ceiling effects, whereby floor and ceiling effects were indicated if more than 30% of respondents scored the lowest (0) or highest (100) possible score (Kane, Reference Kane2006).

Concurrent criterion validity was assessed by comparing eight items (eg, items relating to dates, use of physician services, use of diagnostic investigations, and surgery) with patient electronic medical records. The HAPSQ was deemed valid if there was at least 70% agreement with the reference standard (Jonsson and Svingby, Reference Jonsson and Svingby2007).

Construct validity was evaluated through hypothesis testing. Two hypotheses were developed a priori and tested. Studies that have analyzed the impact of waiting time on patient satisfaction scores have established that longer waiting times are negatively associated with clinical provider scores of patient satisfaction (Fournier et al., Reference Fournier, Heale and Rietze2012; Ansell et al., Reference Ansell, Crispo, Simard and Bjerre2017). Therefore, it was hypothesized that patients who experienced longer waiting times to treatment would have an inverse relationship to patient satisfaction scores with respect to time spent waiting for care (Hypothesis 1). Other studies have demonstrated preference in seeking specialist care over primary care for more complex medical conditions because they wanted to obtain the highest quality care (Lewis et al., Reference Lewis, Wickstrom, Kolar, Keyserling, Bognar, DuPre and Hayden2000). Therefore, it was also hypothesized that waiting times would have a negligible correlation to patient satisfaction scores with respect to quality of care received because levels of satisfaction were thought to be associated with level of competence in caring for chronic rotator cuff tears (Hypothesis 2). Pearson correlation coefficients were calculated for all hypotheses. An analysis of variance test was used to compare waiting times and patient satisfaction between physician groups.

A P-value of <0.05 was considered statistically significant for all analyses. All statistical analyses were performed using SPSS 17.0 software (SPSS Inc., Reference SPSS2007).

Results

A total of 15 patients were initially recruited for pilot testing the HAPSQ. Once pilot testing was completed, patient recruitment continued until at least 35 Group 1 patients (nonoperative, conservative management) and 35 Group 2 patients (surgical management) completed two sets of questionnaires. A total of 126 patients provided informed consent and were enrolled in the study before 35 pairs of questionnaires in each group were completed. Of these, 13 patients made no attempt to complete the questionnaire and were lost to follow-up, and 17 patients submitted only partially completed questionnaires, in which only the demographic page (Page 1) was completed. Information from these questionnaires was not included in the analysis. Questionnaires from 96 patients were included in validity testing, and questionnaires from 70 patients were included in reliability testing. The patients’ demographic and clinical characteristics are summarized in Table 3. For reliability testing (n=70), the average age was 58 years (SD: 9, range: 38–78). The patient population was 64% men (n=45), 90% Caucasian (n=63), and 26% retired (n=18); 39% (n=27) of patients reported an annual household income over $100,000. The mean duration of symptoms was three years (SD: 5.0, range: 0.1–25). For validity testing (n=96), the average age was 57 years (SD: 10, range: 27–78). The patient population was 62% men (n=59), 86% Caucasian (n=83), and 23% retired (n=22). Of these, 35% (n=34) of patients reported an annual household income over $100,000. The mean duration of symptoms was three years (SD: 5.0, range: 0.1–29).

Table 3 Patient demographics and clinical characteristics

Group 1: Patients that did not require immediate surgical management and were treated conservatively with a nonoperative, rehabilitation program. Group 2: Patients who had confirmed surgical dates or had already received surgery.

Test–retest reliability data for 13 items in the HAPSQ are presented in Table 4. The HAPSQ was completed on an average of 18 days apart (SD: 10, range: 7–39). The ICC (2,k) for various items ranged from 0.60 to 1.0. Only one item failed to reach an ICC>0.70. Out-of-pocket utilization volume possessed an ICC of 0.60. Although an ICC of 0.60 was below the cut-off value of 0.70, there were no other sources of information available to extract patient utilization with respect to out-of-pocket expenses incurred while suffering RCD. Therefore, this item was retained and used in subsequent analyses (Eubank et al., Reference Eubank, Lafave, Wiley, Sheps, Bois and Mohtadi2018). Date of surgery possessed an ICC of 1.0. All Group 2 patients were recruited within 1 year of receiving surgery, and therefore, a perfect score was accepted because it was thought the majority of patients would remember a major life-altering procedure such as surgery.

Table 4 Intraclass correlation coefficient (ICC) for continuous variables in the Healthcare Access and Patient Satisfaction Questionnaire

ICC: Intraclass correlation coefficient; CI: confidence interval.

The consensus from patient interviews during pilot testing was that the items in the HAPSQ were mostly thought to be relevant, appropriate, and comprehensive. Twelve patients said that they did not have difficulty completing the questionnaire, and found the questions clear and easy to understand. One patient said that it was hard to rate satisfaction with respect to time spent waiting because in his opinion, waiting for care was unavoidable. One patient was confused about the wording of a question with respect to what it meant by obtaining magnetic resonance imaging (MRI) in the public system or a private facility. This question was reworded to improve clarity and did not cause additional confusion in subsequent pretesting of the HAPSQ. Only one patient noted that it was hard to remember the dates for all of their tests and physician visits. When asked if anyone thought if there were irrelevant questions in the HAPSQ, all 15 patients said ‘no.’ However, two patients did comment that the questionnaire was quite lengthy. The average time it took to complete the HAPSQ was 19 min (range: 14–25 min). No patients suggested adding additional content.

The mean score for the patient satisfaction rating of care item with respect to quality of care was 82.34 (SD: 26.23, range: 0–100). For this item, only 2.1% responded 0 and 28.8% responded 100. The mean score for the patient satisfaction rating of care item with respect to waiting time was 64.89 (SD: 34.47, range: 0–100). For this item, only 4.0% responded 0, and 18.7% responded 100. Applying the 30% cut-off for floor and ceiling effect, neither item demonstrated any floor or ceiling effects.

Criterion validity was evaluated by calculating percent agreement between items relating to dates, use of physician services, use of diagnostic investigations, and surgery with patient electronic medical records. There was evidence of a consultation on the same date for 179/250 (72%) visits upon comparing patient-reported dates and the patients’ medical records. There was also evidence that patients accurately reported rendering 171/210 (81%) physician and diagnostic services. Lastly, there was evidence that 53 (76%) patients accurately reported the number and type of physicians they received care from.

The Pearson correlation coefficient, r, was used to evaluate construct validity on two hypotheses developed a priori. A significant inverse relationship was found between waiting time and patient satisfaction with respect to number of days waited, thus confirming Hypothesis 1. Specifically, the number of days spent waiting for diagnostic services (r=−0.40; P<0.001) and physician consultation (r=−0.41, P<0.001) resulted in lower patient satisfaction scores. There was no relationship between waiting time and patient satisfaction with respect to quality of care received (r=0.11, P=0.17), thus confirming Hypothesis 2.

Mean patient satisfaction with respect to quality of care was calculated. The mean patient satisfaction score was lowest for emergency room physicians at 67% (SD: 27) and highest for orthopedic surgeons at 89% (SD: 23). Mean patient satisfaction scores for GPs/family physicians and sport medicine physicians were 81 (SD: 26) and 82 (SD: 23), respectively. An analysis of variance demonstrated that patient satisfaction with respect to quality of care provided by a surgeon was significantly different between the other physician groups [F (3, 172)=3.23, P=0.02]. Tukey’s highest significant difference post-hoc test for significance demonstrated that patient satisfaction for surgeons was significantly higher than emergency room physicians (P=0.03).

Discussion

Assessment of healthcare requires the availability of reliable and valid data on health system performance (Kujala et al., Reference Kujala, Lillrank, Kronstrom and Peltokorpi2006). Reliable and valid data have the potential to guide quality improvement activities, redesign services, keep people and organizations accountable for their performance, change policy and practice, and inspire public debate (Leatherman and Sutherland, Reference Leatherman and Sutherland2010). In Canada, there are still major gaps between how information on health system performance is measured and monitored by several government agencies and health organizations (Health Council of Canada, Reference Health Council2012). Unfortunately, health services data are inaccurate and difficult to access, thus leaving decision makers with no consistent or comparable set of data to determine the impact of those services (Institute for Clinical Evaluative Sciences, Reference Institute for Clinical2012). Therefore, the goal of this study was to evaluate the reliability and validity of the HAPSQ in the context of patients presenting with chronic rotator cuff tears to healthcare settings in Alberta.

Reliability of the HAPSQ was confirmed using test–retest reliability. The ICC for all but one subscale in the HAPSQ was >0.70. Only out-of-pocket utilization volume possessed an ICC of 0.60. Although this may question the validity of this measure, there were no other sources of information available to extract patient utilization with respect to out-of-pocket expenses incurred while suffering a chronic rotator cuff tear.

Content validity was evaluated during pilot testing of the HAPSQ. Results from patient interviews indicated that the HAPSQ was relevant and comprehensive. Content validity was also assessed by calculating central tendency, distribution of scores, and floor and ceiling effects for each of the patient satisfaction rating of care items. Patient satisfaction was used as a measure of the patient’s perception of acceptable care. Studies have expressed concern with using patient satisfaction as a measure of quality, in that many surveys are prone to ceiling effects, which make it difficult to distinguish between the provision of simply adequate services from those providing superior care (Cappelleri et al., Reference Cappelleri, Gerber, Kourides and Gelfand2000; Sofaer and Firminger, Reference Sofaer and Firminger2005). Floor or ceiling effects were not found in the HAPSQ. Another criticism to using patient satisfaction is that it only represents one example of a patient perception, and by far, not the only means (Sofaer and Firminger, Reference Sofaer and Firminger2005). Therefore, studies have criticized that in using satisfaction as a measure of quality, one can never be too sure if variations in ratings from one patient to another are the result of differences in expectations or experiences (Sofaer and Firminger, Reference Sofaer and Firminger2005). However, patient satisfaction is a useful determinant in patient outcomes and compliance with treatment (Golin et al., Reference Golin, DiMatteo and Gelberg1996; Bartlett, Reference Bartlett2002). Additionally, Sofaer and Firminger suggest that asking very specific questions, such as ‘how satisfied they were with the waiting times’ may minimize the subjectivity and the confounding of patient expectations and their ratings (Sofaer and Firminger, Reference Sofaer and Firminger2005). Both patient satisfaction rating of care items in the HAPSQ were specific.

Evidence of concurrent criterion validity was demonstrated when patient-reported data were compared with electronic medical records as the reference standard and percent agreement occurred >70%. Construct validity was tested using two hypotheses developed a priori. Both hypotheses were confirmed. The findings that higher waiting times were moderately correlated to lower patient satisfaction scores, but unrelated to the quality of physician care received by the patient both supported construct validity. The findings that patient satisfaction with respect to quality of care were instead associated with the perceived competence level of the caregiver was confirmed, whereby mean patient satisfaction scores were highest for orthopedic surgeons and lowest for emergency room physicians.

Quality of care can be evaluated by collecting adequate, reliable, and valid data using patient self-report measures (Brook et al., Reference Brook, McGlynn and Shekelle2000). The HAPSQ is a tool that gathers quantitative aspects of a patient’s experience for use in evaluating the continuum of care for patients with chronic rotator cuff tears. Analyses of patient-reported outcome measures such as waiting times, patient satisfaction, health resource utilization, and other care processes can then be used to make a judgment about the accessibility, acceptability, efficiency, appropriateness, and safety of care received.

Although there were many strengths to this study, one limitation involved sampling bias. First, the sample population for the study was limited to patients presenting with chronic full-thickness rotator cuff tears. Therefore, the results presented in this study may not be representative of patients presenting with other RCD such as partial-thickness tears, acute tears, or tendinopathy of the rotator cuff. Second, all patients in this study were recruited from sport medicine clinics and seen by an orthopedic surgeon. Therefore, the results may not be generalizable to patients with RCDs that presented to other physician provider groups or complementary allied medical providers.

An argument could also be made about the accuracy of using computer-based patient medical records as a reference standard when assessing criterion validity. The medical record, however, is often viewed as the preferred data source for measuring processes of care and outcome measures (Tisnado et al., Reference Tisnado, Adams, Liu, Damberg, Chen, Hu, Carlisle, Mangione and Kahn2006). Additionally, several studies have demonstrated good to excellent congruency between self-report and electronic medical records (Rozario et al., Reference Rozario, Morrow-Howell and Proctor2004; Tisnado et al., Reference Tisnado, Adams, Liu, Damberg, Chen, Hu, Carlisle, Mangione and Kahn2006).

Despite these limitations, and considering the logistic challenges and increased expenses associated with measuring healthcare quality, the results of this study demonstrate that the HAPSQ represents an inexpensive, reliable, and valid approach toward collecting diagnostic and treatment information across a patient’s continuum of care. The above-mentioned approach to gathering data using the HAPSQ is feasible and likely to succeed. The challenge lies in using the data collected in order to generate additional work to inform practice, research, and policy. As such, there is a demand for reliable, valid, and relevant evidence on which to develop public policy (Swan and Boruch, Reference Swan and Boruch2004). Therefore, the development of a tool that can measure healthcare quality is a necessary first step in achieving quality improvements toward healthcare reform.

Conclusion

This tool represents the first step toward collecting waiting time, resource utilization, patient-reported outcome measures, and cost information at a provincial level for patients presenting to the healthcare system with chronic rotator cuff tears. This study found the HAPSQ to be psychometrically sound. The HAPSQ can thus serve as a cost-effective tool for evaluating health service quality.

Acknowledgments

The authorship would like to thank Dr Aaron Bois and Dr Richard Boorman for participating in study procedures.

Financial support

Breda Eubank is supported by the University of Calgary Sport Medicine Centre Simpson Endowment Graduate Studentship.

Conflicts of interest

The authors declare that they have no conflicts of interests.

Ethical standards

Ethics approval for this study was provided by the Conjoint Health Research Ethics Board at the University of Calgary (REB 14-1828) and the University of Alberta (Pro00059113). All study participants provided informed consent and received written information about the study.

Footnotes

Cite this article: Eubank BH, Lafave MR, Mohtadi NG, Sheps DM, Wiley JP. (2019) Validation of a tool to assess patient satisfaction, waiting times, healthcare utilization, and cost. Primary Health Care Research & Development. 20(e47): 1–8. doi: 10.1017/S1463423619000094

References

Ansell, D, Crispo, JAG, Simard, B Bjerre, LM (2017) Interventions to reduce wait times for primary care appointments: a systematic review. BMC Health Services Research 17, 295.10.1186/s12913-017-2219-yGoogle Scholar

Bartlett, JA (2002) Addressing the challenges of adherence. Journal of Acquired Immune Deficiency Syndrome 29 (Suppl 1), S2–S10.10.1097/00126334-200202011-00002Google Scholar

Bokor, DJ, Hawkins, RJ, Huckell, GH, Angelo, RL Schickendantz, MS (1993) Results of nonoperative management of full-thickness tears of the rotator cuff. Clinical Orthopaedics and Related Research 294, 103–110.10.1097/00003086-199309000-00013Google Scholar

Boorman, RS, More, KD, Hollinshead, RM, Wiley, JP, Brett, K, Mohtadi, NG, Nelson, AA, Lo, IK Bryant, D (2014) The rotator cuff quality-of-life index predicts the outcome of nonoperative treatment of patients with a chronic rotator cuff tear. Journal of Bone and Joint Surgery—American Volume 96, 1883–1888.10.2106/JBJS.M.01457Google Scholar

Brook, RH, McGlynn, EA Shekelle, PG (2000) Defining and measuring quality of care: a perspective from US researchers. International Journal of Quality Health Care 12, 281–295.10.1093/intqhc/12.4.281Google Scholar

Buchert, AR Butler, GA (2016) Clinical pathways: driving high-reliability and high-value care. Pediatrics Clinics of North America 63, 317–328.10.1016/j.pcl.2015.12.005Google Scholar

Cappelleri, JC, Gerber, RA, Kourides, IA Gelfand, RA (2000) Development and factor analysis of a questionnaire to measure patient satisfaction with injected and inhaled insulin for type 1 diabetes. Diabetes Care 23, 1799–1803.10.2337/diacare.23.12.1799Google Scholar

Chehade, MJ, Burgess, TA Bentley, DJ (2011) Ensuring quality of care through implementation of a competency-based musculoskeletal education framework. Arthritis Care Research (Hoboken) 63, 58–64.10.1002/acr.20329Google Scholar

Donabedian, A (1988) The quality of care. How can it be assessed? JAMA 260, 1743–1748.10.1001/jama.1988.03410120089033Google Scholar

Eubank, B, Lafave, M, Wiley, JP, Sheps, D, Bois, A Mohtadi, N (2018) Evaluating quality of care for patients with rotator cuff disorders. BMC Health Services Research 18, 569.10.1186/s12913-018-3375-4Google Scholar

Eubank, BH, Mohtadi, NG, Lafave, MR, Wiley, JP, Bois, AJ, Boorman, RS Sheps, DM (2016) Using the modified Delphi method to establish clinical consensus for the diagnosis and treatment of patients with rotator cuff pathology. BMC Medical Research Methodology 16, 56.10.1186/s12874-016-0165-8Google Scholar

Fournier, J, Heale, R Rietze, LL (2012) I can’t wait: advanced access decreases wait times in primary healthcare. Healthcare Quality 15, 64–68.Google Scholar

Frank, C, Marshall, D, Faris, P Smith, C (2011) Essay for the CIHR/CMAJ award: improving access to hip and knee replacement and its quality by adopting a new model of care in Alberta. Canadian Medical Association Journal 183, E347–E350.10.1503/cmaj.110358Google Scholar

Golin, CE, DiMatteo, MR Gelberg, L (1996) The role of patient participation in the doctor visit. Implications for adherence to diabetes care. Diabetes Care 19, 1153–1164.10.2337/diacare.19.10.1153Google Scholar

Gooch, KL, Smith, D, Wasylak, T, Faris, PD, Marshall, DA, Khong, H, Hibbert, JE, Parker, RD, Zernicke, RF, Beaupre, L, Pearce, T, Johnston, DW Frank, CB (2009) The Alberta Hip and Knee Replacement Project: a model for health technology assessment based on comparative effectiveness of clinical pathways. International Journal of Technology Assessment in Health Care 25, 113–123.10.1017/S0266462309090163Google Scholar

Hall, RH Hanna, P (2004) The impact of web page text-background colour combinations on readability, retention, aesthetics and behavioural intention. Behaviour and Information Technology 23, 183–195.10.1080/01449290410001669932Google Scholar

Health Council, of Canada (2012) Measuring and reporting on health system performance in Canada: Opportunities for improvement. Retrieved 24 May 2017 from http://publications.gc.ca/site/eng/423907/publication.html.Google Scholar

Health Quality Council, of Alberta (2003) Alberta quality matrix for health user guide. Health Quality Council of Alberta. Retrieved 5 May 2017 from file:///D|/Gowtham/01_CUP/CUP/01_AOP/PHC/PHC_EA/1900009/ http://hqca.ca/about/how-we-work/the-alberta-quality-matrix-for-health-1/.Google Scholar

Hertzog, M (2008) Considerations in determining sample size for pilot studies. Research in Nursing and Health 31, 180–191.10.1002/nur.20247Google Scholar

Institute for Clinical, Evaluative Sciences (2012) Quality monitor. 2012 report on Ontario’s health system. Health quality Ontario. Retrieved 24 May 2017 from http://www.hqontario.ca/portals/0/Documents/pr/qmonitor-full-report-2012-en.pdf.Google Scholar

Jo, YH, Lee, KH, Kim, SJ, Kim, J Lee, BG (2017) National trends in surgery for rotator cuff disease in Korea. Journal of Korean Medical Science 32, 357–364.10.3346/jkms.2017.32.2.357Google Scholar

Jonsson, A Svingby, G (2007) The use of scoring rubrics: reliability, validity, and educational consequences. Educational Research Review 2, 130–144.10.1016/j.edurev.2007.05.002Google Scholar

Kane, RL (2006) Understanding health care outcomes research. Sudbury, MA: Jones and Bartlett.Google Scholar

Kemp, KA, Sheps, DM, Luciak-Corea, C, Styles-Tripp, F, Buckingham, J Beaupre, LA (2011) Systematic review of rotator cuff tears in workers’ compensation patients. Occupational Medicine (Lond) 61, 556–562.10.1093/occmed/kqr068Google Scholar

Kuhn, JE, Dunn, WR, Sanders, R, An, Q, Baumgarten, KM, Bishop, JY, Brophy, RH, Carey, JL, Holloway, BG, Jones, GL, Ma, CB, Marx, RG, McCarty, EC, Poddar, SK, Smith, MV, Spencer, EE, Vidal, AF, Wolf, BR Wright, RW (2013) Effectiveness of physical therapy in treating atraumatic full-thickness rotator cuff tears: a multicenter prospective cohort study. Journal of Shoulder and Elbow Surgery 22, 1371–1379.10.1016/j.jse.2013.01.026Google Scholar

Kujala, J, Lillrank, P, Kronstrom, V Peltokorpi, A (2006) Time-based management of patient processes. Journal of Health Organization and Management 20, 512–524.10.1108/14777260610702262Google Scholar

Kukkonen, J, Joukainen, A, Lehtinen, J, Mattila, KT, Tuominen, EK, Kauko, T Aarimaa, V (2015) Treatment of nontraumatic rotator cuff tears: a randomized controlled trial with two years of clinical and imaging follow-up. Journal of Bone and Joint Surgery—American Volume 97, 1729–1737.10.2106/JBJS.N.01051Google Scholar

Lau, B (2009) Development and implementation of a Healthcare Access and Patient Satisfaction Questionnaire (HAPSQ) for measuring wait times, satisfaction, and costs with acute knee injury care in Alberta, Masters of Science Thesis. Faculty of Kinesiology. Calgary, Alberta: University of Calgary.Google Scholar

Lau, B, Lafave, M, Mohtadi, N Butterwick, D (2012) Utilization and cost of a new model of care for managing acute knee injuries: the Calgary Acute Knee Injury Clinic. BMC Health Services Research 12, 445.10.1186/1472-6963-12-445Google Scholar

Leatherman, S Sutherland, K (2010) Quality of healthcare in Canada: A chartbook. Canadian Foundation for Healthcare Improvement. Retrieved 24 May 2017 from http://www.cfhi-fcass.ca/SearchResultsNews/10-02-10/42054d49-16fb-4764-be05-1d03e6ff3bbb.aspx.Google Scholar

Lewis, CL, Wickstrom, GC, Kolar, MM, Keyserling, TC, Bognar, BA, DuPre, CT Hayden, J (2000) Patient preferences for care by general internists and specialists in the ambulatory setting. Journal of General Internal Medicine 15, 75–83.10.1046/j.1525-1497.2000.05089.xGoogle Scholar

Marshall, DA, Christiansen, T, Smith, C, Squire, HJ, Werle, J, Faris, P Frank, C (2015) Continuous quality improvement program for hip and knee replacement. American Journal of Medical Quality 30, 425–431.10.1177/1062860614540512Google Scholar

McGlynn, EA (1997) Six challenges in measuring the quality of health care. Health Affairs (Millwood) 16, 7–21.10.1377/hlthaff.16.3.7Google Scholar

McGlynn, EA, Asch, SM, Adams, J, Keesey, J, Hicks, J, DeCristofaro, A Kerr, EA (2003) The quality of health care delivered to adults in the United States. New England Journal of Medicine 348, 2635–2645.10.1056/NEJMsa022615Google Scholar

Mohtadi, N, Chan, D, Lau, B Lafave, M (2012) An innovative Canadian solution for improved access to care for knee injuries using “Non-Physician Experts”: the Calgary Acute Knee Injury Clinic. Rheumatology S2, https://doi.org/10.4172/2161-1149.S2-002.Google Scholar

Mokkink, LB, Terwee, CB, Knol, DL, Stratford, PW, Alonso, J, Patrick, DL, Bouter, LM de Vet, HC (2010) The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content. BMC Medical Research Methodology 10, doi.org/10.1186/1471-2288-10-22.Google Scholar

Nunally, J Bernstein, I (1994) Pscychometric theory, 3rd edn.. New York: McGraw-Hill.Google Scholar

Peacock, S, Chan, C, Mangolini, M Johansen, D (2001) Techniques for measuring efficiency in health services. Staff Working Paper. Productivity Commission. Retrieved 24 May 2017 from http://pc.gov.au/__data/assets/pdf_file/0018/60471/tmeihs.pdf.Google Scholar

Rozario, PA, Morrow-Howell, N Proctor, E (2004) Comparing the congruency of self-report and provider records of depressed elders’ service use by provider type. Medical Care 42, 952–959.10.1097/00005650-200410000-00003Google Scholar

Schippits, M Schippits, M (2013) Clinical pathways leading healthcare reform: transformational strategies for oncology and beyond. Journal of Medicine and the Person 11, 62–68.10.1007/s12682-013-0151-4Google Scholar

Schull, MJ, Guttmann, A, Leaver, CA, Vermeulen, M, Hatcher, CM, Rowe, BH, Zwarenstein, M Anderson, GM (2011) Prioritizing performance measurement for emergency department care: consensus on evidence-based quality of care indicators. CJEM 13, 300–343.10.2310/8000.2011.110334Google Scholar

Shrout, PE Fleiss, JL (1979) Intraclass correlations: uses in assessing rater reliability. Psychology Bulletin 86, 420–428.10.1037/0033-2909.86.2.420Google Scholar

Sofaer, S Firminger, K (2005) Patient perceptions of the quality of health services. Annual Review of Public Health 26, 513–559.10.1146/annurev.publhealth.25.050503.153958Google Scholar

SPSS, Inc (2007) Statistical Software: Release 17.0. Chicago, IL: SPSS Inc. [computer program].Google Scholar

Swan, BA Boruch, RF (2004) Quality of evidence: usefulness in measuring the quality of health care. Medical Care 42, II12–II20.10.1097/01.mlr.0000109123.10875.5cGoogle Scholar

Tashjian, RZ (2016) The natural history of rotator cuff disease: evidence in 2016. Techniques in Shoulder and Elbow Surgery 17, 132–138.10.1097/BTE.0000000000000109Google Scholar

Tisnado, DM, Adams, JL, Liu, H, Damberg, CL, Chen, WP, Hu, FA, Carlisle, DM, Mangione, CM Kahn, KL (2006) What is the concordance between the medical record and patient self-report as data sources for ambulatory care? Medical Care 44, 132–140.10.1097/01.mlr.0000196952.15921.bfGoogle Scholar

United States Bone, and Joint Initiative (2014) The burden of musculoskeletal diseases in the United States (BMUS), third edition. Rosemont, IL. Retrieved from 19 January 2018 http://www.boneandjointburden.org/.Google Scholar

Yamaguchi, K (2011) New guideline on rotator cuff problems. AAOS Now 5, 1–4.Google Scholar

Zukerberg, AL, Von Thurn, DR Moore, JC (1995) Practical considerations in sample size selection for behavior coding pretests. Proceedings of the Section on Survey Research Methods. American Statistical Association. Retrieved from 19 January 2018 http://www.amstat.org/sections/srms/Proceedings/papers/1995_194.pdf.Google Scholar

Table 1 Items from the Healthcare Access and Patient Satisfaction Questionnaire (HAPSQ) mapped to Alberta Quality Matrix for Health’s quality dimensions

Table 2 Inclusion and exclusion criteria

Table 3 Patient demographics and clinical characteristics

Table 4 Intraclass correlation coefficient (ICC) for continuous variables in the Healthcare Access and Patient Satisfaction Questionnaire

Article contents

Validation of a tool to assess patient satisfaction, waiting times, healthcare utilization, and cost

Abstract

Keywords

Background

Methods

Early development

The HAPSQ

Design

Data analysis

Results

Discussion

Conclusion

Acknowledgments

Financial support

Conflicts of interest

Ethical standards

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests