ARE POLICY DECISIONS ON SURGICAL PROCEDURES INFORMED BY ROBUST ECONOMIC EVIDENCE? A SYSTEMATIC REVIEW

Roberta Ara; Hasan Basarir; Anju D. Keetharuth; Marco Barbieri; Helen L.A. Weatherly; Mark J.S. Sculpher; Hashim Ahmed; Steven Brown

doi:10.1017/S0266462314000531

ARE POLICY DECISIONS ON SURGICAL PROCEDURES INFORMED BY ROBUST ECONOMIC EVIDENCE? A SYSTEMATIC REVIEW

Published online by Cambridge University Press: 13 November 2014

Helen L.A. Weatherly ,

Mark J.S. Sculpher ,

Hashim Ahmed and

Steven Brown

Show author details

Roberta Ara: Affiliation:
ScHARR, Senior Research Fellow, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA
Hasan Basarir: Affiliation:
ScHARR, Research Fellow, The University of Sheffield, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA
Anju D. Keetharuth: Affiliation:
ScHARR, Research Associate, The University of Sheffield, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA
Marco Barbieri: Affiliation:
Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK
Helen L.A. Weatherly: Affiliation:
Senior Research Fellow, Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK
Mark J.S. Sculpher: Affiliation:
Professor of Health Economics, Deputy Director of the Policy Research Unit in Economic Evaluation of Health and Care Interventions (EEPRU), Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK
Hashim Ahmed: Affiliation:
MRC Clinician Scientist and Honorary Consultant Urological Surgeon, University College Hospital, 235 Euston Road, London, NW1 2BU
Steven Brown: Affiliation:
Consultant Colorectal Surgeon, Sheffield Teaching Hospitals, NHS Foundation Trust, Northern General Hospital, Herries Road, Sheffield, S5 7AU

Article contents

Abstract
METHODS
RESULTS
DISCUSSION
CONCLUSION
CONTACT INFORMATION
CONFLICTS OF INTEREST
References

Rights & Permissions

Abstract

Objectives: The aim of this study was to examine the empirical and methodological cost-effectiveness evidence of surgical interventions for breast, colorectal, or prostate cancer.

Methods: A systematic search of seven databases including MEDLINE, EMBASE, and NHSEED, research registers, the NICE Web site and conference proceedings was conducted in April 2012. Study quality was assessed in terms of meeting essential, preferred and UK NICE specific requirements for economic evaluations.

Results: The seventeen (breast = 3, colorectal = 7, prostate = 7) included studies covered a broad range of settings (nine European; eight non-European) and six were published over 10 years ago. The populations, interventions and comparators were generally well defined. Very few studies were informed by literature reviews and few used synthesized clinical evidence. Although the interventions had potential differential effects on recurrence and mortality rates, some studies used relatively short time horizons. Univariate sensitivity analyses were reported in all studies but less than a third characterized all uncertainty with a probabilistic sensitivity analysis. Although a third of studies incorporated patients’ health-related quality of life data, only four studies used social tariff values.

Conclusions: There is a dearth of recent robust evidence describing the cost-effectiveness of surgical interventions in the management of breast, colorectal and prostate cancers. Many of the recent publications did not satisfy essential methodological requirements such as using clinical evidence informed by a systematic review and synthesis. Given the ratio of potential benefit and harms associated with cancer surgery and the volume of resources consumed by these, there is an urgent need to increase economic evaluations of these technologies.

Keywords

Economic evaluation Technology assessment Review Surgical procedures Neoplasms

Type: Assessments
Information: International Journal of Technology Assessment in Health Care , Volume 30 , Issue 4 , October 2014 , pp. 381 - 393

DOI: https://doi.org/10.1017/S0266462314000531 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2014

Health technology assessments (HTA) provide a connection between the research and decision-making worlds and over the last 20 years have become central to informing healthcare policy decision making in many settings worldwide (Reference Sorenson, Drummond and Kanavos1). The HTA process involves a systematic evaluation of the direct (intended) and indirect (unintended) effects and impacts of healthcare technologies. Organizations such as the Pharmaceutical Benefits Advisory Committee (PBAC) in Australia, the Heath Care Insurance Board (CVZ) in the Netherlands, and the National Institute for Health and Care Excellence (NICE) in the United Kingdom now publish guidelines to ensure consistency and transparency in the HTA decision-making and reimbursement process (2–4).

Once the therapeutic value of novel interventions is established, evidence of value for money and efficiency is required before allocation of scarce healthcare resources in many settings. To ensure consistency and increase the quality of evidence submitted, NICE provides detail of a “reference case” for economic evidence used to support submissions (4). While some of NICE's requirements (such as discount rates and perspectives on costs) differ from those stipulated for non-UK settings, many are essential requirements for any economic evaluation in healthcare. These are clearly defined, relevant comparators; systematic literature reviews for health effects; synthesis of clinical evidence where applicable; suitable time horizons; health related quality of life (HRQoL) data reported by patients; and exploration of uncertainty (Reference Johannesson, Jonsson and Jonsson5–Reference Lopez-Bastida, Oliva and Antonanzas7). An often recommended criterion, which is not standard across the board, is that results are reported in terms of incremental cost per quality-adjusted life-year (QALY) whereby the HRQoL weights for the QALY are valued by the general population (4;Reference Gold, Siegel and Russell8).

The last decade has seen a rapid growth in the volume of literature describing economic evaluations of interventions in healthcare. In particular the number of articles describing cost-effectiveness evaluations in cancer has increased substantially as new pharmaceuticals are brought to market. Surgery, radiotherapy, and chemotherapy are the main treatments for cancer (9), and considerable proportions of healthcare budgets are consumed by these interventions. In 2010–2011, cancer was the third largest category of the overall UK healthcare budget, accounting for 5.4 percent of the overall cost at £5.8 bn (EUR 7.24 bn) (Reference Harker10), while total economic costs attributed to colorectal, breast, and prostate cancers were estimated to be £1.6 bn (EUR 2.0 bn), £1.5 bn (EUR 1.87 bn) and £0.6 bn (EUR 0.75 bn), respectively (Reference Leal11). Although the clinical and cost-effectiveness of novel pharmaceutical interventions for cancer is well documented due to the submission processes for reimbursement in many settings, it is unclear whether existing and novel surgical and radiotherapy interventions are treated with the same rigorous scrutiny.

The objective of the study was to examine the methodological and empirical cost-effectiveness evidence specifically in relation to surgical interventions in breast, colorectal and prostate cancer. Relevant literature was identified through systematic searches and methodological standards were assessed using quality criteria, both essential and preferred, and informed by the NICE reference case.

METHODS

Search Strategy

Systematic searches were undertaken (April 2012) in the following databases: Medline, EMBASE, CDSR, NHSEED, HTA, DARE, and EconLit. The CEA registry, the NICE website, recent conference proceedings, and reference lists of any included studies and existing reviews were also searched. No language, publication, or date restrictions were applied to the searches. The full search strategy is available online (Reference Ara, Barbieri and Weatherly12).

Study Selection Strategy

Studies were selected for inclusion in two stages. An initial sieve was conducted on title and abstract and full articles were retrieved for any potentially relevant studies. Studies were included in the reviews if they assessed the cost-effectiveness of a surgical technique in individuals with either breast, colorectal, or prostate cancer, and presented results in the form of an incremental cost per QALY or an incremental cost per life-year [LY]). Data were extracted using a customized template designed to capture relevant information (Table 1–2). Studies were quality appraised (Figure 2) in terms of meeting essential (items 1–7: patient group and indication clearly defined, comparators clearly defined, effectiveness evidence based on a systematic review, appropriate time horizon, HRQoL data reported directly by patients or carers, probabilistic sensitivity analysis used to quantify uncertainty), preferred (items 8–9: cost-effectiveness analysis using QALYs, HRQoL preference data valuation by representative sample of the public), and UK NICE specific (items 10–14: 3.5 percent discount rate, comparators used in NHS, UK setting, UK NHS and personal social services costs, cost per QALY below the NICE threshold value, assumed to be £30 k per QALY for evaluations in cancer (i.e., society is willing to pay £30k [EUR 37.4 k] for one additional QALY) requirements for economic evaluations in healthcare (4). Many of the criteria that were applied are relevant to economic models irrespective of setting and the preferred items overlap with those in published quality checklists (Reference Gold, Siegel and Russell8;Reference Drummond, Sculpher and Torrance13). The evidence was reviewed by a single researcher and any issues were discussed within the review team until a consensus was achieved in cases of uncertainty or lack of clarity.

Table 1. Characteristics of the Studies Included

BCS, breast conserving surgery; BSCRT, breast conserving surgery with radiation; CEA, cost-effectiveness analysis; LRARP, laparoscopic remotely assisted radical prostatectomy; LS, laparoscopic surgery; LY, life-year; MRM, modified radical mastectomy; MR, magnetic resonance; OS, open surgery; QALY, quality adjusted life year; RP, radical prostatectomy; RALP, Robot-assisted laparoscopic prostatectomy; RRP, retropubic radical prostatectomy; WW, watchful waiting.

Table 2. Methods and Study Conclusions

BCS, breast conserving surgery; BSCRT, breast conserving surgery with radiation; CEA, cost-effectiveness analysis; LRARP, laparoscopic remotely assisted radical prostatectomy; LS, laparoscopic surgery; LY, life year; MRM, modified radical mastectomy; MR, magnetic resonance; OS, open surgery; QALY, quality-adjusted life-year; RP, radical prostatectomy; RALP, Robot-assisted laparoscopic prostatectomy; RRP, retropubic radical prostatectomy; WW, watchful waiting.

RESULTS

The searches identified 1,133, 2,408, and 722 unique references for the breast, colorectal and prostate cancer reviews, respectively (Figure 1). Seventy-eight full papers were retrieved for detailed inspection. Of these, sixteen studies met the inclusion criteria (Table 1). One additional study published after completion of the searches was identified and has been included in this article.

Figure 1. Flow diagram for study inclusion.

Three studies were in breast cancer (publication dates 1997 to 2003) (Reference Malin, Keeler and Wang14–Reference Polsky, Mandelblatt and Weeks16), seven were in colorectal cancer (publication dates 2000 to 2011) (Reference Callejo, Guerra and Reza17–Reference Murray, Lourenco and de Verteuil23) and seven were in prostate cancer (publication dates 2000 to 2012) (Reference Calvert, Morgan and Catto24–Reference Ramsay, Pickard and Robertson30). Two of the three breast cancer studies were set in the United States (Reference Malin, Keeler and Wang14;Reference Polsky, Mandelblatt and Weeks16), and one was in Norway (Reference Norum, Olsen and Wist15). Two of the seven colorectal studies were set in the United Kingdom (Reference de Verteuil, Hernandez and Vale18;Reference Murray, Lourenco and de Verteuil23), two in the United States (Reference Jensen and Abcarian21;Reference Miller, Cantor and Peoples22), one in Canada (Reference Govindarajan, Naimark and Coburn19), one in New Zealand (Reference Hayes and Hansen20), and one in Spain (Reference Callejo, Guerra and Reza17). Three of the seven prostate studies were set in the United Kingdom (Reference Calvert, Morgan and Catto24;28;Reference Ramsay, Pickard and Robertson30), with the remaining four in Australia (Reference O’Malley, Jordan and O’Malley29), Denmark (Reference Hohwu, Borre and Ehlers25), Sweden (Reference Lyth, Andersson and Andren27), or the United States (Reference Jager, Severens and Thornbury26). The studies scan a broad publication range with six published before 2004 (older publications) and five published after 2010 (recent publications). Of particular interest, there were no publications for breast cancer after 2003.

Essential Basic Requirements

Defined Research Question

A basic requirement of any economic evaluation in health care is a clear and well defined decision problem which states both the exact clinical definition of the indication of interest, and detailed information on the interventions being compared (4). This enables reviewers and readers to assess if the analyses and results are relevant to their own settings and practice. The patient populations were reasonably well defined in the majority of studies (Figure 2), but there were exceptions. In the colorectal and prostate studies, the exact patient population or stage of cancer was either not clear or was not provided in some articles (Reference Callejo, Guerra and Reza17;Reference Hayes and Hansen20;Reference Jensen and Abcarian21;Reference Jager, Severens and Thornbury26).

Figure 2. Quality of included studies compared to the NICE reference case.

The interventions and comparators were generally clearly defined in all the studies. The three breast cancer studies compared breast conserving surgery with mastectomy with one also comparing reconstruction after surgery (Reference Malin, Keeler and Wang14). Six of the colorectal studies compared laparoscopic surgery with open resection (Reference Callejo, Guerra and Reza17;Reference de Verteuil, Hernandez and Vale18,Reference Hayes and Hansen20;Reference Jensen and Abcarian21;Reference Murray, Lourenco and de Verteuil23), one compared surgical resection with diagnostic or palliative surgery (Reference Miller, Cantor and Peoples22), and one compared emergency colonic stenting as a bridge to definitive surgery with emergency surgery (Reference Govindarajan, Naimark and Coburn19). Two of the prostate cancer studies compared radical prostatectomy with watchful waiting (Reference Lyth, Andersson and Andren27;28), one compared radical prostatectomy for those with non-diploid test with either radical prostatectomy for all or, watchful waiting (Reference Calvert, Morgan and Catto24), one compared radical prostatectomy based on magnetic resonance (MR) imaging with radical prostatectomy based on clinical staging (Reference Jager, Severens and Thornbury26), one compared laparoscopic remotely assisted radical prostatectomy with open radical prostatectomy (Reference O’Malley, Jordan and O’Malley29), one compared laparoscopic prostatectomy and robotic prostatectomy with open surgery (Reference Ramsay, Pickard and Robertson30), and the final study compared robot-assisted laparoscopic prostatectomy with retropubic radical prostatectomy (Reference Hohwu, Borre and Ehlers25).

Clinical Evidence

Another essential requirement relates to the clinical data used to quantify potential benefits of the interventions under evaluation. Evidence of a full literature search and, where appropriate, a synthesis of evidence on health effects informed by the systematic review, are mandatory to ensure the data used represent all the evidence available at the time of the study and the associated uncertainty surrounding the point estimates (2–4;6). The effectiveness rates used in the majority of the studies included in this study were not informed by systematic literature reviews or a synthesis of all available data. In breast cancer, none of the studies used effectiveness evidence informed by a systematic literature review or a synthesis of data. In colorectal cancer, five of the seven studies used the results of meta-analyses (Reference Callejo, Guerra and Reza17–Reference Hayes and Hansen20;Reference Murray, Lourenco and de Verteuil23). One study used event rates observed in a single hospital as per their research question (Reference Miller, Cantor and Peoples22), and one provided insufficient detail to determine the source of the effectiveness evidence used (Reference Jensen and Abcarian21). In prostate cancer, one study used the results of a meta-analysis (Reference Ramsay, Pickard and Robertson30), two used data from single randomized control trials (RCTs) with no evidence to suggest alternative evidence was used in sensitivity analyses (Reference Lyth, Andersson and Andren27;28), two used data from retrospective cohort studies (Reference Hohwu, Borre and Ehlers25;Reference O’Malley, Jordan and O’Malley29), one study was informed by a mixture of evidence from RCTs and observational data obtained from a systematic search supplemented by expert opinions (Reference Jager, Severens and Thornbury26), and one provided insufficient detail to determine the source (Reference Calvert, Morgan and Catto24).

Time Horizon

Many of the interventions will have a differential effect on recurrence rates and/or mortality risk hence the most appropriate time horizon for the costs and benefits would be a lifetime (4). However, for individuals with cancer, a lifetime horizon is potentially considerably shorter than that of the general population due to the reduced life expectancy associated with the condition. Over half (10/17) of the studies took account of longer-term costs and benefits associated with the interventions using a minimum of 10 year horizons (Reference Malin, Keeler and Wang14;Reference de Verteuil, Hernandez and Vale18–Reference Govindarajan, Naimark and Coburn19;Reference Murray, Lourenco and de Verteuil23;Reference Calvert, Morgan and Catto24;Reference Jager, Severens and Thornbury26–28). The remaining studies arguably underestimated both the costs and benefits of the interventions. One study used the average weighted survival duration (42 months for resected patients) (Reference Miller, Cantor and Peoples22). One restricted costs and QALYs to the first post-operative year (Reference Hohwu, Borre and Ehlers25), while one used 5-year retrospective data without extrapolating either costs or benefits (Reference Polsky, Mandelblatt and Weeks16). Two studies, which limited the perspectives to that of single hospitals, presented incremental benefits over short horizons without extrapolating to account for downstream benefits or costs. One restricted the time horizon to length of stay, estimating the incremental difference in recovery time (Reference Hayes and Hansen20), while the second study estimated the reduction in months (range, 3.79 to 8.67 months) of incontinence or erectile dysfunction (Reference O’Malley, Jordan and O’Malley29).

HRQoL

HRQoL data should be collected directly from patients with the conditions described in the economic model and preferably from patients in receipt of the interventions under evaluation (4;Reference Lopez-Bastida, Oliva and Antonanzas7). When it is not possible for patients with the conditions to provide these values, then proxy values may be obtained from their principle carer. However, values obtained from healthcare professionals are not appropriate and have been shown to differ substantially from those obtained from patients with the condition being valued (Reference Ashby, O’Hanlon and Buxton31).

The description of sources of HRQoL data was poor in many of the studies. While over a third of the studies (7/17) incorporated HRQoL data which were collected directly from individuals with the condition of interest, three studies used HRQoL data obtained from clinicians (Reference Malin, Keeler and Wang14;Reference Norum, Olsen and Wist15,Reference Miller, Cantor and Peoples22;Reference Murray, Lourenco and de Verteuil23), or nurses (Reference Norum, Olsen and Wist15;Reference Murray, Lourenco and de Verteuil23), and one study assumed HRQoL values based on the New Zealand social tariff (Reference Hayes and Hansen20). There was insufficient evidence to determine the source of the HRQoL data in six of the studies (Reference Norum, Olsen and Wist15;Reference Callejo, Guerra and Reza17;Reference Jensen and Abcarian21;Reference Calvert, Morgan and Catto24;Reference Jager, Severens and Thornbury26;Reference O’Malley, Jordan and O’Malley29).

Uncertainty

All economic evaluations should accurately characterize the decision uncertainty associated with the interventions under evaluation. At the very least, univariate sensitivity analyses should be conducted to illustrate both structural uncertainty (for example assumptions relating to the description of the clinical pathway, or the extrapolation of data) and uncertainty around key parameter values (for example if different sources for HRQoL or cost data are available) (4). Ideally, a full probabilistic sensitivity analysis using Monte Carlo simulations would be performed to demonstrate the uncertainty around the mean point estimates used (Reference Gold, Siegel and Russell8), and is encouraged by several decision-making bodies (3;4;6).

The majority of studies (13/17) performed univariate sensitivity analyses to illustrate the effect on results associated with variations in key parameter inputs. Many of the authors reported cost-effectiveness results were sensitive to changes in parameter values and particularly for recurrence rates, hernia rates, HRQoL values, and side-effects of treatments. The full range of uncertainty was poorly captured with less than one third (5/17) of the studies presenting the results of a full probabilistic sensitivity analysis (Reference Callejo, Guerra and Reza17;Reference Govindarajan, Naimark and Coburn19;Reference Murray, Lourenco and de Verteuil23;Reference Lyth, Andersson and Andren27;Reference Ramsay, Pickard and Robertson30). It is perhaps surprising that some of the newer publications did not report these results as this is now considered to be standard practice in health economics.

Preferred Requirements

Although the preferred outcome from economic evaluations varies depending on the reimbursement body, the cost per QALY, where the valuation of the HRQoL is based on public preferences, is now a pivotal requirement for submissions to NICE (4). The QALY describes both survival and HRQoL weights in a single metric and is the recommended gold standard to facilitate comparison across disparate conditions and equity in policy decisions (Reference Gold, Siegel and Russell8). Although all studies reported results in terms of a cost per QALY (or cost per quality adjusted month) (Reference Govindarajan, Naimark and Coburn19), few (4/17) used general population weights for all utilities (Reference de Verteuil, Hernandez and Vale18;Reference Hayes and Hansen20;Reference Calvert, Morgan and Catto24;Reference Hohwu, Borre and Ehlers25). One study mapped from HRQoL values onto a scale assuming 0 and 100 were equal to death and full health, respectively (Reference Norum, Olsen and Wist15), while two studies used patient values from the patients’ completed Euroqol VAS, again assuming 0 and 100 were equal to death and full health, respectively (Reference Polsky, Mandelblatt and Weeks16;Reference Lyth, Andersson and Andren27). Three studies elicited weights from clinicians using standard gamble (Reference Malin, Keeler and Wang14;Reference Miller, Cantor and Peoples22;Reference Murray, Lourenco and de Verteuil23), while a fourth used time-trade off to derive values from patients. Six studies provided insufficient detail to determine if general population weights were used (Reference Malin, Keeler and Wang14;Reference Norum, Olsen and Wist15;Reference Callejo, Guerra and Reza17;Reference Jensen and Abcarian21;Reference Jager, Severens and Thornbury26;Reference O’Malley, Jordan and O’Malley29).

UK Specific Requirements

Discount rates are both jurisdiction and time dependent and are used to ensure results reflect the present value of the cost and benefits accrued over the duration of the analyses (4). Not surprisingly, due to the differences in settings and the publication dates, only 3/17 of the studies used the current UK recommended discount rates (3.5 percent per annum) for costs and benefits (Reference Lyth, Andersson and Andren27;28;Reference Ramsay, Pickard and Robertson30). However, all studies that used horizons over one year did discount costs and effects.

While some reimbursement bodies restrict the perspective to the direct healthcare resource costs and savings directly attributed to a health service, others take a broader societal view and require indirect costs such as productivity losses or caregiver's time (Reference Johannesson, Jonsson and Jonsson5). Two-thirds (Reference Drummond, Sculpher and Torrance13/17) of the studies limited costs to direct healthcare costs (Reference Malin, Keeler and Wang14–Reference Polsky, Mandelblatt and Weeks16;Reference de Verteuil, Hernandez and Vale18–Reference Hayes and Hansen20;Reference Miller, Cantor and Peoples22–Reference Calvert, Morgan and Catto24;Reference Jager, Severens and Thornbury26–Reference Ramsay, Pickard and Robertson30), and two of these restricted costs to those directly incurred by hospitals (Reference Hayes and Hansen20–Reference Miller, Cantor and Peoples22). The remainder took a broader perspective and costs such as productivity losses (Reference Jensen and Abcarian21;Reference Hohwu, Borre and Ehlers25), and caregiver's time (Reference Jensen and Abcarian21), were included.

Applying the NICE threshold value of GBP 20–30 k (EUR 25–37.4 k) per QALY, irrespective of the time of publication, the setting or adherence to the NICE methods guide, breast conserving surgery plus axillary mode dissection compared with modified radical mastectomy would be considered cost-effective while breast conservation surgery with radiation therapy compared with mastectomy would not be considered cost-effective (Reference Malin, Keeler and Wang14). There are multiple reasons for this including cosmesis, patient preferences, and the added cost of reconstruction which is used after mastectomy in over 20 percent of all cases and recurrence rates requiring further interventions (32;Reference Hazebroek33). It should be noted that the latter comparator is no longer appropriate in the current clinical context as it is now current practice, with rare exceptions, for radiation to be administered after conservation therapy.

Applying the GBP 20–30 k (EUR 25–37.4 k) per QALY threshold to the studies on colorectal cancer, the probability that laparoscopic surgery would be considered cost-effective compared with open surgery ranged between 30 percent (Reference de Verteuil, Hernandez and Vale18) and 40 percent (Reference Murray, Lourenco and de Verteuil23) and univariate analyses produced results where laparoscopic surgery either dominated (higher costs and lower benefits) open surgery or was dominated (lower costs and larger benefits) by open surgery suggesting considerable uncertainty (Reference de Verteuil, Hernandez and Vale18;Reference Hayes and Hansen20;Reference Murray, Lourenco and de Verteuil23). The evaluation of emergency colonic stenting as a bridge to definitive surgery dominated emergency surgery in individuals with emergent, malignant left colonic obstruction (Reference Govindarajan, Naimark and Coburn19), and non-surgical treatment dominated both surgical resection and diagnostic/palliative surgery in individuals with recurrent colorectal cancer (Reference Miller, Cantor and Peoples22).

Applying the GBP 20–30 k (EUR 25–37.4 k) per QALY threshold to the studies for prostate cancer, radical prostatectomy treatment is unlikely to be considered cost-effective compared with watchful waiting due to considerable uncertainty around the incremental cost-effectiveness ratio (ICER). Again, the cost per QALY results ranged from below GBP 20 k (EUR 25 k) per QALY (except for men aged 75 and over) when the decision was based on patient survival only (Reference Lyth, Andersson and Andren27), to dominated when side effects of the surgical technique were taken into account (28). While a marker-based (DNA-ploidy) treatment selection policy would be considered cost-effective compared with watchful waiting using these thresholds, both policies dominated the “prostatectomy for all” option (Reference Calvert, Morgan and Catto24). However, the ICER was highly sensitive to factors such as sensitivity and specificity (Reference Calvert, Morgan and Catto24), and in one study the ICER ranged from approximately GBP 2 k (EUR 2.5 k) per QALY to GBP 84 k (EUR 105 k) per QALY (Reference Lyth, Andersson and Andren27). Robotic prostatectomy was likely to be cost-effective at GBP 30 k (EUR 37.4 k) per QALY in comparison to laparoscopic prostatectomy when the number of procedures performed per year with the robotic system was over 150 (Reference Ramsay, Pickard and Robertson30), whereas radical prostatectomy was cost-effective at GBP 20 k (EUR 25 k) per QALY threshold compared with open surgery (Reference O’Malley, Jordan and O’Malley29).

DISCUSSION

The limited numbers of studies included in the reviews highlight the scarcity of economic evidence for surgical procedures in these indications. The searches identified just three studies for breast cancer surgery, all were published before 2003 (Reference Malin, Keeler and Wang14–Reference Polsky, Mandelblatt and Weeks16). Of the seven colorectal cancer studies, only two were published after 2010;(Reference Callejo, Guerra and Reza17,Reference Jensen and Abcarian21) and of the seven prostate cancer studies, three were published after 2010 (Reference Hohwu, Borre and Ehlers25;Reference Lyth, Andersson and Andren27;Reference Ramsay, Pickard and Robertson30).

Although some UK specific requirements were included in the methodological checklist (Figure 1), the vast majority of criteria used to assess the scientific rigor of the studies are fundamental requirements for any economic evaluation in health care and are, therefore, generalizable to other settings and decision-making bodies. Scientific rigor is improving in economic evaluations generally, and older publications may not conform to current practice. The studies included in the reviews cover a broad publication range with six (6/17) published before 2004 (older publications) and five (5/17) published after 2010 (recent publications). Assessing the studies chronologically (Figure 2), there is a marked difference in the proportion that satisfies the essential criteria. For example, although only 5/17 of all the studies reported results of full probabilistic sensitivity analyses, none (0/6) were older publications and three (3/5) were recent publications. Similarly, although only 6/17 used a systematic review and meta-analysis to inform the clinical evidence used, none (0/6) were older publications and three (3/5) were recent publications. Similar rates were observed for the use of HRQoL data reported directly by the patients or carers with none (0/6) of the older publications using these data compared with two (2/5) of the recent publications. While the majority of the older evaluations were not undertaken using the current requirements for sources of evidence (clinical data informed by a systematic literature review/synthesis, HRQoL data from patients with the condition) and uncertainty (probabilistic sensitivity analysis) the recent publications would be expected to conform to these basic standards.

There were several essential requirements which were not satisfied in many of the publications. First, much of the clinical evidence was not supported by a systematic review of the literature or a synthesis of the relevant evidence, and very few authors presented the results of a full probabilistic sensitivity analysis. Relying on the effectiveness evidence from a single clinical study or hospital setting in isolation, with no reference to the relevant published evidence, can be misleading as the effectiveness evidence from a single study may differ substantially from other evidence in the area. This may hinder and even mislead policy decision makers in some cases. While it is not always possible to conduct a full systematic literature review and evidence synthesis within the resource allocations of a project, it is generally possible to compare the effectiveness evidence used with the literature. If the data used are comparable to the published literature, this will increase confidence in the results presented. If the data differ substantially from the published literature, then the results obtained using the different values should be compared (preferably using a probabilistic sensitivity analysis) to determine if the evaluation is sensitive to variations in these parameters. Exploring the uncertainty around the point estimates is particularly relevant if the evidence comes from a novel intervention where there is no existing evidence.

The second and closely related area where many of the studies failed to meet requirements was in the choice of time horizon used to sum the benefits and costs accrued from the alternative interventions. It is widely accepted that if an intervention has a differential effect on survival, the most appropriate horizon would be a lifetime. As the patients in the evaluations had cancer, then a shorter horizon (say 10 to 20 years) may be considered sufficient. However, many of the studies restricted the horizon to less than 5 years, or did not provide details of the length of horizon used. It is unlikely that a time horizon of 1 year will accurately reflect the full benefits of an intervention in patients with cancer. While some analysts may be reluctant to rely on assumptions to extrapolate beyond the duration of the clinical evidence, the effect of longer horizons should be explored in sensitivity analyses to determine what if any difference this makes to the results generated.

The final area where many of the studies failed to satisfy the essential requirements relates to the HRQoL data used within the analyses. Over half of the studies did not use data collected from patients (or their carers) with the particular conditions. Reporting standards for both the source of evidence and the methodology used to collect these data were particularly low. Many policy makers require that the preference-weights used to determine QALYs are obtained from the general population. There is an argument that in some health conditions, generic measures such as the EQ-5D may not be sensitive to small changes in dimensions of health or features of a particular condition. In these cases, a condition specific measure of health, such as the Functional Assessment of Cancer Therapy (FACT) for patients with cancer, may be more relevant (Reference Cella, Tulsky and Gray34). At the moment the FACT measure does not have a corresponding preference-weighted index which could be used to determine the QALYs in an economic evaluation. While several of the studies (4/17) conducted analyses to elicit preference weights for defined health states in the evaluations, in many of the studies it was impossible to determine what preference-weights, if any, had been used.

To our knowledge, little or no research has been conducted to date which has compared the equivalence of reporting standards for economic evaluation for surgical procedures and those for pharmaceutical products. Comparing methods and standards in economic evaluation of medical devices and drugs, authors highlighted key challenges for evaluating medical devices. These may generalize to surgical interventions including the practical difficulties in conducting randomized controlled trials (Reference Drummond, Griffin and Tarricone35–Reference Sorenson, Tarricone and Siebert36), the dependence of outcome on the surgeon's expertise (Reference Drummond, Griffin and Tarricone35). and the wider organizational impacts associated with introducing new devices which may differ by location (Reference Drummond, Griffin and Tarricone35–Reference Sorenson, Tarricone and Siebert36). In some countries (for example Australia and Canada), HTAs and cost-effectiveness analyses focus on pharmaceuticals rather than the full range of health-care technologies, which may go some way to explaining why there is a dearth of evidence on surgical interventions.

Given the scarcity of literature and relatively dated clinical evidence for some of the indications and procedures, it is reasonable to question what economic evidence is used to support clinical guidelines in these areas. In the United Kingdom, the number of HTA and NICE guidance issued can be used as an indicator of the priority given to cancer health care (37;38). Since its inception, NICE has issued guidance relating to twenty-eight (11;13) pharmaceutical interventions and seven (13) intervention procedures for breast (colorectal, prostate) cancer, and there have been forty-five (25;32) HTA submissions.

The most recent clinical guideline for early (and locally advanced) breast cancer has very limited de-novo evidence in terms of alternative surgical techniques (39;40), The cost-effectiveness literature was not reviewed under the following topics: “Surgery to the breast” (including Paget's disease), “evaluation and management of a positive lymph node” and “breast reconstruction” as “the Guideline Development Group (GDG) did not consider this topic as a health economic priority”. The only independent evaluation conducted to inform the GDG's recommendations examined alternative strategies used for staging of the cancer before surgery.

For colorectal cancer, the recommendations for laparoscopic surgery in the current UK clinical guidelines (41) were taken directly from NICE technology appraisal guidance on “Laparoscopic surgery for colorectal cancer TA105” (42), which used cost-effectiveness evidence reported in an article included in the current review (Reference Murray, Lourenco and de Verteuil23). Although this evidence suggested laparoscopic surgery was dominated by open surgery, the laparoscopic surgery was recommended over open surgery as the NICE committee informed by TA105 were “persuaded there were important differences” in length of stay and return to normal activities for laparoscopic surgery, and even though costs were greater and there was little direct evidence of quality of life benefits, “it was likely that such benefits exist . . . . . . . and these would be sufficient to make laparoscopic procedures cost-effective”. For prostate cancer, the existing cost-effectiveness evidence was considered to be limited as none included evidence from a recently published RCT, hence, a de novo model (included in this review) was constructed to incorporate these data (see 28).

Authors of a recent report expressed concern with regard to the future of academic surgery in the United Kingdom and highlighted the need for additional research activities in cancer surgery and in particular multidisciplinary research (40). Surgery is often associated with the greatest potential benefit (cure/not cure) and potential harm (risk of morbidity/mortality) when comparing modalities in the cancer treatment pathway (40). It is, therefore, likely that new surgical devices and changes in techniques and timing will be cost-effective and even potentially cost-saving compared with pharmaceutical interventions. However, such a proposition needs to be evidence-based and the scarcity of economic evidence on surgical techniques suggests that this is one area where the surgical community and policy-decision makers could benefit from a wider multi-disciplinary approach to research.

CONCLUSION

There is a dearth of recent robust evidence describing the cost-effectiveness of surgical interventions in these indications. Many of the recent publications did not satisfy essential requirements such as exploring the full uncertainty associated with the evidence or using clinical evidence informed by a systematic review and synthesis. Policy decisions for surgical techniques in these indications do not appear to be informed by robust economic evidence in all cases and due to the ratio of potential benefit and harm associated with cancer surgery there is an urgent need to increase economic activities in this area.

CONTACT INFORMATION

Roberta Ara, MSc(hons), ScHARR, Senior Research Fellow, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA

Hasan Basarir, PhD, ScHARR, Research Fellow, The University of Sheffield, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA

Anju D. Keetharuth, Phd, ScHARR, Research Associate, The University of Sheffield, School of Health and Related Research (ScHARR), Regent Court, 30 Regent Street, Sheffield, S1 4DA

Marco Barbieri, MSc(hons), Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK

Helen L.A. Weatherly, MSc(hons), Senior Research Fellow, Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK

Mark J.S. Sculpher, Phd, Professor of Health Economics, Deputy Director of the Policy Research Unit in Economic Evaluation of Health and Care Interventions (EEPRU), Centre for Health Economics (CHE), University of York, Heslington, York YO10 5DD, UK

Hashim Ahmed, FRCS(Urol), PhD, BM, BCh (Oxon), BA(Hons), MRC Clinician Scientist and Honorary Consultant Urological Surgeon, University College Hospital, 235 Euston Road, London, NW1 2BU

Steven Brown, MBChB FRCS MD, Consultant Colorectal Surgeon, Sheffield Teaching Hospitals, NHS Foundation Trust, Northern General Hospital, Herries Road, Sheffield, S5 7AU

CONFLICTS OF INTEREST

The authors have no conflicts of interest to declare.

References

1. Sorenson, C, Drummond, MF, Kanavos, P. Ensuring value for money in health care: The role of health technology assessment in the European Union. Observatory Studies Report N.11. Copenhagen: European Observatory on Health Systems and Policies; 2008.Google Scholar

2. Pharmaceutical Benefits Advisory Committee. Guidelines for Preparing Submissions to the Pharmaceutical Benefits Advisory Committee (version 4.3). Australian Government: Department of Health and Ageing; 2008.Google Scholar

3. Dutch Healthcare Insurance Board. Guidelines for pharmacoeconomic research (updated version. Valid as from April 1st 2006). Dieman: Dutch Health Care Insurance Board; 2006.Google Scholar

4. NICE. Guide to methods of technology appraisal. London UK; 2013. [updated January 1, 2013]. http://publications.nice.org.uk/pmg9 (accessed March 31, 2014).Google Scholar

5. Johannesson, M, Jonsson, B, Jonsson, L, et al. Why should economic evaluation medical innovations have a societal perspective. No 51. London: Office for Health Economic Briefings; 2009.CrossRef Google Scholar

6. Canadian Agency for Drugs and Technologies in Health. Guidelines for the economic evaluation of health technologies: Canada, 3rd ed. Ottawa: Canadian Agency for Drugs and Technologies in Health; 2006.Google Scholar

7. Lopez-Bastida, J, Oliva, J, Antonanzas, F, et al. Spanish recommendations on economic evaluation of health technologies. Eur J Health Econ. 2010;11:513–520.CrossRef Google Scholar PubMed

8. Gold, MR, Siegel, JE, Russell, LB, et al. Cost-effectiveness in health and medicine. New York: Oxford University; 1996.CrossRef Google Scholar PubMed

9. National Audit Office. 2010. [Internet] [updated January 1, 2013]. http://www.nao.org.uk (accessed March 6, 2013).Google Scholar

10. Harker, R. House of Commons (2012). NHS funding and expenditure. Standard Note SN/SG/724. www.nhshistory.net/parlymoney.pdf (accessed February 20, 2013).Google Scholar

11. Leal, J. 2012. Pharmatimes online. [Internet] [updated January 1, 2013]. http://www.pharmatimes.com/article/12-11-08/Cancer_costing_UK_%C2%A315_8_billion_a_year.aspx (accessed March 6, 2013).Google Scholar

12. Ara, R, Barbieri, M, Weatherly, H, et al. A rapid review of the cost-effectiveness of interventions across the cancer patient pathways: Radiotherapy and surgery. Report for the Department of Health. www.eepru.org.uk (accessed March 6, 2014).Google Scholar

13. Drummond, MF, Sculpher, MJ, Torrance, GW, et al. Methods for the economic evaluation of health care programs. 3rd ed. Oxford: Oxford University Press; 2005.CrossRef Google Scholar

14. Malin, JL, Keeler, E, Wang, C, et al. Using cost-effectiveness analysis to define a breast cancer benefits package for the uninsured. Breast Cancer Res Treat. 2002;74:143–153.Google Scholar

15. Norum, J, Olsen, JA, Wist, EA. Lumpectomy or mastectomy? Is breast conserving surgery too expensive? Breast Cancer Res Treat. 1997;45:7–14.Google Scholar

16. Polsky, D, Mandelblatt, JS, Weeks, JC, et al. Economic evaluation of breast cancer treatment: Considering the value of patient choice. J Clin Oncol. 2003; 21:1139–1146.Google Scholar

17. Callejo, D, Guerra, M, Reza, M, et al. Cost-utility analysis of laparoscopic versus open surgery for colorectal cancer. Value Health. 2011;14:A263.CrossRef Google Scholar

18. de Verteuil, RM, Hernandez, RA, Vale, L, Aberdeen Health Technology Assessment Group. Economic evaluation of laparoscopic surgery for colorectal cancer. Int J Technol Assess Health Care. 2007;23:464–472.Google Scholar

19. Govindarajan, A, Naimark, D, Coburn, NG, et al. Use of colonic stents in emergent malignant left colonic obstruction: A Markov chain Monte Carlo decision analysis. Dis Colon Rectum. 2007;50:1811–1824.Google Scholar

20. Hayes, JL, Hansen, P. Is laparoscopic colectomy for cancer cost-effective relative to open colectomy? Anz J Surg. 2007;77:782–786.Google Scholar

21. Jensen, C, Abcarian, H. Cost-effectiveness of laparoscopic versus open resection for colon and rectal cancer. Dis Colon Rectum. 2011;54:E186–E187.Google Scholar

22. Miller, AR, Cantor, SB, Peoples, GE, et al. Quality of life and cost effectiveness analysis of therapy for locally recurrent rectal cancer. Dis Colon Rectum. 2000;43:1695–1701.Google Scholar

23. Murray, A, Lourenco, T, de Verteuil, R, et al. Clinical effectiveness and cost-effectiveness of laparoscopic surgery for colorectal cancer: Systematic reviews and economic evaluation. Health Technol Assess. 2006;10:1–141.Google Scholar

24. Calvert, NW, Morgan, AB, Catto, JW, et al. Effectiveness and cost-effectiveness of prognostic markers in prostate cancer. Br J Cancer. 2003;88:31–35.Google Scholar

25. Hohwu, L, Borre, M, Ehlers, L, et al. A short-term cost-effectiveness study comparing robot-assisted laparoscopic and open retropubic radical prostatectomy. J Med Econ. 2011;14:403–409.Google Scholar

26. Jager, GJ, Severens, JL, Thornbury, JR, et al. Prostate cancer staging: Should MR imaging be used?–A decision analytic approach. Radiology. 2000;215:445–451.Google Scholar

27. Lyth, J, Andersson, SO, Andren, O, et al. A decision support model for cost-effectiveness of radical prostatectomy in localized prostate cancer. Scan J Urol Nephrol. 2012;46:19–25.Google Scholar

28. The Guideline Development Group. Prostate cancer: Diagnosis and treatment. Clinical Guideline CG58 2008. http://www.nice.org.uk/CG58 (accessed March 6, 2014).Google Scholar

29. O’Malley, SP, Jordan, E, O’Malley, SP, et al. Review of a decision by the Medical Services Advisory Committee based on health technology assessment of an emerging technology: The case for remotely assisted radical prostatectomy. Int J Technol Assess Health Care. 2007;23:286–91.Google Scholar

30. Ramsay, C, Pickard, R, Robertson, C, et al. Systematic review and economic modelling of the relative clinical benefit and cost-effectiveness of laparoscopic surgery and robotic surgery for removal of the prostate in men with localised prostate cancer. Health Technol Assess. 2012;16:1–313.Google Scholar

31. Ashby, J, O’Hanlon, M, Buxton, MJ. The time trade-off technique: How do the valuations of breast cancer patients compare to those of other groups? Qual Life Res. 1994:3;257–265.Google Scholar

32. Clinical Outcomes of Surgical Therapy Group. COST trial: Clinical outcomes of surgical therapy group. A comparison of laparoscopically assisted and open colectomy for colon cancer. N Engl J Med. 2004;350:2050–2059.Google Scholar

33. Hazebroek, EJ; The COLOR Study Group,. COLOR: A randomized clinical trial comparing laparascopic and open resection for colon cancer. Surg Endosc. 2002;16:949–953.Google Scholar

34. Cella, DF, Tulsky, DS, Gray, G, et al. The functional assessment of cancer therapy scale: Development and validation of the general measure. J Clin Oncol. 1993;11:570–579.Google Scholar

35. Drummond, M, Griffin, A, Tarricone, R. Economic evaluation of medical devices and drugs - same or different. Value Health. 2009;12:402–403.Google Scholar

36. Sorenson, C, Tarricone, R, Siebert, M, et al. Applying health economics for policy decision making: Do devices differ from drugs? Eurospace. 2011;13 (Suppl 2):ii54–ii58.Google Scholar

37. nice.org.uk [Internet] [updated February 1, 2013]. http://www.nice.org.uk (accessed February 13, 2013).Google Scholar

38. hta.ac.uk [Internet] [updated February 1, 2013]. http://www.hta.ac.uk (accessed February 13, 2013).Google Scholar

39. The Guideline Development Group. Early and locally advanced breast cancer: Diagnosis and treatment. Clinical Guideline CG80 2009 London 2009. http://www.guidancenice.org.uk/CG80 (accessed February 13, 2013).Google Scholar

40. NCRI Secretariat. National Cancer Research Institute. Challenges and opportunities in surgical cancer research in the UK. http://www.ncri.org.uk (accessed April 2, 2013).Google Scholar

41. The Guideline Development Group. Colorectal cancer: The diagnosis and management of colorectal cancer. Clinical Guideline CG131 2011 London 2011. http://www.nice.org.uk/CG131 (accessed February 13, 2013).Google Scholar

42. NICE. Laparoscopic surgery for the treatment of colorectal cancer (review). NICE technology appraisal guidance 105 (2006). London 2006. http//www.nice.org.uk/TA105 (accessed February 13, 2013).Google Scholar

Table 1. Characteristics of the Studies Included

Table 2. Methods and Study Conclusions

Figure 1. Flow diagram for study inclusion.

Figure 2. Quality of included studies compared to the NICE reference case.

Article contents

ARE POLICY DECISIONS ON SURGICAL PROCEDURES INFORMED BY ROBUST ECONOMIC EVIDENCE? A SYSTEMATIC REVIEW

Abstract

Keywords

METHODS

Search Strategy

Study Selection Strategy

RESULTS

Essential Basic Requirements

Defined Research Question

Clinical Evidence

Time Horizon

HRQoL

Uncertainty

Preferred Requirements

UK Specific Requirements

DISCUSSION

CONCLUSION

CONTACT INFORMATION

CONFLICTS OF INTEREST

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests