Hostname: page-component-5cf477f64f-rdph2 Total loading time: 0 Render date: 2025-03-31T21:12:54.979Z Has data issue: false hasContentIssue false

Calibration of transition probabilities to model survival of adjuvant trastuzumab for early breast cancer in Indonesia

Published online by Cambridge University Press:  26 March 2025

Arie Rahadi*
Affiliation:
Management Sciences for Health, Arlington, VA, USA
Rizki Tsalatshita Khair Mahardya
Affiliation:
Center for Health Financing Policy and Insurance Management, Gadjah Mada University, Sleman, Yogyakarta, Indonesia
Putri Listiani
Affiliation:
Center for Health Financing Policy and Insurance Management, Gadjah Mada University, Sleman, Yogyakarta, Indonesia
Eva Herlinawaty
Affiliation:
Center for Health Financing and Decentralization Policy, Ministry of Health Republic of Indonesia, Central Jakarta, Jakarta, Indonesia
Ryan Rachmad Nugraha
Affiliation:
Management Sciences for Health, Arlington, VA, USA
Dani Ramdhani Budiman
Affiliation:
Center for Health Financing and Decentralization Policy, Ministry of Health Republic of Indonesia, Central Jakarta, Jakarta, Indonesia
Christian Suharlim
Affiliation:
Management Sciences for Health, Arlington, VA, USA
*
Corresponding author: Arie Rahadi; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Objectives

Cost-effectiveness models fully informed by real-world epidemiological parameters yield the best results, but they are costly to obtain. Model calibration using real-world data/evidence (RWD/E) on routine health indicators can provide an alternative to improve the validity and acceptability of the results. We calibrated the transition probabilities of the reference chemotherapy treatment using RWE on patient overall survival (OS) to model the survival benefit of adjuvant trastuzumab in Indonesia.

Methods

A Markov model comprising four health states was initially parameterized using the reference-treatment transition probabilities, obtained from published international evidence. We then calibrated these probabilities, targeting a 2-year OS of 86.11 percent from the RWE sourced from hospital registries. We compared projected OS duration and life-years gained (LYG) before and after calibration for the Nelder–Mead, Bound Optimization BY Quadratic Approximation, and generalized reduced gradient (GRG) nonlinear optimization methods.

Results

The pre-calibrated transition probabilities overestimated the 2-year OS (92.25 percent). GRG nonlinear performed best and had the smallest difference with the RWD/E OS. After calibration, the projected OS duration was significantly lower than their pre-calibrated estimates across all optimization methods for both standard chemotherapy (~7.50 vs. 11.00 years) and adjuvant trastuzumab (~9.50 vs. 12.94 years). LYG measures were, however, similar (~2 years) for the pre-calibrated and calibrated models.

Conclusions

RWD/E calibration resulted in realistically lower survival estimates. Despite the little difference in LYG, calibration is useful to adapt external evidence commonly used to derive transition probabilities to the policy context, thereby enhancing the validity and acceptability of the modeling results.

Type
Assessment
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Introduction

Breast cancer is the most common malignancy in Indonesia, with an estimated annual incidence and mortality of 44.0 and 15.3 per 100,000, respectively (1). Reports suggest that 40–70 percent of women with breast cancer in Indonesia had late clinical presentation and diagnosis (Reference Gautama2;Reference Hutajulu, Prabandari, Bintoro, Wiranata, Widiastuti and Suryani3), or approximately three times as high in proportions compared to high-income countries (Reference Ruddy, Gelber, Tamimi, Schapira, Come and Meyer4). Claims of National Health Insurance (NHI) for cancer rank the second after cardiovascular disease in the catastrophic illness category (5), and these costs are projected to grow as the country is in the cusp of an epidemiological transition in health burden to noncommunicable diseases (Reference Bloom, Chen, McGovern, Prettner, Candeias and Bernaert6;Reference Pinto, Masaki and Harimurti7).

Trastuzumab is a recombinant, humanized, monoclonal antibody with a demonstrated efficacy as an adjuvant treatment to standard chemotherapy for early breast cancer, with durable treatment effects observed for up to 10 years (Reference Spielmann, Roché, Delozier, Canon, Romieu and Bourgeois8Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10). The treatment suppresses abnormal amplification of the human epidermal growth factor receptor 2 (HER2) gene or overexpression of its protein, which is linked to faster disease progression, higher rates of relapse, and mortality (Reference Slamon, Godolphin, Jones, Holt, Wong and Keith11). The landmark joint analysis of the North Central Cancer Treatment Group and the National Surgical Adjuvant Breast and Bowel Project trials demonstrated significant improvements in disease-free survival (DFS) and overall survival (OS), with over one-third reduction in disease progression (hazard ratio (HR) = 0.63; 95 percent confidence interval (CI) = 0.54, 0.73; p < 0.001) and mortality (HR = 0.60; CI = 0.53, 0.68; p < 0.001) among patients who received 12-month adjuvant trastuzumab as compared to who received chemotherapy alone (Reference Perez, Romond, Suman, Jeong, Sledge and Geyer9). An effect size of similar magnitude and durability has been confirmed in a subsequent study (Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10).

Despite this favorable efficacy profile, the high cost of trastuzumab, at an estimated $20,000 in global average for a 12-month treatment course (12), is unaffordable for many health systems and is likely to exceed the eligible ceiling for public health funding to be included in the NHI coverage (Reference Bertram, Lauer, De Joncheere, Edejer, Hutubessy and Kieny13). Published economic evaluations in low- and middle-income countries (LMICs) report that 12-month adjuvant trastuzumab was cost-effective in China (Reference Chen, Jiang, Shao, Sun and Shen14) or Iran (Reference Ansaripour, Uyl-de Groot and Redekop15), but not in the Philippines (Reference Genuino, Chaikledkaew, Guerrero, Reungwetwattana and Thakkinstian16) or Latin American countries (Reference Buendía, Vallejos and Pichón-Rivière17;Reference Botelho, Estevez-Diz and Campolina18) by standard cost-effectiveness eligibility of being not in excess of three times per capita gross domestic product in its incremental cost per health gain. A previous economic evaluation of trastuzumab for metastatic breast cancer in Indonesia reported an incremental cost-effectiveness ratio (ICER) of USD 17,307 per gain in quality-adjusted life-years (Reference Kristin, Endarti, Khoe, Pratiwi, Pinzon and Febrinasari19). Although this figure exceeded three times per capita gross domestic product, trastuzumab for this indication received approval under NHI coverage in 2023 (20).

One critical input to a cost-effectiveness evaluation for health technology assessment (HTA) is a set of transition probabilities. In state-transition models that rely on a Markov process, a synthetic cohort moves (or “transitions”) from one health state to another with each passage of time (“cycle”) according to pre-defined probabilities until the entire cohort dies (the ultimate health state), from which no further transition is possible, or until the total number of cycles in units of time is exhausted (Reference Siebert, Alagoz, Bayoumi, Jahn, Owens and Cohen21). In a comparison of two (or more) treatment groups, transition probabilities in the reference, standard of care (SoC) group, are of paramount importance from which the transition probabilities for the other group(s) are calculated by multiplication with the corresponding relative treatment effects in HR or risk ratios. Real-world evidence (RWE), obtained from multiyear longitudinal real-world data (RWD) on local cohorts for whom coverage or pricing decisions are to be made, is best positioned to inform transition probabilities for its high contextualization with the policy decision and its representation of typical consumers of health technologies in the real-world setting (Reference Skovlund, Leufkens and Smyth22;Reference Katkade, Sanders and Zou23). In many settings, such RWE may be unavailable or only partially available, for example, in shorter follow-up or incomplete (missing) observations, or for only a few segments of the patient population (Reference Sievers, Joos and Hiligsmann24); and utilizing external evidence from multiple studies is a practical recourse to populate various transition probabilities and other model parameters.

In this study, we apply model calibration as a way to contextualize transition probabilities from external evidence for our Markov model of adjuvant trastuzumab. Model calibration compares the model projections with a target quantity from the observed RWE and iteratively calculates a new set of values for transition probabilities within their plausible range to provide the best fit to the target (Reference Vanni, Karnon, Madan, White, Edmunds and Foss25). The best model fit seeks to minimize the difference between the projected outcomes and those observed in RWE at a specified time point (e.g., 5-year survival), to which the initial transition probabilities are calibrated and the Markov model is adjusted to the new (calibrated) values (Reference Vanni, Karnon, Madan, White, Edmunds and Foss25). We feel that model calibration presents a valuable avenue to advance RWD/E utilization in HTA for its intuition, availability in standard decision-modeling packages, and non-exhaustive data requirements, all of which are features that appeal to jurisdictions with significant challenges in health data infrastructure, including ours.

The objective of the current study was therefore to: (a) calibrate the transition probabilities of a Markov model of adjuvant trastuzumab versus standard chemotherapy in HER2-positive, early breast cancer and (b) compare the survival outcomes on pre-calibrated and calibrated transition probabilities. The results of our model calibration will provide adjustments to the transition probability parameters for use in a cost-effectiveness evaluation of adjuvant trastuzumab for early breast cancer in Indonesia.

Methods

This study provides a groundwork for a cost-effectiveness analysis of adjuvant trastuzumab, which at the time of writing was recruiting participants for data collection on patient utilities (EQ-5D-5L) and out-of-pocket outlays. The SoC treatment comprises approved chemotherapies in the oncologic practice of Indonesia, typically given every 21 days for up to eight cycles. Trastuzumab 8 mg/kg loading dose and 6 mg/kg every 21 days for 18 cycles is administered after the first four cycles of chemotherapy are completed for a total treatment duration of 52 weeks.

Model structure

We developed a Markov cohort model comprising four health states: DFS, Locoregional Recurrence, Distant Metastasis, and Death for SoC and adjuvant trastuzumab (Figure 1). The cohort assumed a representative female patient aged 50 years with HER2-positive, early stage breast cancer (stages IA–IIIA) who initiated treatment and were disease-free at the start of analysis. After transitioning from DFS, a return to a less severe health state was not possible. The model was evaluated over a 50-year time horizon in annual cycles. Markov models were built using TreeAge Pro 2023 (Williamstown, MA) and Microsoft Excel®.

Figure 1. Model structure.

Transition probabilities and treatment effects

The initial transition probabilities in the SoC group were informed by literature (Reference Perez, Romond, Suman, Jeong, Sledge and Geyer9;Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10;Reference Botelho, Estevez-Diz and Campolina18;Reference Kristin, Endarti, Khoe, Pratiwi, Pinzon and Febrinasari19), selected to represent a plausible range and likely values (Table 1). Because 20 percent of the SoC participants in the source trial received adjuvant trastuzumab after completion of their primary chemotherapies (Reference Perez, Romond, Suman, Jeong, Sledge and Geyer9), we applied adjustments in health state transitions from DFS within the SoC group using the rate difference by crossover status from another trial (Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10). The mortality risk of cardiotoxicity was built into the transition probability from DFS to death for each treatment group from a 10-year cumulative incidence (1.1 percent and 5.4 percent for the SoC and trastuzumab groups, respectively) from the same trial (Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10).

Table 1. Initial transition probabilities of the chemotherapy group and treatment effects of adjuvant trastuzumab

DFS: disease-free survival; HR: hazard ratio; OS: overall survival.

a See Supplementary Material 1 for details on varying treatment effects by follow-up duration for DFS.

We pooled the treatment effects of adjuvant trastuzumab for DFS (HRpooled = 0.67; CI = 0.52, 0.87; P = 0.016; I2 = 0.633) and OS (HRpooled = 0.67; CI = 0.57, 0.79; P = 0.004; I2 = 0.203) from four major trials (Reference Perez, Romond, Suman, Jeong, Sledge and Geyer9;Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10;Reference Joensuu, Bono, Kataja, Alanko, Kokko and Asola27;Reference Slamon, Eiermann, Robert, Giermek, Martin and Jasiowka28). These pooled HRs were applied to the transition probabilities of SoC for health state transitions from DFS to obtain the corresponding transition probabilities of the trastuzumab treatment group. Therefore, adjuvant trastuzumab was assumed to have no effect on prognosis once the cancer recurred locoregionally or distantly. Because the magnitude of HRs for DFS varied by trial follow-up duration, we applied time-varying treatment effects specific to the first five cycles (years), corresponding to the minimum follow-up duration in the trials, and to each subsequent cycle until 11 years. We assumed that treatment effects would last for 11 years as per the longest follow-up during which between-group differences persisted (Reference Cameron, Piccart-Gebhart, Gelber, Procter, Goldhirsch and de Azambuja10). From year 12 onward, mortality and all other transitions from DFS for adjuvant trastuzumab referenced the SoC rates, as defined by a time-exponential function of the baseline transition probabilities. Supplementary Material 1 provides information on study eligibility and the meta-analysis methodology used to pool study outcomes.

Outcome measures

We computed OS duration, measured in the number of years spent in health states other than death, and the life-years gained (LYG) from adjuvant trastuzumab, defined as the difference in OS duration between the treatment groups. Outcomes were projected over the analysis time horizon of 50 years.

RWE of OS

We abstracted data from cancer registries at two major hospitals in Jakarta and Yogyakarta on 1,007 female patients (51 years in average age) diagnosed with stages IA–IIIA breast cancer during 2016–2019, who received chemotherapy without trastuzumab, with follow-up through December 2022 (median: 24 months). The Jakarta hospital is a national-referent hospital for cancer care and has the largest volume of cancer patients in the country. The Yogyakarta hospital is a regional-referent hospital for cancer care. The ethics committees of Gadjah Mada University and the respective hospitals approved the use of the registry data for the study. One hundred and twenty-two patients died (12 percent) over 84 months, with most deaths occurring within the first 24 months (n = 97), including half (n = 60) during the second year alone. Over 60 percent of the cohort were censored at the last visit date and no longer contributed survival data beyond 36 months, thereby likely misrepresenting the low number of deaths in the later period of the registry. Given this limitation, we computed RWE survival in the chemotherapy group at 2 years of follow-up, which was 86.11 percent (CI = 83.21, 88.54). This time point corresponds to the average follow-up period and falls within the period of peak mortality (Reference Jatoi, Anderson, Jeong and Redmond29;Reference Jayasekara, MacInnis, Chamberlain, Dite, Leoce and Dowty30). We set this survival proportion as the target OS to calibrate the initial values of transition probabilities for the SoC group. Supplementary Material 2 shows the Kaplan–Meier curve of RWE OS.

Calibration techniques

We compared the performances of three optimization methods in calibration: (a) Nelder–Mead simplex; (b) Bound Optimization BY Quadratic Approximation (BOBYQA); and (c) generalized reduced gradient (GRG nonlinear). These methods were chosen because of their availability in standard modeling packages and ease of implementation. The Nelder–Mead algorithm is a direct search method that iteratively transforms the simplex using the plausible range of transition probabilities to find the optimal solution that minimizes the discrepancy between the projected 2-year OS and its observed RWE rates (Reference Nelder and Mead31). Similar to the Nelder–Mead method, BOBYQA employs a derivative-free algorithm. In finding an optimal solution, BOBYQA utilizes a trust region procedure to iteratively update the values of the transition probabilities around neighboring values within the region using quadratic approximations (Reference Powell32). By contrast, the GRG nonlinear method uses partial derivatives in combination with nonlinear programming to find the optimal solution (Reference Lasdon, Waren, Jain and Ratner33). Of the three optimization methods, BOBYQA and GRG nonlinear are less sensitive to local minima than Nelder–Mead (Reference Powell32).

Data analysis

We implemented the Nelder–Mead and BOBYQA optimization in TreeAge Pro 2023 using the calibration menu and set the optimization thresholds to 1E-10 and 1E-13 in relative and absolute terms, respectively. The GRG nonlinear was implemented using the Solver add-in in Microsoft Excel®, with an equivalent relative threshold and 200 sets of starting values of transition probabilities within their plausible bounds. Satisfactory model fit was achieved if the projected 2-year OS falls within the CI coverage of the RWE OS (Reference Vanni, Karnon, Madan, White, Edmunds and Foss25). The goodness of fit was judged by the difference in the 2-year OS that each optimization method projected relative to the RWE, with a smaller value indicating a better fit to the RWE OS. Model projections of OS duration and LYG with parameterization using the pre-calibrated and calibrated transition probabilities were compared. Bias-corrected CIs (CIbc) and P values (P bc) were calculated from a beta and log-normal distribution of 10,000 Monte Carlo draws for transition probabilities and treatment effects, respectively. All outcomes were discounted at a 3 percent rate per year.

Results

Calibration performance and results

The projected 2-year OS was 92.25 percent in contrast to the observed RWE at 86.11 percent. All the optimization methods produced a satisfactory fit. However, the GRG nonlinear had the best goodness of fit with the smallest difference (1.64E-14) compared to the Nelder–Mead (1.89E-07) or BOBYQA optimization (6.18E-11) (Table 2). The calibrated transition probabilities from DFS to death significantly increased from the initial, pre-calibrated values, approaching the plausible upper bound for all three optimization methods. Calibration also significantly increased the transition probabilities leading to metastasis either from DFS or from recurrence. All methods targeted these transitions because of their role in terminating cohort membership from death or elevating its risk in the metastasis health state.

Table 2. Calibrated transition probabilities of the chemotherapy group

BOBYQA: Bound Optimization BY Quadratic Approximation; DFS: disease-free survival; GRG: generalized reduced gradient; OS: overall survival; SE: standard error. Stars denote a statistically significant difference between calibrated and pre-calibrated values at P bc < 0.050 (*) or P bc < 0.010 (**), computed from normally distributed Monte Carlo draws with the expected mean difference and combined variance.

By magnitude, when compared to the other optimization methods, the calibrated transition probabilities in the DFS health state under the GRG nonlinear were higher for transitions to death (+87 percent change from initial values) and metastasis (+52 percent change), but lower for transitions to recurrence (+20 percent change) (Table 2). In finding the optimal solution, the Nelder–Mead and BOBYQA, respectively, maximized the values of transition probabilities for transitions from DFS to recurrence and metastasis, whereas the GRG nonlinear leveraged transitions from DFS to death and from recurrence to metastasis. On average, changes in the remaining transition probabilities after calibration were appreciable (from recurrence to death) or minor (from metastasis to death) for all methods.

Comparison of projected OS and LYG

The pre-calibrated OS duration was 11.00 years (CIbc = 9.56, 12.39) for SoC and 12.94 years (CIbc = 11.50, 14.32) for adjuvant trastuzumab (Figure 2). The calibrated OS duration averaged ~7.50 years for SoC and ~9.50 years for adjuvant trastuzumab across all optimization methods. All calibrated estimates were significantly shorter in both treatment groups (P bc < 0.001), suggesting an overestimation of the pre-calibrated survival projections. The calibrated survival curves display a steeper downward slope that more realistically corresponds to the average life expectancy of the female population.

Figure 2. Projected overall survival and life-years gained. BOBYQA: Bound Optimization BY Quadratic Approximation; GRG: generalized reduced gradient; LY: life-years; LYG: life-years gained. Stars denote the statistical comparison of life-years with pre-calibrated values within a treatment group (*P bc < 0.050; **P bc < 0.010; ***P bc < 0.001). All statistical tests and the 95% confidence intervals were based on the Monte Carlo empirical distribution with bias corrections.

Adjuvant trastuzumab prolonged OS duration by approximately 2 years in the pre-calibrated model (LYG = 1.94; CIbc = 1.54, 2.35; P bc < 0.001) and calibrated model for all optimization methods (LYG = ~2.00; P bc < 0.001) (Figure 2). The difference with the pre-calibrated LYG was small and statistically non-significant for the Nelder–Mead (∆LYG = 0.06; CIbc = −0.09, 0.22; P bc = 0.445), BOBYQA (∆LYG = 0.04; CIbc = −0.09, 0.20; P bc = 0.559), and GRG nonlinear methods (∆LYG = 0.03; CIbc = −0.11, 0.17; P bc = 0.717).

Discussion

Calibration was used in this study to adjust transition probabilities and model projections against RWE survival. Survival projections using pre-calibrated transition probabilities overestimated the mean survival by around half as much as the calibrated values for SoC (11.00 vs. ~7.50 years) and by more than one-third as much for adjuvant trastuzumab (12.94 vs. ~9.50 years). Calibration, as applied in this study, shifted the initial survival curve downward with a proportional change in survival rates. The survival benefits of LYG were similar between the pre-calibrated and calibrated models.

The GRG nonlinear method had the best fit, and the transition probabilities thus calibrated can be used to inform a cost-effectiveness analysis of adjuvant trastuzumab for HER2-positive early breast cancer in the country. In comparison with other studies, our calibrated estimates of OS duration for SoC (7.80 years) and trastuzumab (9.78 years) are lower than the estimates for China (Reference Chen, Jiang, Shao, Sun and Shen14), Iran (Reference Ansaripour, Uyl-de Groot and Redekop15), and Thailand (Reference Kongsakon, Lochid-amnuay, Kapol and Pattanaprateep34) where a wide range of projected OS duration for standard chemotherapy (8.40–15.30 years) and trastuzumab (15.82–18.20 years) was reported. Our calibrated estimates of LYG broadly fall in the high range, but were less than those reported for China (Reference Chen, Jiang, Shao, Sun and Shen14) or Thailand (Reference Kongsakon, Lochid-amnuay, Kapol and Pattanaprateep34). Interestingly, studies reporting both greater OS duration and LYG of adjuvant trastuzumab also assumed a shorter duration of treatment effects than in this study (Reference Chen, Jiang, Shao, Sun and Shen14;Reference Kongsakon, Lochid-amnuay, Kapol and Pattanaprateep34), which would have led to lower estimates in our model. This incongruity may be attributable to variations in input parameters, model structures, and other aspects of modeling that are specific for the decision context in the respective study jurisdictions. The RWE OS in this study mirrors the practice in service delivery where a shortfall in the supply of health human resources negatively impacts on access to and the quality of oncologic care (Reference Gautama2;Reference Trapani, Murthy, Boniol, Booth, Simensen and Kasumba35;Reference Schaefers, Wenang, Afdal, Mukti, Sundari and Haier36). This health system context was thus captured and represented in the survival projections by virtue of model calibration.

Our study is the first known official HTA evaluation in the country to apply calibration to empirical RWE. We also compared different optimization methods, all of which passed the standard fit criteria in relation to the observed RWE and provided comparable survival projections. However, some key limitations of this study should be acknowledged. First, the limited scope of RWD encompassing two major hospitals may underrepresent patients with early breast cancer at the national level, rendering the results less generalizable to other settings of care providers, particularly those accessing care other than at high-caseload hospitals.

Second, we used only a single target, 2-year OS, in the calibration process. Both the follow-up duration and high attrition in the registry-based RWD are major limitations. The decision to focus on this early survival period in our calibration exercise was motivated by the need to anchor the model and its long-term calibrated projections to a time interval for which robust data were available and during which the risk of mortality is increasing (Reference Jatoi, Anderson, Jeong and Redmond29;Reference Jayasekara, MacInnis, Chamberlain, Dite, Leoce and Dowty30). However, this means that the trajectory of OS projections would proceed at a slope reflecting these high early mortality rates in the remaining time horizon, potentially underestimating survival in both treatment groups, particularly if the true mortality rates rapidly declined at subsequently nearby time points. Additional time points covering a meaningful portion of the model’s time horizon would help reduce uncertainties in our calibrated projections. We anticipate that the uncertainties surrounding the calibration targets, currently undefined in this study, will have the largest impact on our cost-effectiveness findings if they substantively influence the projected survival gains (Reference Moriña, de Sanjosé and Diaz37).

Establishing registries that integrate and harmonize data across key care providers is a crucial step toward ensuring the quality generation of RWE for economic evaluations in the country. While this process is ongoing and in the absence of empirical data to inform uncertainties in long-term survival, structured expert elicitation, which seeks to formally estimate uncertain quantities from expert knowledge in a probability distribution (Reference Soares, Colson, Bojke, Ghabri, Garay and Felli38), offers a valuable avenue. This method is frequently applied to extrapolate long-term treatment benefits or survival over periods of up to 20 years in published studies (Reference Willigers, Ouwens, Briggs, Heerspink, Pollock and Pecoits-Filho39) and frames uncertainties in the knowledge of local experts, rather than relying on evidence from different country contexts. Additionally, future evaluations may attempt calibrating against more than one index of key transitional drivers and at different time points concurrently, as more quality RWD becomes available or by utilizing structured expert elicitation.

Finally, the full implications of our calibrated model for costs and cost-effectiveness remain to be explored, as data collection on costs and health utilities was still ongoing at the time of writing this manuscript. However, we surmise that while the downward corrections in LYG will lower cost estimates in both treatment groups, the change is likely to be less pronounced for the trastuzumab group compared to SoC due to its high drug cost, which could result in larger incremental costs and ICER than pre-calibrated estimates. For instance, by parameterizing the model with direct medical costs from a trastuzumab study in the Philippines (Reference Genuino, Chaikledkaew, Guerrero, Reungwetwattana and Thakkinstian16), our calibrated model using GRG nonlinear optimization shows a borderline statistical difference, with a modest 8.22 percent increase in incremental costs (CIbc = 3.57, 16.43; P bc = 0.004) and a similar 6.56 percent increase in ICER per LYG (CIbc = −0.43, 15.37; P bc = 0.065). We anticipate corrections of similar magnitude to ICER in our upcoming cost-effectiveness analysis results.

Our study illustrates the realities of incorporating RWD/E for cost-effective assessments of health technologies within the constraints of the country’s fragmented health system (Reference Mahendradhata, Trisnantoro, Listyadewi, Soewondo, Marthias and Harimurti40). The regulatory framework for population-based cancer registries has been in place since 2016 involving 15 regional and national referral hospitals (20). However, its current unavailability suggests the need for greater investments in realizing this objective. Although our data acquisition was somewhat partial and ad hoc, it is preferable to forgoing HTA due to a perceived lack of viable RWE or capacity to conduct HTA, which are listed among the common reasons to avoid HTA (Reference Teerawattananon, Painter, Dabak, Ottersen, Gopinathan and Chola41).

The RWD/E application in HTA has gained traction globally and in Asia to strengthen the conduct of cost-effectiveness assessments to fill the evidence gap from randomized trials in a local setting or boost the applicability of such an assessment to the setting where coverage or pricing decisions are to be made (Reference Kc, Lin, Bayani, Zemlyanska, Adler and Ahn42). With regard to input parameters, the use of RWD/E is imperative to derive disease background parameters such as mortality or prevalence (Reference Lee, Dayer, Jiao, Carlson, Devine and Veenstra43), essential to inform key parameters that cannot be generalized from external settings such as costs and health utilities (Reference Trapani, Murthy, Boniol, Booth, Simensen and Kasumba35;Reference Lee, Dayer, Jiao, Carlson, Devine and Veenstra43), and strategic to validate experimental evidence in the practice setting (Reference Bartlett, Dhruva, Shah, Ryan and Ross44;Reference Sheffield, Dreyer, Murray, Faries and Klopchin45). Transition probabilities of the reference treatment are a product of incidence rates of various health events in the natural course of a disease, which extended longitudinal observations are best to inform (Reference Gidwani and Russell46). However, acquiring such rich RWD to construct a full set of RWE-informed transition probabilities is infeasible or highly costly within the remit of the surveillance system of many LMICs (Reference Lazem and Sheikhtaheri47). Model calibration can therefore be a pragmatic solution to compromise this gap between the high demand for quality, extensive RWD/E and its short supply.

In this view of pragmatism, model calibration works to align transition probabilities extracted from multiple studies, which may inadequately represent the practice setting of a new health technology, with the existing RWE at the local level. We believe that advocating the utilization of existing RWD/E with a fair assessment of its inherent bias should be pursued for better informed policy decisions. The annual number of HTA submissions processed by the Indonesia’s Health Technology Assessment Committee and the National Formulary can reach hundreds and is expected to grow steadily (Personal Communication). In this setting, RWE calibration can lend more credence to assessment results in a more accessible way by utilizing one or more RWE data points to target in model projections, which also enhances the face validity of the results to policy makers (Reference Vanni, Karnon, Madan, White, Edmunds and Foss25). Noticeable differences between the pre-calibrated and calibrated results should, at minimum, lead to a discussion for more informed decision making or be presented as a form of sensitivity analysis to enhance the credibility of the modeling results.

Conclusions

Our calibrated transition probabilities produced survival projections that were more congruent with the observed RWE for the treatment of early breast cancer in Indonesia. The pre-calibrated transition probabilities overestimated OS duration, which could potentially confound the results when interacting with cost parameters in a cost-effectiveness analysis. Our model calibration had low data burden and provides an example of how model refinements for more credible decision making in HTA can be performed within the constraints of the existing health data infrastructure for RWE in LMICs. Model calibration can be applied with relative ease in other disease areas, and its applications can encourage the validation of model parameters and promote RWD/E utilization toward advancing the health data infrastructure.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/S0266462325000157.

Data availability statement

Summary data from cancer registries, calibration macros (.xlsm) and the TreeAge model (.trex) used in the analysis of the current study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors gratefully acknowledge the invaluable supervision and consultation provided by Indonesia’s Health Technology Assessment Committee and Professor Erna Kristin at the Faculty of Medicine, Public Health, and Nursing, Gadjah Mada University throughout the study. The authors would like to thank Herindita Puspitaningtyas, M.D. at Dr. Sardjito Referral General Hospital and Evelina Suzanna, M.D. at Dharmais Cancer Hospital for their facilitation in making the patient registry data available for this study.

Author contributions

CS, RRN, and EH conceptualized the study. RTKM, PL, EH, and DRB designed the adapted Markov model. DRB coordinated procurement of registry data. AR, RTKM, and PL conducted all analyses and wrote the first draft. CS, RRN, EH, and DRB provided a critical review of this manuscript and revised the manuscript for important intellectual content. All authors have read and approved the final manuscript.

Funding statement

Funding was provided by the Health Development Policy Agency, Ministry of Health Republic of Indonesia as part of the health technology assessment program under the Budget Implementation List of the State Budget. This study was made possible by the generous support of the American people through the US Agency for International Development (USAID) contract No. 7200AA18C00074. The contents are the responsibility of the authors and do not necessarily reflect the views of the Ministry of Health Republic of Indonesia and USAID or the United States Government.

Competing interest

The authors declare that they have no competing interests.

Ethics approval

The use of deidentified registry data for the purpose of the study was approved by the ethics committees of Gadjah Mada University (KE/FK/1209/EC), Dharmais Cancer Hospital (260/KEPK/XI/2022), and Dr. Sardjito Referral General Hospital (LB.02.03/XI.2.2/19105/2022).

References

World Health Organization [Internet]. Global cancer observatory: Indonesia. Geneva: World Health Organization; 2021. Available from: https://gco.iarc.fr/today/data/factsheets/populations/360-indonesia-fact-sheets.pdf.Google Scholar
Gautama, W. Breast cancer in Indonesia in 2022: 30 years of marching in place. Indones J Cancer. 2022;16:12.CrossRefGoogle Scholar
Hutajulu, SH, Prabandari, YS, Bintoro, BS, Wiranata, JA, Widiastuti, M, Suryani, ND, et al. Delays in the presentation and diagnosis of women with breast cancer in Yogyakarta, Indonesia: a retrospective observational study. PLoS ONE. 2022;17:e0262468.CrossRefGoogle ScholarPubMed
Ruddy, KJ, Gelber, S, Tamimi, RM, Schapira, L, Come, SE, Meyer, ME, et al. Breast cancer presentation and diagnostic delays in young women. Cancer. 2014;120:2025.Google ScholarPubMed
Badan Penyelenggara Jaminan Sosial (BPJS Kesehatan). Report of program and financial management of National Health Insurance (BPJS Kesehatan). Jakarta: BPJS Kesehatan; 2020.Google Scholar
Bloom, D, Chen, S, McGovern, M, Prettner, K, Candeias, V, Bernaert, A, et al. Economics of non-communicable diseases in Indonesia. Geneva: World Economic Forum; 2015.Google Scholar
Pinto, R, Masaki, E, Harimurti, P. Indonesia health financing system assessment: spend more, right & better. Jakarta: The World Bank Group; 2016.CrossRefGoogle Scholar
Spielmann, M, Roché, H, Delozier, T, Canon, JL, Romieu, G, Bourgeois, H, et al. Trastuzumab for patients with axillary-node-positive breast cancer: Results of the FNCLCC-PACS 04 trial. J Clin Oncol. 2009;27:61296134.CrossRefGoogle ScholarPubMed
Perez, EA, Romond, EH, Suman, VJ, Jeong, JH, Sledge, G, Geyer, CE Jr. et al. Trastuzumab plus adjuvant chemotherapy for human epidermal growth factor receptor 2- positive breast cancer: planned joint analysis of overall survival from NSABP B-31 and NCCTG N9831. J Clin Oncol. 2014;32:37443752.CrossRefGoogle ScholarPubMed
Cameron, D, Piccart-Gebhart, MJ, Gelber, RD, Procter, M, Goldhirsch, A, de Azambuja, E, et al. 11 years’ follow-up of trastuzumab after adjuvant chemotherapy in HER2-positive early breast cancer: final analysis of the HERceptin Adjuvant (HERA) trial. Lancet. 2017;389:11951205.CrossRefGoogle Scholar
Slamon, DJ, Godolphin, W, Jones, LA, Holt, JA, Wong, SG, Keith, DE, et al. Studies of the HER-2/neu proto-oncogene in human breast and ovarian cancer. Science. 1989;244:707712.CrossRefGoogle Scholar
World Health Organization [Internet]. WHO prequalifies first biosimilar medicine to increase worldwide access to life-saving breast cancer treatment. Geneva: World Health Organization; 2019. Available from: https://www.who.int/news/item/18-12-2019-who-prequalifies-first-biosimilar-medicine-to-increase-worldwide-access-to-life-saving-breast-cancer-treatment.Google Scholar
Bertram, MY, Lauer, JA, De Joncheere, K, Edejer, T, Hutubessy, R, Kieny, MP, et al. Cost- effectiveness thresholds: pros and cons. Bull World Health Organ. 2016;94:925930.CrossRefGoogle ScholarPubMed
Chen, W, Jiang, Z, Shao, Z, Sun, Q, Shen, K. An economic evaluation of adjuvant trastuzumab therapy in HER2-positive early breast cancer. Value Health. 2009;12:S82–84.Google ScholarPubMed
Ansaripour, A, Uyl-de Groot, CA, Redekop, WK. Adjuvant trastuzumab therapy for early HER2-positive breast cancer in Iran: a cost-effectiveness and scenario analysis for an optimal treatment strategy. Pharmacoeconomics. 2018;36:91103.CrossRefGoogle ScholarPubMed
Genuino, AJ, Chaikledkaew, U, Guerrero, AM, Reungwetwattana, T, Thakkinstian, A. Cost-utility analysis of adjuvant trastuzumab therapy for HER2-positive early-stage breast cancer in the Philippines. BMC Health Serv Res. 2019;19:874.CrossRefGoogle ScholarPubMed
Buendía, JA, Vallejos, C, Pichón-Rivière, A. An economic evaluation of trastuzumab as adjuvant treatment of early HER2-positive breast cancer patients in Colombia. Biomedica. 2013;33:411417.Google ScholarPubMed
Botelho, CH, Estevez-Diz, MDP, Campolina, AG. Cost-effectiveness analysis of trastuzumab for early breast cancer in Brazil. Expert Rev Pharmacoecon Outcomes Res. 2021;22:6372.CrossRefGoogle ScholarPubMed
Kristin, E, Endarti, D, Khoe, LC, Pratiwi, WR, Pinzon, RT, Febrinasari, RP, et al. Does trastuzumab offer good value for money for breast cancer patients with metastasis in Indonesia? Asian Pac J Cancer Prev. 2022;23:24412447.CrossRefGoogle ScholarPubMed
Decree of the Minister of Health of the Republic of Indonesia number 328/MENKES/IX/2013 on the national formulary [Keputusan Menteri Kesehatan Republik Indonesia nomor 328/MENKES/IX/2013 tentang formularium nasional]. Jakarta, 2013.Google Scholar
Siebert, U, Alagoz, O, Bayoumi, AM, Jahn, B, Owens, DK, Cohen, DJ, et al. State-transition modeling: a report of the ISPOR-SMDM modeling good research practices task force-3. Value Health. 2012;15:812820.Google ScholarPubMed
Skovlund, E, Leufkens, HGM, Smyth, JF. The use of real-world data in cancer drug development. Eur J Cancer. 2018;101:6976.CrossRefGoogle ScholarPubMed
Katkade, VB, Sanders, KN, Zou, KH. Real world data: an opportunity to supplement existing evidence for the use of long-established medicines in health care decision making. J Multidiscip Healthc. 2018;11:295304.CrossRefGoogle ScholarPubMed
Sievers, H, Joos, A, Hiligsmann, M. Real-world evidence: perspectives on challenges, value, and alignment of regulatory and national health technology assessment data collection requirements. Int J Technol Assess Health Care. 2021;37:e40.CrossRefGoogle ScholarPubMed
Vanni, T, Karnon, J, Madan, J, White, RG, Edmunds, WJ, Foss, AM, et al. Calibrating models in economic evaluation: a seven-step approach. Pharmacoeconomics. 2011;29:3549.Google ScholarPubMed
Allen, K, Lohrisch, CA, Le, D, Diocee, RM, Speers, C, Nichol, A, et al. Survival following locoregional recurrence in breast cancer by clinical subtype. J Clin Oncol. 2021;39:543.CrossRefGoogle Scholar
Joensuu, H, Bono, P, Kataja, V, Alanko, T, Kokko, R, Asola, R, et al. Fluorouracil, epirubicin, and cyclophosphamide with either docetaxel or vinorelbine, with or without trastuzumab, as adjuvant treatments of breast cancer: final results of the FinHer trial. J Clin Oncol. 2009;27:56855692.CrossRefGoogle ScholarPubMed
Slamon, DJ, Eiermann, W, Robert, NJ, Giermek, J, Martin, M, Jasiowka, M, et al. Ten year follow-up of BCIRG-006 comparing doxorubicin plus cyclophosphamide followed by docetaxel (AC→T) with doxorubicin plus cyclophosphamide followed by docetaxel and trastuzumab (AC→TH) with docetaxel, carboplatin and trastuzumab (TCH) in HER2+ early breast cancer. Cancer Res. 2016;76:S504.CrossRefGoogle Scholar
Jatoi, I, Anderson, WF, Jeong, JH, Redmond, CK. Breast cancer adjuvant therapy: time to consider its time-dependent effects. J Clin Oncol. 2011;29(17):23012304.CrossRefGoogle ScholarPubMed
Jayasekara, H, MacInnis, RJ, Chamberlain, JA, Dite, GS, Leoce, NM, Dowty, JG, et al. Mortality after breast cancer as a function of time since diagnosis by estrogen receptor status and age at diagnosis. Int J Cancer. 2019;145(12):32073217.CrossRefGoogle ScholarPubMed
Nelder, JA, Mead, R. A simplex method for function minimization. Comput J. 1965;7:308313.Google Scholar
Powell, M. The BOBYQA algorithm for bound constrained optimization without derivatives. Cambridge: University of Cambridge, Physics DoAMaT; 2009.Google Scholar
Lasdon, LS, Waren, AD, Jain, A, Ratner, MW. Design and testing of a generalized reduced gradient code for nonlinear programming. ACM Trans Math Softw. 1978;4:3450.CrossRefGoogle Scholar
Kongsakon, R, Lochid-amnuay, S, Kapol, N, Pattanaprateep, O. From research to policy implementation: trastuzumab in early-stage breast cancer treatment in Thailand. Value Health Reg Issues. 2019;18:4753.CrossRefGoogle ScholarPubMed
Trapani, D, Murthy, SS, Boniol, M, Booth, C, Simensen, VC, Kasumba, MK, et al. Distribution of the workforce involved in cancer care: a systematic review of the literature. ESMO Open. 2021;6:100292.CrossRefGoogle Scholar
Schaefers, J, Wenang, S, Afdal, A, Mukti, AG, Sundari, S, Haier, J. Population-based study on coverage and healthcare processes for cancer during implementation of national healthcare insurance in Indonesia. Lancet Reg Health Southeast Asia. 2022;6:100045.CrossRefGoogle Scholar
Moriña, D, de Sanjosé, S, Diaz, M. Impact of model calibration on cost-effectiveness analysis of cervical cancer prevention. Sci Rep. 2017;7(1):17208.Google ScholarPubMed
Soares, M, Colson, A, Bojke, L, Ghabri, S, Garay, OU, Felli, JK, et al. Recommendations on the use of structured expert elicitation protocols for healthcare decision making: a good practices report of an ISPOR Task Force. Value Health. 2024;27(11):14691478.CrossRefGoogle ScholarPubMed
Willigers, BJA, Ouwens, M, Briggs, A, Heerspink, HJL, Pollock, C, Pecoits-Filho, R, et al. The role of expert opinion in projecting long-term survival outcomes beyond the horizon of a clinical trial. Adv Ther. 2023;40(6):27412751.CrossRefGoogle ScholarPubMed
Mahendradhata, Y, Trisnantoro, L, Listyadewi, S, Soewondo, P, Marthias, T, Harimurti, P, et al. The Republic of Indonesia health system review. New Delhi: Asia Pacific Observatory on Health Systems and Policies; 2017.Google Scholar
Teerawattananon, Y, Painter, C, Dabak, S, Ottersen, T, Gopinathan, U, Chola, L, et al. Avoiding health technology assessment: a global survey of reasons for not using health technology assessment in decision making. Cost Effect Resour Alloc. 2021;19:62.CrossRefGoogle Scholar
Kc, S, Lin, LW, Bayani, DBS, Zemlyanska, Y, Adler, A, Ahn, J, et al. What, where, and how to collect real-world data and generate real-world evidence to support drug reimbursement decision-making in Asia: a reflection into the past and a way forward. Int J Health Policy Manag. 2023;12:6858.Google Scholar
Lee, W, Dayer, V, Jiao, B, Carlson, JJ, Devine, B, Veenstra, DL. Use of real-world evidence in economic assessments of pharmaceuticals in the United States. J Manag Care Spec Pharm. 2021;27:514.Google ScholarPubMed
Bartlett, VL, Dhruva, SS, Shah, ND, Ryan, P, Ross, JS. Feasibility of using real-world data to replicate clinical trial evidence. JAMA Netw Open. 2019;2:e1912869.Google ScholarPubMed
Sheffield, KM, Dreyer, NA, Murray, JF, Faries, DE, Klopchin, MN. Replication of randomized clinical trial results using real-world data: paving the way for effectiveness decisions. J Comp Eff Res. 2020;9:10431050.CrossRefGoogle ScholarPubMed
Gidwani, R, Russell, LB. Estimating transition probabilities from published evidence: a tutorial for decision modelers. Pharmacoeconomics. 2020;38:11531164.CrossRefGoogle ScholarPubMed
Lazem, M, Sheikhtaheri, A. Barriers and facilitators for the implementation of health condition and outcome registry systems: a systematic literature review. J Am Med Inform Assoc. 2022;29(4):723734.CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Model structure.

Figure 1

Table 1. Initial transition probabilities of the chemotherapy group and treatment effects of adjuvant trastuzumab

Figure 2

Table 2. Calibrated transition probabilities of the chemotherapy group

Figure 3

Figure 2. Projected overall survival and life-years gained. BOBYQA: Bound Optimization BY Quadratic Approximation; GRG: generalized reduced gradient; LY: life-years; LYG: life-years gained. Stars denote the statistical comparison of life-years with pre-calibrated values within a treatment group (*Pbc < 0.050; **Pbc < 0.010; ***Pbc < 0.001). All statistical tests and the 95% confidence intervals were based on the Monte Carlo empirical distribution with bias corrections.

Supplementary material: File

Rahadi et al. supplementary material

Rahadi et al. supplementary material
Download Rahadi et al. supplementary material(File)
File 398 KB