The median prevalence of childhood psychiatric disorders has been reported as 12% for prepubescent school-age children and 15% among adolescents. Reference Roberts1 Among British children aged 5–15 years, it has been estimated that 10% have a psychiatric disorder, with 5% having conduct disorder and 4% having emotional disorders. Reference Meltzer, Gatward, Goodman and Ford2 A longitudinal community study of children aged 9–13 years from the American state of North Carolina estimated that the 3-month prevalence of any psychiatric disorder averaged 13.3% (95% CI 11.7–15.0%). Reference Costello, Mustillo, Erkanli, Keeler and Angold3 There is increasing evidence from longitudinal studies Reference Kovacs, Feinberg, Crouse-Novak, Palauskas and Finkelstein4–Reference Biederman, Monuteaux, Mick, Spencer, Wilens and Silva14 and retrospective reports Reference Giaconia, Reinberg, Silverman, Pakiz, Frost and Cohen15 that childhood psychiatric disorders may have significant adverse consequences for mental health, educational outcomes and substance misuse through childhood and into adulthood. However, relatively little is known about their consequences in terms of costs or health utilities (preference-based measures of health outcome developed from economic theory), which can directly or indirectly inform resource-allocation decisions. Cost of illness studies have been conducted for a relatively small number of childhood psychiatric disorders, including childhood depression, Reference McCrone, Knapp and Fombonne16 separation anxiety Reference Bodden, Dirksen and Bögels17 and attention-deficit hyperactivity disorder (ADHD), Reference Leibson and Long18 and health utilities have been estimated for ADHD Reference Matza, Secnik, Rentz, Mannix, Sallee and Gilbert19,Reference Petrou and Kupek20 and autism-spectrum disorders. Reference Petrou and Kupek20 The purpose of this research is to augment the limited economic evidence in this area by estimating costs and health utilities associated with childhood psychiatric disorders during mid-childhood. This will provide a significant new resource for clinical decision-makers and budgetary and service planners and to analysts estimating the cost-effectiveness of preventive or treatment interventions for these disorders.
Method
Study population
Children who participated in the EPICure study represented the study population for this empirical investigation. The EPICure study is a whole-population longitudinal study of all infants born at 20 to 25 completed weeks of gestation in all 276 maternity units in the UK and Republic of Ireland from March to December 1995. Reference Wood, Marlow, Costeloe and Gibson21 Participants in the EPICure study were selected for this investigation as previous research had indicated higher prevalence rates of psychiatric disorders among children born preterm or with a low birth weight compared with general population samples. Reference Botting, Powls, Cooke and Marlow22–Reference Johnson, Hollis, Kochhar, Hennessy, Wolke and Marlow25 Of the 307 surviving extremely preterm infants in the EPICure study, 241 (78.2%) were assessed at a median age of 6 years 4 months (range: 5 years 2 months to 7 years 3 months) Reference Marlow, Wolke and Bracewell26 and 219 (71.3%) again at a median age of 10 years 11 months (range: 10 years 1 month to 12 years 1 month). Reference Johnson, Hennessy, Smith, Trikic, Wolke and Marlow27 A control group of 153 mainstream school classmates who were born at full term and matched for age, gender and ethnic group was also evaluated at a median age of 10 years 11 months (range: 9 years 9 months to 12 years 3 months). A full description of the EPICure study is available elsewhere. Reference Wood, Marlow, Costeloe and Gibson21,Reference Johnson, Hennessy, Smith, Trikic, Wolke and Marlow27 The extremely preterm children and their classmate controls were analysed separately for the purposes of the empirical investigation reported in this paper. Additionally, we analysed the pooled study population controlled for clinical and sociodemographic confounders, including gestational age at birth and a measure of neurosensory or motor impairment. Ethical approval for the study was obtained from the Southampton and South West Hampshire Research Ethics Committee and approved by the Central Office for Research Ethics Committees (COREC).
Psychiatric assessments
Childhood psychiatric disorders were diagnosed using the Development and Well Being Assessment (DAWBA), Reference Goodman, Ford, Richards, Gatward and Meltzer28 a semi-structured interview conducted with the main parent (usually the mother) or completed online by the main parent around the child's eleventh birthday. Information obtained from the DAWBA was used to assign ICD–10 29 and DSM–IV–TR 30 diagnoses. Supplemental information was provided by teachers who completed a questionnaire-based version of the DAWBA. Multi-informant data were collated and potential diagnoses were computer-generated using scoring algorithms. These were reviewed by two child and adolescent psychiatrists (C.H. and P.K.) masked to group allocation who assigned final DSM–IV and ICD–10 consensus diagnoses by reviewing quantitative symptom data and qualitative transcripts. In this paper we primarily refer to psychiatric disorders defined using DSM–IV–TR criteria. 31 Diagnostic classifications were assigned to the categories of emotional, ADHD, conduct, autistic and tic disorders. In addition, the Kaufman–Assessment Battery for Children (K–ABC) was used to obtain IQ scores (range 40–160). Reference Kaufman and Kaufman32 Cognitive impairment was defined as either moderate (–3 standard deviations to –2 standard deviations or IQ scores of 71 to 81) or severe (< –3 standard deviations or IQ scores of ≤70) using the mean (s.d.) of classmates to account for the secular drift in IQ scores over time. Reference Johnson, Hennessy, Smith, Trikic, Wolke and Marlow27,Reference Wolke, Ratschinski, Ohrt and Riegel33
Estimation of costs
As part of the battery of assessments performed at 11 years, the main parent was asked to complete a detailed postal questionnaire about their child's resource utilisation over the previous year of life. The questionnaire was piloted to ascertain its acceptability, comprehension and reliability and reminder letters were sent to parents to increase the response and completion rates. The data collected from the main parent included their child's use of hospital in-patient and day care services, community health services, prescribed medications, social services and education services. The components of resource utilisation and the units in which they were measured are summarised in online Table DS1. Estimates of service provision were derived from these data and usually expressed in terms of contact hours. For all hospital admissions, estimates of service provision were expressed in terms of patient days with part of a day at each level of care counted as a 24-hour period. For education services, estimates of service provision reflected the level of educational assistance within each type of educational establishment (mainstream school, mainstream school with special unit attached, special school for the physically disabled and special school for children with intellectual disability or learning difficulties). In addition to information provided by parents, teachers were asked to identify children with special educational needs, defined in the educational context as those with intellectual disability or learning difficulties that make it harder for them to learn or access education than most children of the same age, and they were also asked to detail any special educational needs support the child received. Reference Johnson, Hennessy, Smith, Trikic, Wolke and Marlow27 This included information on the type and duration of individual education or behavioural plans, one-to-one special needs provision, small group special needs provision, outreach support and support from educational psychologists, clinical psychologists, physiotherapists, speech therapists and occupational therapists. All resource-use data were entered directly from the research instruments into a purpose-built data-collection program with in-built safeguards against inconsistent entries and then verified by dual coding.
United Kingdom unit costs were applied to each resource item to value total resource use for each study child over an annual period. All unit costs employed followed recent guidelines on costing public services as part of economic evaluation. Reference Allen, Beecham, Netten and Beecham34–36 The calculation of these costs was underpinned by the concept of opportunity cost, which can be defined as the value of the next best alternative for using these resources. Reference Drummond, Sculpher, Torrance, O'Brien and Stoddart35 The costs of hospital in-patient and day care services were largely derived from English Department of Health reference costs based upon National Health Service trust financial returns. 37 The unit costs of community health and social services were largely derived from national sources, Reference Curtis38 and took account of time spent by professionals on indirect activities, such as travelling and paper work. Some unit costs of health and social services were calculated from first principles using established accounting methods. Reference Allen, Beecham, Netten and Beecham34 Drug costs were obtained from the British National Formulary. 39 Educational costs were based upon data for different types of educational establishment obtained from the Department of Education and Skills in England (details available from the author on request). These included salaries, employer on-costs and revenue and capital overheads associated with each form of special educational needs support described above. All costs were expressed in pounds sterling and reflected values for the financial year 2006–7.
Estimation of health utilities
The postal questionnaire completed by the main parent around the child's eleventh birthday included the Health Utilities Index (HUI), which can be described as a family of health status classification systems with preference weights (or multi-attribute utility scores) attached to each permutation of responses. Reference Torrance, Furlong, Feeny and Boyle40 The main parent was considered the appropriate subject for completing the HUI as related research had indicated that the comprehension level required for successful completion is somewhat higher for a paediatric sample where a number of children have developmental disabilities. Reference Eiser and Morse41 The main parent completed the unedited 15-item questionnaire for proxy-assessed usual health status assessment, which was obtained from the HUI developers and covers both Mark 2 (HUI2) and Mark 3 (HUI3) health status classification systems. The ‘usual’ health focus of the questions has previously been applied in population health surveys, where short-term illnesses such as influenza are not the major concern. Reference Furlong, Feeny, Torrance and Barr42 The HUI2 health status classification system covers seven attributes: sensation, mobility, emotion, cognition, self-care, pain and fertility. The HUI3 health status classification system covers eight attributes: cognition, vision, hearing, speech, ambulation, dexterity, emotion and pain. The HUI3 health status classification system is now recommended by the HUI developers as the preferred measure of primary analyses because of its broad applicability in both clinical and general population health studies, improvements in a number of definitions, and an increased orthogonality of its attributes for structural independence. Reference Furlong, Feeny, Torrance and Barr42,Reference Horsman, Furlong, Feeny and Torrance43 Consequently, our primarily analyses of preference-based health-related quality of life outcomes were based on the HUI3 health status classification system. Function within each HUI3 attribute is graded on a five- or six-point scale corresponding to the level of severity, ranging from normal function (level 1) to severe impairment (level 5 or 6). Responses to the HUI3 health status classification system were converted into multiplicative multi-attribute utility scores using a published utility function. Reference Feeny, Furlong, Torrance, Goldsmith, Zhu and DePauw44,Reference Furlong, Feeny, Torrance, Goldsmith, DePauw and Zhu45 These multi-attribute utility scores are based on the permutation of responses across the eight attributes and are expressed on an interval scale ranging from –0.36 (representing the health state with the lowest level of function for all attributes) to 1.00 (representing the health state with the highest level of function for all attributes). The multi-attribute utility scoring algorithm for the HUI3 can be summarised as
where u* is the utility score for the overall health state being measured and the b j 's are substituted from a table of coefficients provided by the HUI developers for the appropriate attribute and level. Reference Furlong, Feeny, Torrance, Goldsmith, DePauw and Zhu45 To develop the multi-attribute utility scoring algorithm a random sample of 504 general population adults living in the city of Hamilton, Canada had previously been asked to value selected health states using both a visual analogue scaling technique and a standard gamble instrument. Reference Furlong, Feeny, Torrance, Goldsmith, DePauw and Zhu45 Analyses of preference-based health-related quality of life outcomes in our study were repeated using the HUI2 health status classification system and an underpinning multi-attribute utility scoring algorithm recently estimated on the basis of the preferences of 198 members of the UK general population. Reference McCabe, Stevens, Roberts and Brazier46 The latter measure and underpinning multi-attribute utility scoring algorithm might be considered to generate relevant values for UK policy purposes. 36
Statistical methods
Differences in baseline sociodemographic and clinical characteristics between children with and without a psychiatric disorder, as defined by DSM–IV–TR criteria, were tested using the Pearson chi-squared test.
A number of statistical approaches were tested in order to impute costs for children with some missing data. Given the negligible level of missing cost data (<2%) in the final study sample, simple linear regression and simulation-based multiple imputation Reference Schafer47 for each psychiatric group produced similar results. Therefore, it was considered appropriate to use the estimates generated by the simple linear regression in this analysis. Reference Schafer47–Reference Gray, Wolstenholme, Wordsworth and Clarke49
For the preterm population, term-born population and pooled study population, comparisons of each category of public sector costs and of total public sector costs were made between children with and without a psychiatric disorder as defined by DSM–IV–TR criteria. Similarly, for the preterm, term-born and pooled populations, comparisons of total public sector costs were made between children with and without varying levels of cognitive impairment. Comparisons of total public sector costs between children with and without individual psychiatric disorders (emotional, ADHD, conduct, autistic, tic) as defined by DSM–IV–TR criteria were restricted to the pooled population because of the relatively small sample sizes for some disorders (e.g. n = 4 for tic disorders). Comparisons of costs are reported as mean values with standard deviations and mean differences in costs between the comparison groups with 95% confidence intervals. As the data for costs were skewed, in addition to Student t-tests of cost differences, non-parametric bootstrap estimation was used to derive 95% confidence intervals for mean cost differences between the comparison groups. Reference Barber and Thompson50 The bootstrap method does not rely on parametric assumptions concerning the underlying distribution of data, hence its usefulness for generating confidence intervals for skewed data. Reference Briggs, Claxton and Sculpher51 Using a large number of simulations, and based on sampling with replacement from the original data, the bootstrap method estimates the distribution of a sampling statistic. Reference Briggs, Claxton and Sculpher51 Each of the confidence intervals surrounding mean cost differences was calculated using 10 000 bias-corrected bootstrap replications.
In addition, generalised linear regressions Reference Briggs, Claxton and Sculpher51 were performed for the preterm, term-born and pooled populations with total public sector costs over the previous year of life representing the dependent variable in the analyses. Three regression models were constructed for each population group. In the first model, the main independent variable was a dichotomous variable of whether or not the child had a psychiatric disorder as defined by DSM–IV–TR criteria. In the second model, the main independent variable was a dichotomous variable of whether or not the child had moderate cognitive impairment, whereas in the third model it was a dichotomous variable of whether or not the child had severe cognitive impairment. For each generalised linear regression model, a gamma distribution and linear (identity) link function for costs was selected on the basis of its low Akaike information criterion (AIC) statistic (AIC statistics of 19.24, 19.24 and 19.23 for models 1, 2 and 3 respectively for the pooled population) compared with alternative distributional forms (e.g. Gaussian, inverse Gaussian and Poisson distributional families) and link functions (e.g. log link function). For the preterm and term-born populations, covariates included in the generalised linear regressions were gender (male, female), maternal marital status (married, single, cohabiting, widowed, separated or divorced), respondent parent's current age (<30, 30–39, 40–49, ≥50 years), type of accommodation (owner occupied, rented, other), access to car (yes, no), highest parental qualification (vocational or equivalent, ordinary level or equivalent, advanced level or equivalent, diploma or equivalent, university degree, postgraduate qualification, other, none), highest parental occupational status (professional or managerial, intermediate, routine and manual, long-term unemployed), language spoken at home (English only, English and other language(s)), number of smokers at home (0, 1, ≥2) and a measure of neurosensory or motor impairment (no, yes). Reference Johnson, Fawke, Hennessy, Rowell, Thomas and Wolke52 For the pooled population, covariates includes in the generalised linear regressions additionally included gestational age at birth (≤23 weeks, 24 weeks, 25 weeks, term). In sensitivity analyses, the measure of neurosensory or motor impairment was replaced by an interaction term between gestational age at birth and psychiatric disorder for the pooled population.
For the preterm population, term-born population and pooled study population, we used Fisher's exact test for equality of proportions to compare the proportion of children with suboptimal levels of function (defined as below level 1 function) within each of the eight attributes of the HUI3 health status classification system between children with and without a psychiatric disorder as defined by DSM–IV–TR criteria. The same comparison groups used as a basis for analysing total public sector costs were also used as a basis for analysing health utilities. Differences in the HUI3 and UK HUI2 multi-attribute utility scores between the comparison groups were tested using two-sample t-tests for unequal variance. Finally, we performed Tobit regressions to explore the effects of psychiatric disorders on the HUI3 and UK HUI2 multi-attribute utility scores for the preterm, term-born and pooled populations. Tobit regression was required to account for the censoring of the dependent variable, the multi-attribute utility score, which has an upper value of 1.0. Reference Greene53 As with costs, three regression models were constructed for each population group, which differed in terms of the main psychiatric independent variable. The same covariates incorporated into the generalised linear regressions on costs were incorporated into the Tobit regressions on health utilities.
All analyses were performed with a microcomputer using the Statistical Package for the Social Sciences (SPSS) (version 15.0) software and STATA (version 10.0) software for Windows XP. P-values of 0.05 or less were considered statistically significant.
Results
Postal questionnaires reporting costs and health utilities were returned for a total of 331 study children, including 190 extremely preterm children (representing 86.8% of extremely preterm children undergoing neurodevelopmental assessments at a median age of 10 years 11 months) and 141 term-born classmates (representing 92.2% of term-born classmates undergoing neurodevelopmental assessments at this time point). Multi-informant psychiatric assessments were performed on 321 (97.0%) of these 331 study children, with multiple imputation techniques used to estimate psychiatric diagnoses for the remaining 10 children. Reference Johnson, Hollis, Kochhar, Hennessy, Wolke and Marlow25 Children for whom postal questionnaires were not returned were more likely to be born at between 25 weeks exactly and 25 weeks 6 days, be of Black and minority ethnic origin, have had an operation for necrotising enterocolitis, to have unemployed parents and to have had lower cognitive scores or cognitive impairment at 2.5 and 6 years (P≥0.05). There were a number of significant differences between the 50 children with and 281 without a DSM–IV–TR clinical diagnosis, for whom postal questionnaires were returned, in terms of sociodemographic and clinical characteristics. Notably, children with a DSM–IV–TR clinical diagnosis were more likely to be male, less likely to have married parents, less likely to live in owner-occupied accommodation, more likely to live with smokers and more likely to have been born preterm (online Table DS2).
Resource-use values for children with and without a psychiatric disorder as defined by DSM–IV–TR criteria are summarised in online Table DS1 for the pooled study population. This table highlights significant differences in mean utilisation of a number of mental and non-mental health and social care services between these groups.
Public sector costs over the previous year of life for the preterm population, term-born population and pooled study population are summarised in online Table DS3. Among the pooled study population, mean (s.d.) public sector costs over the 12-month period were £7188 (s.d. = £5869) for the 50 children with a psychiatric disorder, as defined by DSM–IV–TR criteria, and £5116 (s.d. = £4370) for the 281 children without a psychiatric disorder, generating a mean cost difference of £2072 (bootstrap 95% CI £349–£3795), which was statistically significant (P = 0.020). Among the preterm population, the mean public sector cost difference between the 39 children with and 151 children without a psychiatric disorder, as defined by DSM–IV–TR criteria, was estimated at £1998 (bootstrap 95% CI –£164 to £4160, P = 0.076). Among the term population, the respective public sector cost difference between the 11 children with and 130 children without a psychiatric disorder, as defined by DSM–IV–TR criteria, was estimated at £51 (bootstrap 95% CI –£752 to £854, P = 0.903). When the data were analysed by cost category, a DSM–IV–TR clinical diagnosis was associated with significantly higher community health and social care costs in all population groups and significantly higher total health and social care costs in the preterm population and pooled study population.
Mean public sector costs over the previous year of life and mean cost differences between children with and without individual psychiatric disorders are summarised in Table 1 for the preterm population, term-born population and pooled study population. Of particular note are the additional £3170 (P = 0.001) and £8877 (P<0.0001) annual public sector costs associated with moderate cognitive impairment and severe cognitive impairment, respectively, in the preterm population; and the additional £6745 (P = 0.014), £3375 (P<0.0001) and £8530 (P<0.0001) annual public sector costs associated with a diagnosis of an autistic disorder, moderate cognitive impairment and severe cognitive impairment, respectively, in the pooled study population.
With disorder | Without disorder | Cost difference | |||||||
---|---|---|---|---|---|---|---|---|---|
Psychiatric disorder | n | Mean | s.d. | n | Mean | s.d. | Mean | 95% CIa | P b |
Preterm sample | |||||||||
Any DSM–IV clinical diagnosis | 39 | 8 071.8 | 6 358.5 | 151 | 6 073.8 | 5 264.9 | 1 998.0 | –164.4 to 4 160.4 | 0.076 |
Moderate cognitive impairmentc | 67 | 8 536.1 | 6 665.3 | 123 | 5 366.0 | 4 481.2 | 3 170.0 | 1 384.4 to 4 955.7 | 0.001 |
Severe cognitive impairmentd | 18 | 1 4519.4 | 6 187.6 | 172 | 5 643.0 | 4 765.3 | 8 876.5 | 5 923.5 to 11 829.5 | <0.0001 |
Term sample | |||||||||
Any DSM–IV clinical diagnosis | 11 | 4 053.9 | 1 127.7 | 130 | 4 003.3 | 2 624.7 | 50.6 | –752.3 to 853.6 | 0.903 |
Moderate cognitive impairmentc | 2 | 3 333.0 | 121.6 | 139 | 4 017.0 | 2 554.5 | e | e | e |
Severe cognitive impairmentd | 0 | – | – | 141 | 4 007.3 | 2 537.5 | e | e | e |
Total sample | |||||||||
Any emotional disorderf | 16 | 6 860.1 | 5 259.2 | 315 | 5 433.4 | 4 738.9 | 1 426.8 | –1 195.1 to 4 048.7 | 0.304 |
Any ADHD diagnosisg | 17 | 5 812.0 | 3 832.6 | 314 | 5 551.1 | 4 852.0 | 261.0 | –1 656.8 to 2 178.7 | 0.792 |
Any conduct disorderh | 17 | 7 033.5 | 5 700.1 | 314 | 5 342.0 | 4 609.3 | 1 691.5 | –1 006.2 to 4 389.2 | 0.246 |
Any autistic disorderi | 11 | 12 016.1 | 7 568.1 | 320 | 5 270.8 | 4 481.2 | 6 745.3 | 2 232.9 to 11 257.7 | 0.014 |
Tic disorder | 4 | 7 022.4 | 6 474.7 | 327 | 5 482.7 | 4 760.2 | 1 539.7 | –4 815.1 to 7 894.5 | 0.667 |
Any DSM–IV clinical diagnosis | 50 | 7 187.8 | 5 868.6 | 281 | 5 115.9 | 4 369.5 | 2 071.9 | 348.7 to 3 795.2 | 0.020 |
Moderate cognitive impairmentc | 69 | 8 385.3 | 6 625.2 | 262 | 4 650.3 | 3 645.8 | 3 735.0 | 2 087.8 to 5 382.1 | <0.0001 |
Severe cognitive impairmentd | 18 | 13 443.3 | 6 725.1 | 313 | 4 913.5 | 4 011.8 | 8 529.8 | 5 554.9 to 11 504.7 | <0.0001 |
The results of generalised linear regressions exploring the relationship between psychiatric disorders and total public sector costs over the previous year of life are shown in Table 2 for the preterm population, term-born population and pooled study population. After controlling for clinical and sociodemographic confounders, a DSM–IV–TR clinical diagnosis was associated with increases of £1499 (95% CI –£758 to £3755, P = 0.193) and £1505 (95% CI –£40 to £3049, P = 0.056) in annual public sector costs for the preterm and pooled study populations, respectively. Moderate cognitive impairment and severe cognitive impairment were associated with increases in annual public sector costs of £915 (95% CI –£687 to £2517, P = 0.263) and £4852 (95% CI –£839 to £10 542, P = 0.095), respectively, for the preterm population, and £1402 (95% CI –£88 to £2891, P = 0.065) and £5662 (95% CI £238 to £11 086, P = 0.041), respectively, for the pooled study population. The only other factor associated with significantly increased public sector costs across the regression models was extremely preterm birth for the pooled study population. Replacing the measure of neurosensory or motor impairment Reference Johnson, Fawke, Hennessy, Rowell, Thomas and Wolke52 by an interaction term between gestational age at birth and psychiatric disorder had no discernible effects on the results of the generalised linear regressions.
Psychiatric disorder | Adjusted regression coefficienta | Robust standard error | 95% CI | P |
---|---|---|---|---|
Preterm sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | 1 498.7 | 11 51.2 | –757.7 to 3 755.0 | 0.193 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | 915.1 | 817.2 | –686.6 to 2 516.8 | 0.263 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | 4 851.5 | 2 903.5 | –839.3 to 10 542.2 | 0.095 |
Term sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –33.8 | 829.1 | –1 658.8 to 1 591.2 | 0.967 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | e | e | e | e |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | e | e | e | e |
Total sample f | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | 1 504.5 | 788.2 | –40.3 to 3 049.3 | 0.056 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | 1 401.6 | 760.1 | –88.1 to 2 891.2 | 0.065 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | 5 662.2 | 2 767.4 | 238.2 to 11 086.3 | 0.041 |
In each population group, comparisons of the frequency and proportion of suboptimal levels of function were made between the children with and without a DSM–IV–TR clinical diagnosis for each of the eight attributes of the HUI3. These analyses revealed significantly higher proportions of suboptimal levels of function among the children with a DSM–IV–TR clinical diagnosis for four attributes (emotion, pain, dexterity and cognition) for the preterm population, three attributes (speech, emotion and dexterity) for the term population and five attributes (speech, emotion, pain, dexterity and cognition) for the pooled study population (P≤0.05).
Mean HUI3 multi-attribute utility scores and mean utility differences between children with and without individual psychiatric disorders are summarised in Table 3 for all three populations. Of particular note are the 0.165 (P = 0.003), 0.232 (P<0.0001) and 0.512 (P<0.0001) mean utility decrements associated with a psychiatric disorder as defined by DSM–IV–TR criteria, moderate cognitive impairment and severe cognitive impairment, respectively, in the preterm population; and the 0.198 (P = 0.027), 0.250 (P = 0.003), 0.261 (P = 0.011), 0.192 (P<0.0001), 0.273 (P<0.0001) and 0.571 (P<0.0001) mean utility decrements associated with an emotional disorder, an ADHD diagnosis, an autistic disorder, a psychiatric disorder as defined by DSM–IV–TR criteria, moderate cognitive impairment and severe cognitive impairment, respectively, in the pooled study population. Analogous results were generated using the alternative UK HUI2 multi-attribute utility measure with the exception that statistically significant utility decrements were also generated for a psychiatric disorder as defined by DSM–IV–TR criteria for the term-born population and a conduct disorder for the pooled study population (Table 4).
With disorder | Without disorder | Cost difference | |||||||
---|---|---|---|---|---|---|---|---|---|
Psychiatric disorder | n | Mean | s.d. | n | Mean | s.d. | Mean | 95% CI | P a |
Preterm sample | |||||||||
Any DSM–IV clinical diagnosis | 39 | 0.656 | 0.270 | 151 | 0.820 | 0.254 | 0.165 | 0.061 to 0.269 | 0.003 |
Moderate cognitive impairmentb | 67 | 0.635 | 0.331 | 123 | 0.867 | 0.180 | 0.232 | 0.140 to 0.324 | <0.0001 |
Severe cognitive impairmentc | 18 | 0.318 | 0.390 | 172 | 0.830 | 0.207 | 0.512 | 0.285 to 0.739 | <0.0001 |
Term sample | |||||||||
Any DSM–IV clinical diagnosis | 11 | 0.826 | 0.251 | 130 | 0.967 | 0.070 | 0.141 | –0.027 to 0.310 | 0.093 |
Moderate cognitive impairmentb | 2 | 0.884 | 0.165 | 139 | 0.957 | 0.102 | d | d | d |
Severe cognitive impairmentc | 0 | – | – | 141 | 0.956 | 0.102 | d | d | d |
Total sample | |||||||||
Any emotional disordere | 16 | 0.672 | 0.296 | 315 | 0.871 | 0.220 | 0.198 | 0.026 to 0.371 | 0.027 |
Any ADHD diagnosisf | 17 | 0.629 | 0.271 | 314 | 0.879 | 0.215 | 0.250 | 0.099 to 0.402 | 0.003 |
Any conduct disorderg | 17 | 0.727 | 0.260 | 314 | 0.870 | 0.221 | 0.143 | –0.008 to 0.294 | 0.062 |
Any autistic disorderh | 11 | 0.609 | 0.257 | 320 | 0.870 | 0.222 | 0.261 | 0.076 to 0.446 | 0.011 |
Tic disorder | 4 | 0.675 | 0.292 | 327 | 0.866 | 0.224 | 0.190 | –0.529 to 0.909 | 0.376 |
Any DSM–IV clinical diagnosis | 50 | 0.698 | 0.273 | 281 | 0.890 | 0.203 | 0.192 | 0.105 to 0.278 | <0.0001 |
Moderate cognitive impairmentb | 69 | 0.643 | 0.329 | 262 | 0.916 | 0.149 | 0.273 | 0.187 to 0.359 | <0.0001 |
Severe cognitive impairmentc | 18 | 0.318 | 0.390 | 313 | 0.889 | 0.178 | 0.571 | 0.345 to 0.797 | <0.0001 |
With disorder | Without disorder | Cost difference | |||||||
---|---|---|---|---|---|---|---|---|---|
Psychiatric disorder | n | Mean | s.d. | n | Mean | s.d. | Mean | 95% CI | P a |
Preterm sample | |||||||||
Any DSM–IV clinical diagnosis | 39 | 0.759 | 0.148 | 151 | 0.858 | 0.157 | 0.098 | 0.041 to 0.155 | 0.001 |
Moderate cognitive impairmentb | 67 | 0.754 | 0.186 | 123 | 0.883 | 0.123 | 0.130 | 0.077 to 0.182 | <0.0001 |
Severe cognitive impairmentc | 18 | 0.612 | 0.245 | 172 | 0.861 | 0.130 | 0.249 | 0.117 to 0.381 | 0.001 |
Term sample | |||||||||
Any DSM–IV clinical diagnosis | 11 | 0.854 | 0.131 | 130 | 0.948 | 0.077 | 0.094 | 0.005 to 0.183 | 0.040 |
Moderate cognitive impairmentb | 2 | 0.871 | 0.105 | 139 | 0.941 | 0.085 | d | d | d |
Severe cognitive impairmentc | 0 | – | – | 141 | 0.940 | 0.086 | d | d | d |
Total sample | |||||||||
Any emotional disordere | 16 | 0.760 | 0.161 | 315 | 0.888 | 0.139 | 0.127 | 0.037 to 0.218 | 0.009 |
Any ADHD diagnosisf | 17 | 0.792 | 0.120 | 314 | 0.888 | 0.142 | 0.096 | 0.028 to 0.164 | 0.008 |
Any conduct disorderg | 17 | 0.802 | 0.129 | 314 | 0.888 | 0.141 | 0.085 | 0.009 to 0.161 | 0.030 |
Any autistic disorderh | 11 | 0.721 | 0.152 | 320 | 0.887 | 0.140 | 0.165 | 0.056 to 0.275 | 0.007 |
Tic disorder | 4 | 0.801 | 0.156 | 327 | 0.884 | 0.141 | 0.083 | –0.164 to 0.329 | 0.367 |
Any DSM–IV clinical diagnosis | 50 | 0.782 | 0.149 | 281 | 0.901 | 0.133 | 0.118 | 0.071 to 0.165 | <0.0001 |
Moderate cognitive impairmentb | 69 | 0.757 | 0.185 | 262 | 0.915 | 0.108 | 0.158 | 0.109 to 0.206 | <0.0001 |
Severe cognitive impairmentc | 18 | 0.612 | 0.245 | 313 | 0.898 | 0.118 | 0.286 | 0.155 to 0.417 | <0.0001 |
Finally, the separate Tobit regressions revealed that, even after controlling for clinical and sociodemographic confounders, a psychiatric disorder as defined by DSM–IV–TR criteria, moderate cognitive impairment and severe cognitive impairment were associated with significant decrements in the HUI3 multi-attribute utility score of 0.226 (P<0.0001), 0.205 (P<0.0001) and 0.342 (P<0.0001), respectively, for the preterm population, and 0.213 (P<0.0001), 0.198 (P<0.0001) and 0.324 (P<0.0001), respectively, for the pooled study population (Table 5). Analogous results were generated using the alternative UK HUI2 multi-attribute utility measure (Table 6); the decrements in these utility scores were smaller in magnitude, but remained statistically significant. The only other factor associated with statistically significant decrements in utility scores across the regression models was extremely preterm birth for the pooled study population. Replacing the measure of neurosensory or motor impairment Reference Greene53 by an interaction term between gestational age at birth and psychiatric disorder had no discernible effects on the results of the Tobit regressions.
Psychiatric disorder | Adjusted regression coefficienta | Robust standard error | 95% CI | P |
---|---|---|---|---|
Preterm sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.226 | 0.053 | –0.332 to –0.120 | <0.0001 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | –0.205 | 0.046 | –0.297 to –0.113 | <0.0001 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | –0.342 | 0.093 | –0.526 to –0.158 | <0.0001 |
Term sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.144 | 0.088 | –0.317 to 0.030 | 0.104 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | e | e | e | e |
Severe cognitive impairmentd | ||||
No(reference group) | ||||
Yes | e | e | e | e |
Total sample f | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.213 | 0.045 | –0.302 to –0.124 | <0.0001 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | –0.198 | 0.043 | –0.282 to –0.113 | <0.0001 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | –0.324 | 0.090 | –0.501 to –0.146 | <0.0001 |
Psychiatric disorder | Adjusted regression coefficienta | Robust standard error | 95% CI | P |
---|---|---|---|---|
Preterm sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.130 | 0.033 | –0.196 to –0.064 | <0.0001 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | –0.124 | 0.029 | –0.181 to –0.067 | <0.0001 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | –0.177 | 0.057 | –0.290 to –0.065 | 0.002 |
Term sample b | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.103 | 0.067 | –0.237 to 0.030 | 0.128 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | e | e | e | e |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | e | e | e | e |
Total sample f | ||||
DSM–IV clinical diagnosis | ||||
No (reference group) | ||||
Yes | –0.121 | 0.030 | –0.180 to –0.063 | <0.0001 |
Moderate cognitive impairmentc | ||||
No (reference group) | ||||
Yes | –0.123 | 0.028 | –0.178 to –0.068 | <0.0001 |
Severe cognitive impairmentd | ||||
No (reference group) | ||||
Yes | –0.171 | 0.058 | –0.285 to –0.057 | 0.004 |
Discussion
Main findings
This paper augments the limited published evidence on the economic consequences of childhood psychiatric disorders. Reference McCrone, Knapp and Fombonne16–Reference Petrou and Kupek20 Its unique contribution is twofold. First, it focuses on a broader range of childhood psychiatric disorders than has hitherto been studied by health economists within one sample, from relatively rare tic disorders to more common emotional and behavioural disorders, such as ADHD. Second, it reports both cost and preference-based health-related quality of life (or health utility) outcomes for these disorders. In the process, it provides a broader set of data inputs for directly or indirectly informing resource-allocation decisions than has hitherto been provided.
The study revealed an average annual cost difference of over £2000 across the pooled study population between children with and without a psychiatric disorder as defined by DSM–IV–TR criteria. This exceeds that identified for several other childhood conditions, Reference Ungar54 including childhood asthma Reference To, Dell, Dick and Cicutto55 and juvenile idiopathic arthritis, Reference Thornton, Lunt, Ashcroft, Baildam, Foster and Davidson56 and compares with additional annual cost burdens reported elsewhere of £890 (1996–7 prices) for childhood depression, Reference McCrone, Knapp and Fombonne16 e2748 (2003 prices) for separation anxiety Reference Bodden, Dirksen and Bögels17 and between US$1100 and 1800 (1996 prices) for ADHD. Reference Leibson and Long18 The study also revealed mean differences in the HUI3 and UK HUI2 multi-attribute utility scores of 0.192 and 0.118, respectively, across the pooled study population between these comparison groups, which far exceeds the 0.03 minimally important difference in utility score postulated in the literature as clinically important for evaluative purposes. Reference Drummond57 Notably, the difference in the mean HUI3 multi-attribute utility scores between children with (0.698) and without (0.890) a psychiatric disorder can be interpreted as a difference between being in a state of severe overall disability rather than a mild overall disability according to the classification of HUI3 multi-attribute utility scores published by the HUI developers. Reference Feeny, Furlong, Saigal and Sun58
Strengths and limitations
The study population was drawn from participants in the EPICure study, a whole-population longitudinal study of all infants born extremely preterm in the UK and Republic of Ireland over a 10-month period and a contemporaneous classroom control group born at full term and matched for age, gender and ethnic group. As such, the study population consists of two distinct groups of children: one that can be characterised as at high risk for psychiatric disorders and a more representative general population sample. The two groups were analysed separately for the purposes of our empirical investigation. Of particular note were the mean adjusted additional costs of £915 and £4852, the mean adjusted HUI3 utility decrements of 0.205 and 0.342 and the mean adjusted UK HUI2 utility decrements of 0.124 and 0.177 associated with cognitive impairment and severe cognitive impairment, respectively, in the extremely preterm children. Analyses of the term-born children were restricted by the limited number of children in this population who were diagnosed with a psychiatric disorder (n = 11). Nevertheless, separate economic results for this population are presented for completeness. Additionally, analyses of the pooled study population controlled for clinical and sociodemographic confounders, including gestational age at birth and, alternatively, either a measure of neurosensory or motor impairment or an interaction term between gestational age at birth and psychiatric disorder. Consequently, we adopted a strategy that disentangled the effects of psychiatric disorders on economic outcomes from those that might be attributable to comorbidities. The study has a number of other strengths. Multi-informant psychiatric data were collected using the DAWBA for all children, rather than a subset identified at high risk, and diagnoses were made by consensus between two expert clinical raters who were masked to group allocation. The DAWBA has excellent reliability and validity Reference Goodman, Ford, Richards, Gatward and Meltzer28 and was the principal measure of psychopathology in the British mental health surveys. Reference Green, McGinnity, Meltzer, Ford and Goodman59 Cognitive ability was assessed by psychologists who achieved ≥95% interrater reliability across standardised tests, and neurosensory function was evaluated by experienced paediatricians. As loss to follow-up was more common among high-risk children, multiple imputation was used to estimate psychiatric diagnoses for 10 of the 331 study children for whom multi-informant psychiatric diagnoses were not performed. Reference Johnson, Hollis, Kochhar, Hennessy, Wolke and Marlow25 Other strengths of the study include validated and reliable approaches to measuring and valuing costs and preference-based health-related quality of life outcomes during childhood and a comprehensive analytical strategy.
There are a number of caveats to the study findings. First, the children included in the EPICure study, but excluded from our analyses because of loss to follow-up, were more likely to have had lower cognitive scores or cognitive impairment at 2.5 and 6 years. This suggests that we might have underestimated the true extent of psychiatric disorders in the study population. Second, the study population was too small to present cost and utility estimates at a granulated level for all childhood psychiatric disorders. A number of disorders, such as panic disorder, agoraphobia, obsessive–compulsive disorder, elective mutism, disinhibited attachment disorder of childhood, reactive attachment disorder, eating disorder, schizophrenia, manic episodes, ADHD hyperactive–impulsive subtype and Asperger syndrome, were not diagnosed in our study population. In addition, individual disorders had to be grouped into relatively broad categories, which might reflect disparate experiences in terms of resource utilisation and health-related quality of life. A much larger study population would be required to estimate costs and health utilities with sufficient statistical power for all childhood psychiatric disorders. Indeed, McClellan and colleagues have argued that a longitudinal study of 10 000 children, consisting of 5000 characterised as at high risk of neurodevelopmental disabilities and 5000 randomly selected from the entire childhood population, is required to generate subtle information for the broad spectrum of conditions. Reference McClellan, Bresnahan, Echeverria, Knox and Susser60 A third caveat to the study findings is that the analysis of cost differences was conducted from a public sector perspective and encompassed costs of health, social and education services. It is likely that many psychiatric disorders have an impact on other sectors of the economy and on families and carers, Reference McCrone, Knapp and Fombonne16 suggesting that adopting a broader perspective would increase the cost differences between the study groups. A fourth caveat is that our cost estimates are based on parental reports of their child's resource utilisation over the previous year of life. Previous research has indicated that parents accurately recall their child's hospital service utilisation over extended periods when validated against medical records, but tend to underreport their child's community service utilisation. Reference Petrou, Murray, Cooper and Davidson61 If this were the case for our study our absolute costs for community service utilisation may be underestimates. A fifth caveat is that, given the large number of children in our study with serious cognitive impairment and learning difficulties, the main parent rather than the child was considered the appropriate person to complete the HUI. Empirical evidence of the concordance between child and parent ratings of attributes of children's health-related quality of life suggests that parents are able to accurately rate observable behaviours, such as physical functioning and physical symptoms, but are less successful at identifying social or emotional impairments. Reference Petrou62,Reference Verrips, Stuifbergen, den Ouden, Bonsel, Gemke and Paneth63 However, there is no consistent evidence to suggest that parents consistently either underreport or overreport social or emotional impairments, Reference Eiser and Morse64 which suggests that there are unlikely to be systematic biases in the measurement of health-related quality of life in our study. A final caveat is that the underlying preference weights for the HUI3 and UK HUI2 multi-attribute utility measures have been derived from surveys of adults rather than of children. Reference Feeny, Furlong, Torrance, Goldsmith, Zhu and DePauw44–Reference McCabe, Stevens, Roberts and Brazier46 The cognitive requirements entailed in directly estimating preference weights among our study population, many of whom had developmental disabilities, was considered too burdensome. Nevertheless, our approach of valuing health outcomes using population-based preferences is in line with the recommendations of many decision-making bodies, such as the National Institute for Health and Clinical Excellence in England and Wales. 36
Implications
How might the results of our study be used? Given recent evidence of the increasing incidence of some childhood psychiatric disorders, Reference McClellan, Bresnahan, Echeverria, Knox and Susser60 it is imperative that clinical decision-makers and budgetary and service planners recognise the overall economic impact of each condition in their service planning, as well as the potential contribution of clinical and sociodemographic factors to economic outcomes. More pertinently, in our opinion, our mean cost and utility estimates and their associated distributions can act as data inputs for cost-effectiveness models of preventive or treatment interventions for childhood psychiatric disorders. Economic analysts who construct decision-analytic models are often faced with estimating costs and health utilities for a large number of health conditions or states with limited resources or time. Under these circumstances our catalogue can be viewed as a significant new resource that can act as data inputs or be pooled with the totality of the existing evidence base. It should be noted, however, that analysts may face a particular methodological challenge when the time horizon for the cost-effectiveness model spans the entire period of childhood or further into adulthood. Under these circumstances, the impact of age on costs and health utilities should be estimated from data gathered in large-scale longitudinal studies as they become available. When such data are not available, techniques such as meta-regression of data across a number of studies should be considered as a means of disentangling the impact of age.
Acknowledgements
We are indebted to the EPICure Study Group, which includes paediatricians in 276 maternity units in the UK and Republic of Ireland who identified the original cohort, contributed perinatal data and whose help was invaluable. We would also like to thank the children who participated in the EPICure Study and the parents who completed the relevant research instruments.
eLetters
No eLetters have been published for this article.