Unipolar major depression is predicted to become the second leading cause of disease burden worldwide by the year 2020, accounting for 12% of disability-adjusted life-years (Reference Murray and LopezMurray & Lopez, 1996). Depression is comorbid with other psychiatric harm, including substance use disorders (Reference Rohde, Lewinsohn and SeeleyRohde et al, 1991; Reference GilvarryGilvarry, 2000), and anxiety and personality disorders (Reference Andrews, Henderson and HallAndrews et al, 2001). Personality, low competence, vulnerability to stress, and marital, occupational, financial and neighbourhood stressors are some of the risk factors for depression, and can be used to identify potential target groups for prevention (Reference Gillham, Shatte and FreresGillham et al, 2000).
Can depression be prevented?
Since the 1980s there has been increased development and implementation of universal, selective and indicated programmes (Reference Mrazek and HaggertyMrazek & Haggerty, 1994) that aim to reduce risk factors for depression, depressive symptoms and depressive disorders. Universal prevention interventions target a whole population group that has not been identified on the basis of increased risk; selective prevention targets subgroups of population whose risk of developing a mental disorder is significantly higher than average, as evidenced by biological, psychological or social risk factors; indicated prevention targets high-risk persons who are identified as having minimal symptoms foreshadowing mental disorder or biological markers indicating predisposition for mental disorder but do not meet diagnostic criteria for disorder at that time. Although there is substantial evidence that depressive symptoms can be reduced (Reference Munoz, Ying and Perez-StableMuñoz et al, 1993; Reference Gillham, Shatte and FreresGillham et al, 2000), only a few programmes have shown that depression can be prevented (Clarke et al, Reference Clarke, Hawkins and Murphy1995,Reference Clarke, Hawkins and Murphy1995, Reference Clarke, Hornbrook and Lynch2001). This meta-analysis aims to identify some of the population, programme and research design characteristics that predict the effect of primary prevention programmes targeting depression.
METHOD
Search procedures and inclusion criteria
Trials were identified through systematic literature searches in four databases (Current Contents, ERIC, Medline, PsycINFO), published meta-analyses, review articles, reference lists and through contact with members of the Society for Prevention Research. Key words for the literature searches were divided into five groups (details available from the author upon request). Two groups aimed to identify studies dealing with depression and mental health: resilience and other protective mental health factors related to depression (e.g. MASTERY, SELF-ESTEEM), and specific risk factors for depression (e.g. NEGATIVE THINKING, DYSTHYMIA). Three groups were defined to restrict the hits to the inclusion criteria: prevention and outcome measures (e.g. UNIVERSAL, SELECTIVE, DEPRESSIVE SYMPTOMS), evaluation (e.g. EFFICACY, CONTROL GROUP) and research design (e.g. RANDOMISED CONTROLLED TRIAL).
Studies were selected within the definitions of universal, selective and indicated prevention, excluding any pharmacological intervention, if they included the prevention of depression as a primary or secondary goal or outcome, the improvement of protective factors for depression or mental health (e.g. self-esteem) or the reduction of risk factors related to depression (e.g. negative thinking). Selection was restricted to English-language publications between the years 1985 and 2000 that could be retrieved through the library system, had either a randomly allocated control or equivalent comparison group, had pre–post measures, had objective outcome measures and had sufficient statistical information to calculate an effect size. From the selected trials, only those that had depressive symptoms or incidence of depression as an outcome measure were included for analysis.
Coding system and procedures
A coding instrument was developed to operationalise programme outcomes and hypothesised effect predictors. The coding instrument included trial descriptors, target group characteristics, programme characteristics, programme development characteristics, implementation characteristics, quality of the research design, and outcome indicators. The coding system comprised a code book with codes for each variable, a coding sheet to operationalise the information, and a coding instructions book with definitions and instructions (details available from the author upon request).
A trained coder and E.J.-L. undertook the coding process. Measures to minimise bias were taken into account, such as coders’ training (Reference Cooper and HedgesCooper & Hedges, 1994). A random sample of one in five trials was double-coded to assess interrater reliability. The kappa coefficient averaged across codes was 0.91, indicating excellent agreement beyond chance between the two coders (Reference Cooper and HedgesCooper & Hedges, 1994). Data were entered into a Statistical Package for the Social Sciences (Version 10) data file, checked and cleaned to control for data entry and coding errors.
Calculation of effect sizes and weighted effect sizes
An effect size estimate using the standardised mean difference (Reference Hedges and OlkinHedges & Olkin, 1985: p. 79, formula 3) was calculated from the published data for every outcome measure reported, corrected for pre-test measures and small sample sizes (Reference Lipsey and WilsonLipsey & Wilson, 2001). Positive effect sizes indicate improvement for the intervention group. Each effect size was weighted based on the inverse of its variance (Reference Hedges and OlkinHedges & Olkin, 1985: p. 86, formula 15). For multiple programmes within one study, weights were calculated according to Gleser & Olkin (Reference Gleser, Olkin, Cooper and Hedges1994: p. 346, formula 22.13). Weighted mean effect sizes (Reference Hedges and OlkinHedges & Olkin, 1985: p. 111, formula 6) and 95% confidence intervals were calculated (Reference Hedges and OlkinHedges & Olkin, 1985: p. 86, formula 16).
Unit of analysis and sample heterogeneity
The unit of analysis in this study is at the programme level, where a programme was defined as an intervention with a preventive goal and a measure of depressive symptoms. When the efficacy of more than one programme within a study was compared with a control condition, the different interventions were treated independently and an effect size was calculated for each (Reference Gleser, Olkin, Cooper and HedgesGleser & Olkin, 1994: p. 346, formula 22.13). Effect sizes were averaged within each programme across outcome measures and follow-up times, resulting in one effect size per programme.
The Q statistic was calculated to test for heterogeneity (Reference Hedges and OlkinHedges & Olkin, 1985). The sample was heterogeneous (Q=474.72, d.f.=69, P<0.001) and characteristics of the programmes, target groups and research methodology were examined as independent variables to account for this heterogeneity (Reference Lipsey and WilsonLipsey & Wilson, 2001).
Predictors in the study
Gender was coded as a continuous variable, as the percentage of male participants in the programmes. Initial level of risk was defined as universal, selective or indicated (Reference Mrazek and HaggertyMrazek & Haggerty, 1994). The duration of programmes was coded in months, and the length of individual programme sessions in minutes. The quality of the research design was assessed with the Cochrane nine-item dichotomous scale (items scores 1 or 0); high-quality programmes were considered to be those with a score of 8 or above (Reference Brown, Berndt and BrinalesBrown et al, 2000). Intervention methods were classified into one of five groups: behaviour (e.g. behaviour change, pleasant activities, modelling); cognition (e.g. cognitive restructuring, counselling, explanatory-style training); competence (e.g. broad skill training, social resistance skills); education (e.g. direct instruction, lectures and workshops); and social support (e.g. network building, fostering socialisation). Programme providers were divided into healthcare personnel (physical and mental health professionals) and lay personnel (peers, family members, schoolteachers).
Statistical analyses to test hypotheses
The z-scores were used as significance tests to compare the weighted mean effect sizes of the different values of the categorical independent variables. Weighted least squares regression analyses were used to identify possible relationships and interactions between predictors in explaining the variation in effect sizes between studies for continuous and categorical independent variables, recoded to dummy variables. Each unweighted programme effect size was corrected by its weight as defined above. Regression coefficients were obtained and the adjusted R 2 was used to measure the proportion of variance accounted for by an independent variable. The standard errors of the regression coefficients (B) were corrected according to Hedges & Olkin (Reference Hedges and Olkin1985: p. 174) and used in a z test. Separate regression models were built, first testing for main effects and then for interaction effects of the target group characteristics gender, age and level of risk, which had been identified in previous research as possible moderators of effect (Reference Price, Van Ryn and VinokurPrice et al, 1992,Reference Price, Van Ryn and VinokurPrice et al, 1992; Reference Gillham, Shatte and FreresGillham et al, 2000). Scatter plots and plots of residuals found no evidence for violation of regression model assumptions. All the tests for statistical significance were based on two-tailed tests.
RESULTS
Trial flow and distribution of effect sizes
The searches identified 1474 publications. Screening on the basis of titles and abstracts reduced the number of publications according to the inclusion criteria to 201. Detailed inspection of the retrieved 201 publications reduced the sample to 108 studies that reported sufficient information to calculate an effect size (excluding those with missing data) and that had either a randomly allocated control or equivalent comparison group. From the 108 studies, only 54 trials reported an outcome measure for depressive symptoms, 11 of which compared more than one type of programme, resulting in 69 programmes for analysis. Table 1 provides a description of the included programmes. The effect sizes of the 69 programmes were normally distributed. The unweighted mean effect size was 0.25 (range 71.08 to 1.8; 95% CI 0.16–0.35) and the weighted mean effect size was 0.22 (95% CI 0.14–0.30).
Study | Intervention provided | Age group and age range (years) | Male participants (%) | Level of risk | Programme duration (months) | Number of sessions | Length of session (min) | Intervention methods | Programme providers | Quality of research design (score) | Mean effect size |
---|---|---|---|---|---|---|---|---|---|---|---|
Emery et al (Reference Emery, Schein and Hauck1998) | Education and stress management | Elderly | 47 | Selective | 2-3 | > 8 | 60-90 | Educational | High (8) | -0.05 | |
Emery et al (Reference Emery, Schein and Hauck1998) | Exercise, education and stress management | Elderly | 47 | Selective | 2-3 | > 8 | 60-90 | Behavioural | High (8) | 0.33 | |
Muñoz et al (Reference Muñoz, Ying and Bernal1995) | Coping with depression, primary care | Adults | 36 | Selective | 2-3 | ≤ 8 | > 90 | Cognitive and competence | HP only | High (9) | 0.28 |
Price et al (Reference Price, Van Ryn and Vinokur1992,Reference Price, Van Ryn and Vinokur1992) | Job search intervention | Adults | Selective | <2 | ≤ 8 | > 90 | Behavioural, cognitive, competence and social support | Lay only | High (8) | 0.20 | |
Aronen & Kurkela (Reference Aronen and Kurkela1996) | Home-based counselling | Children | Universal | > 3 | > 8 | Cognitive, competence and social support | HP only | Low (7) | 0.40 | ||
Eggert et al (Reference Eggert, Thompson and Herting1995) | Personal growth class (one semester) | Adolescents | 42 | Indicated | > 3 | > 8 | <60 | Behavioural, competence and social support | HP only | Low (7) | 0.11 |
Eggert et al (Reference Eggert, Thompson and Herting1995) | Personal growth class (two semesters) | Adolescents | 42 | Indicated | > 3 | > 8 | <60 | Behavioural, competence and social support | HP only | Low (7) | -0.10 |
Stolberg & Mahler (Reference Stolberg and Mahler1994) | Skills and support | Children 8-12 | 45 | Selective | > 3 | > 8 | Behavioural, educational and social support | Both | High (8) | 0.4 | |
Stolberg & Mahler (Reference Stolberg and Mahler1994) | Support | Children 8-12 | 45 | Selective | > 3 | > 8 | Behavioural, educational and social support | Both | High (8) | 0.08 | |
Stolberg & Mahler (Reference Stolberg and Mahler1994) | Transfer skills, support | Children 8-12 | 45 | Selective | > 3 | > 8 | Behavioural, educational and social support | Both | High (8) | 0.26 | |
Clarke et al (Reference Clarke, Hawkins and Murphy1995,Reference Clarke, Hawkins and Murphy1995) | Coping with depression course | Adolescents | 30 | Indicated | <2 | > 8 | <60 | Behavioural, cognitive, educational and social support | HP only | High (9) | 0.18 |
Jaycox et al (Reference Jaycox, Reivich and Gillham1994) | Penn prevention programme | Children 10-13 | 54 | Selective | 2-3 | > 8 | 60-90 | Behavioural cognitive and educational | HP only | High (8) | 0.38 |
Short (Reference Short1998) | Stress management and alcohol awareness programme | Children 10-13 | 47 | Selective | > 8 | Behavioural, cognitive, competence, social support | HP only | Low (5) | 0.22 | ||
Seligman et al (Reference Seligman, Schulman and DeRubeis1999,Reference Seligman, Schulman and DeRubeis1999) | Workshop based on cognitive therapy | Adults | 48 | Indicated | 2-3 | ≤ 8 | > 90 | Behavioural, educational and social support | HP only | High (9) | 0.16 |
Cuijpers (Reference Cuijpers1998) | Coping with chronic illness | Adults 30-60 | 24 | Indicated | > 8 | Cognitive | Low (7) | 1.81 | |||
Barth (Reference Barth1991) | Child—parent enrichment project | Adults | 0 | Selective | > 3 | > 8 | Behavioural, educational and social support | Lay only | High (9) | 0.01 | |
Foa et al (Reference Foa, Hearst-Ikeda and Perry1995) | PTSD intervention | Adults | 0 | Indicated | <2 | ≤ 8 | > 90 | Behavioural, cognitive and competence | Lay only | High (8) | 0.84 |
DiLorenzo et al (Reference DiLorenzo, Bargman and Stucky-Ropp1999) | Aerobic fitness programme | Adults 18-39 | 32 | Universal | 2-3 | > 8 | <60 | Behavioural | Both | Low (7) | 0.74 |
Haight (Reference Haight1992) | Life review | Elderly 61-99 | 22 | Universal | <2 | ≤ 8 | 60-90 | Cognitive and social support | Lay only | High (9) | 0.32 |
Hains (Reference Hains1992) | Anxiety management training | Adolescents 15-16 | 100 | Universal | > 8 | <60 | Behavioural, cognitive and competence | Both | High (8) | 0.75 | |
Hains (Reference Hains1992) | Cognitive intervention | Adolescents 15-16 | 100 | Universal | > 8 | <60 | Behavioural, cognitive and social support | Both | High (8) | 0.66 | |
Zubernis et al (Reference Zubernis, Cassidy and Gillham1999) | Depression prevention programme for children | Children | 51 | Selective | 2-3 | > 8 | 60-90 | Cognitive and competence | HP only | Low (7) | 0.61 |
Heller et al (Reference Heller, Thompson and Trueba1991) | Peer support telephone dyads: telephone contact discontinued | Elderly | 0 | Selective | 2-3 | > 8 | Behavioural | Lay only | Low (7) | -0.15 | |
Heller et al (Reference Heller, Thompson and Trueba1991) | Peer support telephone dyads: dyad initiators | Elderly | 0 | Selective | 2-3 | > 8 | Behavioural | Lay only | Low (7) | -0.23 | |
Heller et al (Reference Heller, Thompson and Trueba1991) | Peer support telephone dyads: dyad receivers | Elderly | 0 | Selective | 2-3 | > 8 | Behavioural | Lay only | Low (7) | -0.21 | |
Heller et al (Reference Heller, Thompson and Trueba1991) | Peer support telephone dyads: dyad refusers | Elderly | 0 | Selective | 2-3 | > 8 | Behavioural | Lay only | Low (7) | -0.12 | |
Heller et al (Reference Heller, Thompson and Trueba1991) | Peer support telephone dyads: telephone staff contact continues | Elderly | 0 | Selective | 2-3 | > 8 | Behavioural | Lay only | Low (7) | -0.14 | |
Barth et al (Reference Barth, Hacking and Ash1988) | Child—parent enrichment project | Adults | 0 | Selective | > 3 | > 8 | Behavioural, educational and social support | Lay only | High (8) | 0.08 | |
Skitka & Frazier (Reference Skitka and Frazier1995) | Support group for children of divorce | Children 6-12 | 42 | Selective | 2-3 | > 8 | <60 | Educational and social support | Lay only | Low (6) | -0.09 |
Astin (Reference Astin1997) | Stress reduction | Adolescents | 5 | Universal | 2-3 | ≤ 8 | > 90 | Behavioural, competence and social support | Lay only | Low (5) | -1.09 |
Gelfand et al (Reference Gelfand, Teti and Seiner1996) | Home-based intervention | Adults 19-45 | 0 | Indicated | > 3 | > 8 | Behavioural, cognitive, competence, educational and social support | HP only | Low (7) | 0.3 | |
Haight (Reference Haight1988) | Life review | Elderly 73-79 | 22 | Universal | <2 | ≤ 8 | 60-90 | Cognitive and social support | Lay only | High (8) | 0.54 |
Irvine et al (Reference Irvine, Biglan and Smolkowski1999) | Parenting skills intervention | Children | 61 | Selective | 2-3 | > 8 | > 90 | Behavioural, competence and social support | Lay only | High (9) | 0.22 |
Klein et al (Reference Klein, Wing and Simkin-Silverman1997) | Lifestyle intervention | Adults 44-50 | 0 | Universal | > 3 | > 8 | Behavioural and educational | High (8) | 0.22 | ||
Levy et al (Reference Levy, Derby and Martinkowski1993) | Bereavement support group | Adults | 28 | Selective | Social support | Low (6) | 0.06 | ||||
Ostwald et al (Reference Ostwald, Hepburn and Caron1999) | Minnesota family workshop | Adults | 44 | Selective | <2 | ≤ 8 | > 90 | Behavioural, competence, educational and social support | Both | High (9) | 0.25 |
Elderen-van-Kemenade et al (Reference Elderen-van-Kemenade, Maes and Van den Broek1994) | Health education | Adults 33-69 | 82 | Selective | > 8 | Behavioural, cognitive and educational | HP only | Low (7) | 0.22 | ||
Dracup et al (Reference Dracup, Moser and Taylor1997) | CPR training | Adults | 17 | Selective | ≤ 8 | 60-90 | Behavioural and competence | HP only | High (8) | -0.05 | |
Dracup et al (Reference Dracup, Moser and Taylor1997) | CPR training and education | Adults | 17 | Selective | ≤ 8 | > 90 | Behavioural, competence, educational and social support | HP only | High (8) | 0.06 | |
Dracup et al (Reference Dracup, Moser and Taylor1997) | CPR training and social support | Adults | 17 | Selective | Behavioural, competence and social support | HP only | High (8) | 0.10 | |||
Mignone & Guidotti (Reference Mignone and Guidotti1999) | Support group | Adults 20-69 | 38 | Selective | 2-3 | ≤ 8 | > 90 | Social support | Low (7) | -0.03 | |
Grey et al (Reference Grey, Boland and Davidson1998) | Coping skills training and diabetes management | Adolescents 13-20 | 43 | Selective | 2-3 | ≤ 8 | 60-90 | Behavioural, cognitive and competence | HP only | High (8) | 0.48 |
McCallion et al (Reference McCallion, Toseland and Freeman1999) | Family visit education plan | Elderly | 21 | Indicated | 2-3 | ≤ 8 | 60-90 | Behavioural, competence and social support | HP only | High (8) | 0.07 |
Goodkin et al (Reference Goodkin, Blaney and Feaster1999) | Bereavement intervention | Adults 18-65 | 100 | Indicated | 2-3 | > 8 | 60-90 | Competence and social support | HP only | Low (7) | 0.26 |
Worrall et al (Reference Worrall, Angel and Chaulk1999) | Educational intervention | Adults | Universal | ≤ 8 | > 90 | Behavioural, educational and social support | HP only | High (8) | 0.34 | ||
Burns et al (Reference Burns, Nichols and Martindale-Adams2000) | Geriatric evaluation and management | Elderly 66-95 | 97 | Indicated | HP only | High (8) | 0.48 | ||||
Bisson et al (Reference Bisson, Jenkins and Alexander1997) | Psychological debriefing | Adults 16-65 | 75 | Indicated | ≤ 8 | <60 | Behavioural, cognitive and educational | HP only | High (8) | -0.26 | |
Phillips (Reference Phillips2000) | Education group | Elderly 67-75 | 40 | Selective | 2-3 | > 8 | 60-90 | Competence, educational and social support | HP only | High (8) | 1.44 |
Trzcieniecka-Green & Steptoe (Reference Trzcieniecka-Green and Steptoe1996) | Stress management | Adults | 87 | Selective | 2-3 | > 8 | Behavioural, competence, educational and social support | HP only | Low (7) | 0.40 | |
Chang (Reference Chang1999) | Nurse line video-assisted modelling programme | Adults | 0 | Selective | 2-3 | ≤ 8 | Behavioural, cognitive, competence, educational and social support | HP only | High (9) | 0.05 | |
Partonen et al (Reference Partonen, Leppaemaeki and Hurme1998) | Exercise training | Adults 22-57 | 22 | Selective | 2-3 | > 8 | 60-90 | Behavioural | HP only | High (8) | 0.22 |
Partonen et al (Reference Partonen, Leppaemaeki and Hurme1998) | Exercise training and bright light | Adults 22-57 | 22 | Selective | 2-3 | > 8 | 60-90 | Behavioural | HP only | High (8) | 0.73 |
Lorig et al (Reference Lorig, Gonzalez and Ritter1999) | Arthritis self-management programme | Adults | 17 | Indicated | <2 | ≤ 8 | > 90 | Behavioural and educational | Lay only | High (9) | 0.33 |
Harris et al (Reference Harris, Bausell and Scott1998) | Peer counselling and leadership training | Adults | 0 | Selective | > 3 | > 8 | 60-90 | Behavioural, cognitive, competence and educational | HP only | High (9) | 0.31 |
Allart et al (Reference Allart, Hosman and Hoogduin2003) | ‘Coping with depression’ course | Adults 18-65 | 39 | Indicated | 2-3 | > 8 | > 90 | Behavioural, cognitive, competence and educational | HP only | High (9) | 0.54 |
Kahan et al (Reference Kahan, Kemp and Staples1985) | Education, caregiver intervention | Adults 16-77 | Selective | 2-3 | ≤ 8 | > 90 | Behavioural, competence, educational and social support | HP only | Low (6) | 0.57 | |
Lovett & Gallagher (Reference Lovett and Gallagher1988) | Life satisfaction | Adults | 17 | Selective | 2-3 | > 8 | > 90 | Behavioural and cognitive | HP only | High (9) | 0.29 |
Lovett & Gallagher (Reference Lovett and Gallagher1988) | Problem solving | Adults | 17 | Selective | 2-3 | > 8 | > 90 | Cognitive | HP only | High (9) | 0.22 |
LaFromboise & Howard-Pitney (Reference LaFromboise and Howard-Pitney1995) | Skills curriculum | Adolescents 14-19 | 36 | Selective | > 3 | > 8 | Behavioural, competence and educational | Lay only | High (8) | 0.44 | |
Gross & McCaul (Reference Gross and McCaul1992) | Psychoeducational, for children of substance misusers | Children 11-18 | 47 | Selective | > 3 | > 8 | 60-90 | Behavioural, cognitive, competence, educational and social support | HP only | Low (7) | 0.59 |
King & Kirschenbaum (Reference King and Kirschenbaum1990) | Wisconsin early intervention — full service | Children | 61 | Indicated | > 3 | > 8 | <60 | Behavioural, cognitive competence and educational | Lay only | High (8) | 0.42 |
King & Kirschenbaum (Reference King and Kirschenbaum1990) | Wisconsin early intervention — partial service | Children | 61 | Indicated | Behavioural and educational | Lay only | High (8) | 0.02 | |||
Gwynn & Brantley (Reference Gwynn and Brantley1987) | Divorce group intervention | Children 9-11 | 50 | Selective | 2-3 | ≤ 8 | Behavioural, cognitive, educational and social support | Low (4) | -0.52 | ||
Hains & Ellmann (Reference Hains and Ellmann1994) | Stress inoculation training | Adolescents | 76 | Universal | > 8 | <60 | Behavioural, cognitive, competence and social support | HP only | Low (7) | 0.28 | |
Peters & Carlson (Reference Peters and Carlson1999) | Stress management training | Adults | 60 | Selective | > 3 | > 8 | 60-90 | Behavioural, cognitive, competence and educational | Both | High (8) | 0.58 |
Cunningham et al (Reference Cunningham, Bremner and Boyle1995) | Clinic-based parent training | Children | Selective | 2-3 | > 8 | 60-90 | Behavioural, cognitive, competence and social support | HP only | High (8) | 0.06 | |
Cunningham et al (Reference Cunningham, Bremner and Boyle1995) | Community-based parent training | Children | Selective | 2-3 | > 8 | 60-90 | Behavioural, cognitive, competence and social support | HP only | High (8) | -0.02 | |
Sheeber & Johnson (Reference Sheeber and Johnson1994) | Parent training | Children 3-5 | 60 | Selective | 2-3 | > 8 | > 90 | Behavioural, competence, educational, social support | Lay only | High (8) | 0.7 |
Wolfe et al (Reference Wolfe, Edwards and Manion1988) | Parent intervention | Adults 16-25 | 0 | Selective | 60-90 | Behavioural, cognitive competence, educational and social support | Both | High (9) | 0.34 |
Participant characteristics
About a quarter of the programmes (16) targeted children, 9 targeted adolescents, almost a half (32) were aimed at adults and 12 were for older people (Table 2). There was no significant difference in effect size between the different age groups. There was also no significant difference in effect size between universal, selective and indicated programmes. Of the 63 programmes in which gender distribution was specified, weighted least squares regression analyses indicated a direct positive relationship between percentage of male participants and effect size (Table 3). There was an interaction between percentage of male participants and level of risk, so that the relationship between percentage of males in the programme and effect size was present for universal and selective programmes, but not for indicated programmes (Table 3).
Variable | Number of programmes | Weighted mean effect size | 95% CI |
---|---|---|---|
Age | |||
Children (0-14 years) | 16 | 0.21 | 0.09 to 0.32 |
Adolescents (15-18 years) | 9 | 0.19 | 0.007 to 0.38 |
Adults (19-65 years) | 32 | 0.21 | 0.15 to 0.28 |
Elderly (> 65 years) | 12 | 0.24 | 0.12 to 0.37 |
Gender | |||
Male only | 3 | 0.38 | 0.01 to 0.75 |
Female only | 13 | 0.08 | -0.01 to 0.18 |
Both | 47 | 0.26 | 0.20 to 0.32 |
Not reported | 6 | 0.24 | 0.10 to 0.38 |
Programme | |||
Universal | 10 | 0.30 | 0.18 to 0.43 |
Selective | 44 | 0.19 | 0.12 to 0.25 |
Indicated | 15 | 0.23 | 0.12 to 0.34 |
Independent variables | Model 1: percentage of male participants | Model 2: percentage of male participants, level of risk |
---|---|---|
Percentage of male participants | ||
β′ | 0.244 | |
B | 0.0030 | |
s.e. | 0.0010 | |
z-score | 2.99 | |
P | 0.002 | |
Universal programme × percentage of male participants | ||
β′ | 0.181 | |
B | 0.0064 | |
s.e. | 0.0031 | |
z-score | 2.04* | |
P | 0.045 | |
Selective programme × percentage of male participants | ||
β′ | 0.421 | |
B | 0.0059 | |
s.e. | 0.0014 | |
z-score | 4.14*** | |
P | 0.000 | |
Indicated programme × percentage of male participants | ||
β′ | -0.110 | |
B | -0.0015 | |
s.e. | 0.0019 | |
z-score | -0.806 | |
P | 0.423 | |
Adjusted R 2 | 0.044 | 0.082 |
Programme characteristics
Time descriptors
Programmes with more than eight sessions were significantly better than those with eight sessions or fewer. Programmes with session lengths of 60–90 min were significantly better than those with sessions lasting less than 60 min or longer than 90 min. No significant difference was found for duration of programmes or distribution of sessions (Table 4).
Number of programmes | Weighted mean effect size | 95% Cl | z 1 | P | |
---|---|---|---|---|---|
Programme duration (months) | |||||
<2 | 7 | 0.27 | 0.13 to 0.41 | ||
2-3 | 32 | 0.22 | 0.15 to 0.30 | ||
> 3 | 15 | 0.23 | 0.13 to 0.33 | ||
Total | 54 | ||||
Number of sessions | |||||
1-8 | 19 | 0.14 | 0.05 to 0.23 | ||
> 8 | 45 | 0.26 | 0.19 to 0.32 | 2.05* | 0.045 |
Total | 64 | ||||
Session distribution | |||||
More than once a week | 7 | 0.31 | 0.13 to 0.48 | ||
Once a week or less | 37 | 0.30 | 0.23 to 0.37 | ||
Total | 44 | ||||
Length of session (min) | |||||
<60 | 10 | 0.14 | -0.01 to 0.31 | 2.40* | 0.016 |
60-90 | 19 | 0.38 | 0.27 to 0.49 | ||
> 90 | 16 | 0.24 | 0.15 to 0.33 | 2.00* | 0.045 |
Total | 45 |
Programme providers
There were 102 programme providers reported in 62 programmes (Table 5). Programmes that used a combination of health care professionals and lay personnel had the largest effect sizes. Programmes provided by health care professionals (physical and mental health personnel) and those provided by both health care and lay personnel yielded significantly larger effect sizes than programmes provided by lay personnel alone. Programmes provided by health care professionals had larger effect sizes than programmes run by lay personnel only for selective (z=2.04, P=0.045) and indicated populations (z=2.37, P=0.016), but the difference was not significant for universal populations (z=1.90, P=0.057).
Programme providers | Number of programmes | Weighted mean effect size | 95% Cl | z | P |
---|---|---|---|---|---|
Health professionals only | 34 | 0.28 | 0.20-0.35 | 2.88** | 0.003 |
Lay personnel only | 19 | 0.10 | 0.01-0.20 | ||
Both (lay and health personnel) | 9 | 0.42 | 0.23-0.62 | 2.85** | 0.004 |
Methods and techniques
Programmes that involved a competence enhancement component yielded the largest effect sizes, whereas programmes including behavioural methods yielded the lowest effect sizes (Table 6). When analysed by age group, the worse performance of programmes that included behavioural methods was present for all age groups, although it was significant only for the older population (with a behavioural component, the weighted effect size (WES) was –0.10; without, WES=0.95; z=7.14, P<0.001). Programmes that included competence enhancement techniques did significantly better than those that did not include them. Programmes that included social support did generally worse than those that did not, except for the older group, for which social support programmes yielded larger weighted effect sizes (with social support, WES=0.92; without, WES=–0.12; z=7.13, P<0.001). Programmes that included three or more different types of methods were significantly better than those that included only one or two.
Programme content | Number of programmes1 | Weighted mean effect size | 95% CI | z | P |
---|---|---|---|---|---|
Behavioural techniques | 55 | 0.17 | 0.12-0.23 | 3.46*** | 0.000 |
No behavioural techniques | 13 | 0.42 | 0.29-0.55 | ||
Cognitive techniques | 30 | 0.26 | 0.17-0.34 | 1.28 | 0.193 |
No cognitive techniques | 38 | 0.19 | 0.13-0.25 | ||
Competence techniques | 34 | 0.29 | 0.22-0.36 | 3.15** | 0.001 |
No competence techniques | 34 | 0.13 | 0.06-0.20 | ||
Education methods | 32 | 0.27 | 0.20-0.34 | 2.47* | 0.012 |
No education methods | 36 | 0.14 | 0.07-0.22 | ||
Social support methods | 38 | 0.23 | 0.16-0.29 | 0.57 | 0.548 |
No social support methods | 30 | 0.19 | 0.12-0.27 | ||
One or two methods | 25 | 0.14 | 0.06-0.22 | 2.17* | 0.027 |
Three or more methods | 43 | 0.26 | 0.19-0.32 |
Research methodological characteristics
The programmes with a high quality of research design were significantly more effective than those of low quality (Table 7). Programmes that reported attrition rates were significantly better than those that did not. Programmes rated as having a well-defined intervention were better than those that did not.
n 1 | Weighted mean effect size | 95% CI | z 2 | P 3 | |
---|---|---|---|---|---|
Quality of research design | |||||
High-quality design (65%) | 45 | 0.26 | 0.20 to 0.31 | 2.60** | 0.009 |
Low-quality design (35%) | 24 | 0.11 | 0.01 to 0.20 | ||
Items in Cochrane scale | |||||
Defined aims | |||||
Reported | 66 | 0.22 | 0.17 to 0.27 | ||
Not reported | 3 | -0.08 | -0.38 to 0.21 | ||
Well-defined intervention | |||||
Reported | 54 | 0.24 | 0.19 to 0.30 | 2.17* | 0.027 |
Not reported | 15 | 0.10 | -0.007 to 0.21 | ||
Randomly allocated control | |||||
Reported | 52 | 0.21 | 0.15 to 0.26 | 0.40 | 0.689 |
Not reported | 17 | 0.24 | 0.12 to 0.35 | ||
Equivalent comparison group | |||||
Reported | 51 | 0.21 | 0.16 to 0.26 | 0.25 | 0.764 |
Not reported | 18 | 0.23 | 0.11 to 0.34 | ||
Number of subjects in trial | |||||
Reported | 69 | 0.21 | 0.16 to 0.26 | ||
Pre-intervention data | |||||
Reported | 65 | 0.19 | 0.14 to 0.24 | ||
Not reported | 4 | 0.45 | 0.28 to 0.62 | ||
Post-intervention data | |||||
Reported | 69 | 0.21 | 0.16 to 0.26 | ||
Attrition rates | |||||
Reported | 41 | 0.29 | 0.23 to 0.36 | 3.65*** | 0.000 |
Not reported | 28 | 0.11 | 0.03 to 0.18 | ||
Findings for all outcomes | |||||
Reported | 69 | 0.21 | 0.16 to 0.26 |
Changes in risk factors and symptoms
The outcome measures of each programme were subsequently divided into an averaged measure per programme indicating changes in risk factors (n=49), changes in depressive symptoms (n=69), and changes in psychiatric symptoms other than depression, such as anxiety (n=51). Measures for each group were averaged across programmes to obtain three mean effect sizes, one per group. There was no significant difference in effect size between depressive symptoms (WES=0.24, 95% CI 0.13–0.35), risk factors (WES=0.28, 95% CI 0.15–0.41) 0.15–0.41) and other psychiatric symptoms (WES=0.18, 95% CI 0.09–0.27), all three of which had significant and independent positive outcomes. Weighted mean effect sizes were further subdivided within these three groups into universal, selective and indicated approaches (Fig. 1). Comparisons of means indicated no significant difference between the type of preventive approach and changes in depressive symptoms, risk factors and changes in other psychiatric symptoms.
DISCUSSION
Limitations of the study: sampling bias
Caution is needed in interpreting metaanalytical findings because of the potential upward bias of the mean effect size (Reference Williams and GarnerWilliams & Garner, 2002), which can be examined on a funnel plot (Reference Begg, Cooper and HedgesBegg, 1994). The graph indicated no evidence of publication bias (Fig. 2). The fail-safe N is an estimate of the number of unpublished studies reporting null results needed to reduce the cumulated effect to the point of non-significance (Reference WolfWolf, 1986). The calculation of the fail-safe N to a criterion level of 0.1 resulted in a figure of 104 studies that would need to be included to reduce the effects to this criterion level. As the availability of such a large number of studies with null effects is unlikely, we assume that the studies included in the analysis are reasonably representative of the mean effect size.
Effective prevention and variation in outcome
Consistent with earlier meta-analyses for mental health promotion (Reference Durlak and WellsDurlak & Wells, 1997; Reference Tobler and StrattonTobler & Stratton, 1997; Reference Brown, Berndt and BrinalesBrown et al, 2000), our meta-analysis found a weighted mean effect size of 0.22. This is equivalent to an 11% improvement in the intervention groups compared with the control groups. Effect sizes for prevention programmes tend to be smaller than those of treatment, largely because prevention applies the same strategies to a population group that might or might not be at risk for a later mental health problem. However, from a public health perspective the prevention strategy can be cost-effective, as a small effect size in a large number of people can lead to a greater population gain than a large effect size in a small number of people (Reference RoseRose, 1993).
What leads to increased effects in depression prevention?
There was a large variation in programme outcomes. Subsequent analyses aimed to identify what might predict this variation.
Gender differences
There was a relationship between the percentage of male participants in universal and selective programmes and effect size, but not in indicated programmes. This finding is consistent with some within-trial findings (Reference Gillham, Reivich and JaycoxGillham et al, 1995) but not with others (Reference Seligman, Schulman and DeRubeisSeligman et al, 1999,Reference Seligman, Schulman and DeRubeisSeligman et al, 1999). It is possible that indicated programmes, which target specific problems with focused techniques, are more tailored to depressive symptoms and disorder than universal and selective programmes, and also take into account gender differences in the development of the programme. The results should be interpreted with caution, because the relationship is between the proportion of male participants in the programme and the effect size, not the actual effect size for each gender subgroup, which unfortunately is rarely reported. The results stress the importance of analysing and reporting gender differences in single trial evaluations to understand gender-specific programme effectiveness.
Initial level of risk
No difference was found between universal, selective and indicated programmes. There has been a marked preference for targeted interventions for depression prevention, because of evidence in reducing symptoms and incidence (Reference Clarke, Hawkins and MurphyClarke et al, 1995,Reference Clarke, Hawkins and MurphyClarke et al, 1995) and because subgroups identified at increased risk have seemed to benefit the most (Reference Price, Van Ryn and VinokurPrice et al, 1992,Reference Price, Van Ryn and VinokurPrice et al, 1992; Reference Gillham, Reivich and JaycoxGillham et al, 1995). However, evidence has also accumulated that universal preventive interventions can be beneficial for those at risk, because of lowered stigma and better socialisation (Reference Kellam, Ling and MeriscaKellam et al, 1998; Reference Reid, Eddy and FetrowReid et al, 1999). The results of our analysis have supported both these directions and there seems merit in interventions that combine both universal and targeted prevention (Conduct Problems Prevention Research Group, 2000).
Number and length of sessions
Research has focused on testing the efficacy of shortened versions of existing prevention programmes (Reference Munoz, Ying and Perez-StableMuñoz et al, 1993). The results of our analysis indicated that programmes with more than eight sessions and programmes with session lengths of 60–90 min yielded the larger effect sizes. The number of sessions is relevant for participants’ ability to internalise methods and processes offered by the interventions; fewer than nine sessions might not be enough. The length of sessions is important because of the group focus of prevention, where sufficient time needs to be allocated for interaction and group processes; less than an hour might not allow participants to feel engaged in a group process.
The promise of competence enhancement techniques
In addition to cognitive techniques (Reference Price and Bennett JohnsonPrice & Bennett Johnson, 1999; Reference Seligman, Schulman and DeRubeisSeligman et al, 1999,Reference Seligman, Schulman and DeRubeisSeligman et al, 1999; Reference Gillham, Shatte and FreresGillham et al, 2000; Reference Clarke, Hornbrook and LynchClarke et al, 2001), competence methods were also found to be effective across different age groups. Programmes that included behavioural techniques were detrimental for the elderly and were not superior for the other age groups. Programmes that combined three or more intervention methods were more effective than those that did not, suggesting the importance of multi-component programmes.
Provider qualifications
Lay personnel have been proposed as potential efficient programme providers for preventing depression (Reference Munoz, Ying and Perez-StableMuñoz et al, 1993). However, our meta-analysis found that lay personnel alone were not the best providers for selective and indicated programmes. The specificity and severity of depression in targeted populations who are already experiencing risk factors or symptoms may require trained personnel who are aware of and skilled in dealing with depressive symptoms.
Quality of the research design
Consistent with earlier findings (Reference Tobler and StrattonTobler & Stratton, 1997; Reference Brown, Berndt and BrinalesBrown et al, 2000) high-quality research trials were predictive of better outcomes. Well-defined intervention aims and accounting for attrition rates were independent predictors of effect size. Well-defined aims have already been identified as effect predictors in health promotion (Reference Kok, van den Borne and MullenKok et al, 1997). Reporting attrition rates might indicate a deeper analysis of intervention effects, and studies that do so might be more likely to have accounted for patient withdrawal at the outset, and to have provided the target group with incentives to continue in the programme.
Changes in depressive outcomes, risk factors and other psychiatric symptoms
Simultaneous positive changes in risk and protective factors and in related psychiatric symptoms (e.g. anxiety) were found in addition to the reductions in depressive symptoms, indicating the multiple outcome potential of prevention programmes. However, despite the evidence that prevention programmes can reduce depressive symptoms for both universal and targeted populations, few have demonstrated that the incidence of depression can be reduced (Clarke et al, Reference Clarke, Hawkins and Murphy1995,Reference Clarke, Hawkins and Murphy1995, Reference Clarke, Hornbrook and Lynch2001). There is an urgent need for further trials of sufficient power to study the impact of preventing the onset of depression and the role of moderating and mediating variables.
Clinical Implications and Limitations
CLINICAL IMPLICATIONS
-
▪ Prevention programmes to reduce depressive symptoms can lead to an 11% improvement in the intervention groups compared with control groups. However, the large variation in outcome stresses the importance of implementing only practices for which there is evidence of effect.
-
▪ Health and mental health care providers should be informed and provided with training in interventions to reduce and prevent depressive symptoms for targeted populations.
-
▪ Programmes that do not primarily target depression can lead to reductions in depressive symptoms, although unfortunately this is not often measured. When making choices for implementation, programmes targeting common risk and protective factors in addition to those focusing on depressive symptoms could lead to larger gains in other associated symptoms and disorders.
LIMITATIONS
-
▪ The inclusion criteria set for this study and insufficient information reported in single trial evaluations might have excluded other programmes that have targeted depressive symptoms.
-
▪ Although there was no evidence of publication bias, as with all meta-analyses, caution should be taken into account when interpreting the results because of non-included studies and non-reported findings.
-
▪ The gender result should be treated with caution because the finding is related to the proportion of male participants and the programme effect size, not the actual effect sizes of the two gender subgroups, which unfortunately are rarely reported.
Acknowledgements
This research was supported by the Dutch Health Research and Development. Council (ZON), grant number 2200.0020.
We express our deep appreciation to Dr Hendricks Brown for his statistical. and methodological support and feedback during the different research phases. of the project. We also thank Sietske van Haren, the second coder, for her. dedicated input during the coding process and Rianne Kassander for her support. during the revision of the paper.
eLetters
No eLetters have been published for this article.