Introduction
Obsessive-compulsive disorder (OCD) is a chronic and debilitating mental disorder (Abramowitz & Jacoby, Reference Abramowitz and Jacoby2014), with a lifetime prevalence of 1–3% worldwide (Fawcett, Power, & Fawcett, Reference Fawcett, Power and Fawcett2020; Kessler, Petukhova, Sampson, Zaslavsky, & Wittchen, Reference Kessler, Petukhova, Sampson, Zaslavsky and Wittchen2012; Ruscio, Stein, Chiu, & Kessler, Reference Ruscio, Stein, Chiu and Kessler2010; Subramaniam, Abdin, Vaingankar, & Chong, Reference Subramaniam, Abdin, Vaingankar and Chong2012), adversely affecting both individual patients and their families (Steketee, Reference Steketee1997; Stengler-Wenzke, Kroll, Matschinger, & Angermeyer, Reference Stengler-Wenzke, Kroll, Matschinger and Angermeyer2006).
Exposure and response prevention (ERP) is widely considered the gold standard psychological treatment for individuals with OCD and is often the core component of cognitive-behavioral therapy (CBT; Baruah et al., Reference Baruah, Pandian, Narayanaswamy, Bada Math, Kandavel and Reddy2018; Fals-Stewart & Schafer, Reference Fals-Stewart and Schafer1992; Mancebo, Yip, Boisseau, Rasmussen, & Zlotnick, Reference Mancebo, Yip, Boisseau, Rasmussen and Zlotnick2021). Additionally, other psychological treatments have also shown effectiveness in reducing OCD symptom severity, including cognitive therapy (CT; Alcolado & Radomsky, Reference Alcolado and Radomsky2016; van Balkom et al., Reference van Balkom, de Haan, van Oppen, Spinhoven, Hoogduin and van Dyck1998) and third-wave CBT, such as mindfulness-based cognitive therapy (MBCT; Zhang et al., Reference Zhang, Lu, Didonna, Wang, Zhang and Fan2021) and acceptance and commitment therapy (ACT; Schneider, Wittekind, Talhof, Korrelboom, & Moritz, Reference Schneider, Wittekind, Talhof, Korrelboom and Moritz2015; Twohig et al., Reference Twohig, Hayes, Plumb, Pruitt, Collins, Hazlett-Stevens and Woidneck2010).
Previous meta-analyses have examined the effectiveness of specific therapeutic approaches, such as CBT, ERP, and MBCT (Chien, Tse, Chan, Cheng, & Chen, Reference Chien, Tse, Chan, Cheng and Chen2022; Ferrando & Selai, Reference Ferrando and Selai2021; Reid et al., Reference Reid, Laws, Drummond, Vismara, Grancini, Mpavaenda and Fineberg2021; Spencer et al., Reference Spencer, Stiede, Wiese, Goodman, Guzick and Storch2023). An early meta-analysis of 19 studies (Rosa-Alcázar, Sánchez-Meca, Gómez-Conesa, & Marín-Martínez, Reference Rosa-Alcázar, Sánchez-Meca, Gómez-Conesa and Marín-Martínez2008) found similar effects of ERP, CT, and ERP + CT, compared to control groups. However, this meta-analysis synthesized evidence published from 1980 to 2006. The latest relevant meta-analysis (Öst, Havnen, Hansen, & Kvale, Reference Öst, Havnen, Hansen and Kvale2015) included 37 randomized controlled trials (RCTs) conducted between 1993 and 2014, and investigated the effectiveness of CBT (including CT, ERP, or CT + ERP) on OCD symptom severity. Yet, this analysis compared active treatments with each other and with control groups, making interpretation particularly complex for head-to-head comparisons due to the small number of analyzed studies. Moreover, it solely relied on the clinician-rated Yale-Brown Obsessive-Compulsive Scale (Y-BOCS), overlooking other common measures of OCD symptom severity, such as self-rated Y-BOCS and Obsessive-Compulsive Inventory-Revised (OCI/OCI-R), and only included evidence on a limited range of treatment delivery formats, including individual, group, and family involved. Additionally, the common practice of including studies published in English in meta-analyses might result in missing potentially important studies published in non-English languages, which could significantly contribute to the existing knowledge base.
To address the previous limitations, we aimed to conduct a new meta-analysis. First, we conducted extensive searches to include up-to-date RCTs comparing psychological treatments to control groups; second, we considered all available psychological treatments and outcomes measuring OCD symptom severity; lastly, in addition to widely used international databases, we also included Chinese bibliographic databases, recognizing China being the second most populous country in Asia and the world, with a substantial number of individuals suffering from OCD (“中国强迫症防治指,南2016(精编版),” 2016). While many Chinese researchers have explored OCD (符泽娟 & 谢海玲, Reference Fu and Xie2016; 耿艳萌, 许志鹏, & 吴长海, Reference Geng, Xu and Wu2009; 胡思思, Reference Hu2016), their work might not be accessible in international databases because of language barriers. By encompassing all relevant RCTs, we aimed to gain a comprehensive understanding of the overall effectiveness of psychological treatments for OCD relative to control groups. Drawing on existing knowledge (Öst et al., Reference Öst, Havnen, Hansen and Kvale2015; Rosa-Alcázar et al., Reference Rosa-Alcázar, Sánchez-Meca, Gómez-Conesa and Marín-Martínez2008), we hypothesized that psychological treatments would significantly reduce OCD symptom severity compared to control groups.
Methods
Identification of studies
The protocol for the current meta-analysis has been preregistered on the Open Science Framework and is accessible at https://archive.org/details/osf-registrations-n3rxf-v1.
We conducted comprehensive searches in both international and Chinese databases from inception to 1 January 2023. International databases include PubMed, Embase, PsycINFO, and the International clinical trials registry platform of WHO (ICTRP). Chinese databases include CNKI, WanFang, WeiPu, and China Clinical Trial Registry (CCTR). The full search strings can be found in Appendix A. Moreover, we searched the reference lists of earlier meta-analyses on psychological treatments for OCD (Gava et al., Reference Gava, Barbui, Aguglia, Carlino, Churchill, De Vanna and McGuire2007; Öst et al., Reference Öst, Havnen, Hansen and Kvale2015; Rosa-Alcázar et al., Reference Rosa-Alcázar, Sánchez-Meca, Gómez-Conesa and Marín-Martínez2008).
All records were screened by two researchers independently. Full-text retrieval was performed for studies meeting potential inclusion criteria by either researcher. Decisions regarding study inclusion or exclusion were made jointly, with any disagreements resolved through discussion.
Selection of studies
Inclusion criteria:
• RCTs comparing psychological treatments to control groups
• Control groups include waitlist, care-as-usual (CAU), pill placebo, and psychological placebo
• Participants diagnosed with primary OCD, determined through valid semi-structured interviews, including different versions of the Diagnostic and Statistical Manual of Mental Disorders (DSM) and the International Classification of Diseases and Related Health Problems (ICD)
• Any outcome measuring OCD symptom severity
• Be published, or in press
We did not exclude participants diagnosed with OCD and comorbid psychiatric conditions to increase the generalizability of the effects. No restrictions on setting, gender or age were applied.
Exclusion criteria:
• Stepped care management, maintenance, and augmentation trials;
• Unguided self-help psychological treatments (Cuijpers & Schuurmans, Reference Cuijpers and Schuurmans2007) where individuals work independently on the program/treatment protocol to learn and apply psychotherapeutic strategies.
Coding domains
Included RCTs were coded based on three categories:
• Study design: (a) type of psychological treatment (the definition of psychological treatment can be found in Appendix B); (b) type of control group.
• Effect size calculation: (a) outcome measure; (b) type of measurement (clinician-rated, self-rated); (c) assessment point (at baseline, post-treatment, follow-up); (d) treatment length; e) the analysed sample (completers or intention-to-treat); (f) mean, standard deviation, and number of participants in both groups.
• Moderators: (a) publication year; (b) region of study origin; (c) mean age; (d) age group (child: <13, adolescence: 13–18, adult: >18); (e) proportion of women; (f) recruitment methods of participants (clinical, community, other); (g) co-occurring disorder among participants (yes/no, defined as participants exhibiting at least one shared type of mental disorder); (h) proportion of participants using psychiatric medicine in both groups; (i) number of treatment sessions; (j) treatment delivery format, including individual, group, family-involved (involving the patient's family members in treatment sessions), guided self-help, and time-intensive (defined by a maximum total duration of 4 weeks, necessitating a minimum of 10 therapist hours overall, with an average weekly therapist commitment of at least 5 h; Jónsson, Kristensen, & Arendt, Reference Jónsson, Kristensen and Arendt2015).
Risk of bias
The risk of bias was assessed by the revised Cochrane risk-of-bias tool for randomized trials (ROB 2; Sterne et al., Reference Sterne, Savović, Page, Elbers, Blencowe, Boutron and Eldridge2019), which comprises five domains: (1) bias arising from the randomization process; (2) bias due to deviations from intended interventions; (3) bias due to missing outcome data; (4) bias in the measurement of the outcome; and (5) bias in the selection of the reported result.
Coding and assessment of risk of bias were conducted by two independent researchers, and any disagreements were resolved through discussion.
Outcome measures
Our primary outcome was OCD symptom severity. Any outcomes measuring OCD symptom severity were included, such as the clinician/self-rated Y-BOCS (Goodman et al., Reference Goodman, Price, Rasmussen, Mazure, Fleischmann, Hill and Charney1989; Schaible, Armbrust, & Nutzinger, Reference Schaible, Armbrust and Nutzinger2001), the Children's Y-BOCS (Scahill et al., Reference Scahill, Riddle, McSwiggin-Hardin, Ort, King, Goodman and Leckman1997), OCI/OCI-R (Foa et al., Reference Foa, Huppert, Leiberg, Langner, Kichic, Hajcak and Salkovskis2002; Foa, Kozak, Salkovskis, Coles, & Amir, Reference Foa, Kozak, Salkovskis, Coles and Amir1998), the Dimensional Obsessive-Compulsive Scale-Short Form (DOCS-SF; Abramowitz et al., Reference Abramowitz, Deacon, Olatunji, Wheaton, Berman, Losardo and Adams2010), the Vancouver Obsessional Compulsive Inventory (VOCI; Thordarson et al., Reference Thordarson, Radomsky, Rachman, Shafran, Sawchuk and Hakstian2004), and the National Institute of Mental Health Obsessive-Compulsive Scale/National Institute of Mental Health Global Obsessive Compulsive Scale (NIMH-OC/GOCS; Baer & Minichiello, Reference Baer and Minichiello1990) etc.
Meta-analysis
We conducted meta-analysis using packages ‘metapsytools’, ‘meta’ (Schwarzer, Reference Schwarzer2007), ‘metafor’ (Viechtbauer, Reference Viechtbauer2010), and ‘dmetar’ (Harrer, Cuijpers, Furukawa, & Ebert, Reference Harrer, Cuijpers, Furukawa and Ebert2021) in R version of 4.2.1 through R studio.
We calculated Hedges' g (Hedges & Olkin, Reference Hedges and Olkin2014) for each study, considering small sample sizes. Mean, standard deviation, and the number of participants in treatment and control conditions at post-treatment and follow-up were used for effect size calculations. Effect sizes were calculated by subtracting the average score of the control group from the average score of the psychological treatment group, and dividing the result by the pooled standard deviation. If means and standard deviations were not reported, we converted dichotomous outcomes into effect sizes using the methods described by Borenstein, Hedges, Higgins, & Rothstein (Reference Borenstein, Hedges, Higgins and Rothstein2021). If dichotomous outcomes were not available either, we extracted change scores from baseline in both groups. If none of them were available, other statistics (such as t value or p value) were used to calculate the effect size. An effect size of 0.2, 0.5, and 0.8 was interpreted as small, moderate, and large, respectively (Cohen, Reference Cohen2013).
In our primary analysis for calculating the pooled effect size, we first aggregated all available effect size data for a comparison between psychological treatment and control group within a specific study. Then, we pooled these aggregated effects across comparisons before pooling across studies (Borenstein, Hedges, Higgins, & Rothstein, Reference Borenstein, Hedges, Higgins and Rothstein2009).
To account for the expected heterogeneity among the studies, we used random-effects models in all analyses. Heterogeneity was assessed using the I 2 statistic (Ioannidis, Patsopoulos, & Evangelou, Reference Ioannidis, Patsopoulos and Evangelou2007) and its 95% confidence interval (CI), with values quantified as low (25%), moderate (50%), and high (75%) (Higgins, Thompson, Deeks, & Altman, Reference Higgins, Thompson, Deeks and Altman2003). Prediction intervals (PI) were included to represent the 95% CI of the predictive distribution of effects in future comparable trials.
Next to examining the funnel plot, we corrected our primary analyses by the presence of publication bias employing three methods: Duval and Tweedie's trim and fill procedure (Duval & Tweedie, Reference Duval and Tweedie2000), Rücker's Limit Meta-Analysis (Rücker, Schwarzer, Carpenter, Binder, & Schumacher, Reference Rücker, Schwarzer, Carpenter, Binder and Schumacher2011), and selection models (McShane, Böckenholt, & Hansen, Reference McShane, Böckenholt and Hansen2016). Additionally, we conducted Egger's test of the intercept to quantify the bias captured by the funnel plot and to test whether it was significant (Egger, Smith, Schneider, & Minder, Reference Egger, Smith, Schneider and Minder1997).
Subgroup analyses were conducted for categorical variables to explore potential moderators of effect sizes, using a mixed-effects model. Within subgroups, effect sizes were pooled using the random-effects model, and differences between subgroups were tested using the fixed-effects model. Subgroup analyses required at least three studies per subgroup. Additionally, bivariate meta-regression analyses were performed for continuous variables (i.e. the proportion of female, and the proportion of participants using psychiatric medicine in the treatment group), using clinician-rated Y-BOCS as the main outcome, which is considered the gold standard for assessing OCD symptom severity (Goodman et al., Reference Goodman, Price, Rasmussen, Mazure, Fleischmann, Hill and Charney1989), to investigate potential predictors of effect sizes,
Sensitivity analyses were performed by: (a) excluding outliers (studies whose 95% CIs of effect sizes did not overlap with the 95% CI of the pooled effect size); (b) limiting analysis to only studies with low risk of bias (all domains were rated as ‘low risk’ by RoB 2); (c) focusing on each specific instrument of OCD symptom severity reported across studies: (d) estimating the pooled effect size using a three-level correlated hierarchical effects (CHE) model (Cheung, Reference Cheung2014) with an assumed intra-study correlation of ρ = 0.5, and (e) calculating the effect considering the smallest or largest effect in each study.
Additionally, we calculated the relative risk (RR) of study dropout (any cause discontinuation) in the treatment groups compared with the control groups at post-treatment and pooled them using the Mantel–Haenszel method (Robins, Greenland, & Breslow, Reference Robins, Greenland and Breslow1986).
Results
Search, selection, and inclusion of studies
Our search strategy identified a total of 11 235 records (3057 international, 8178 Chinese). After removing duplicates, 7752 records were screened based on titles and abstracts and 576 studies underwent full-text screening. Ultimately, 48 RCTs were included (47 international, 1 Chinese). The PRISMA flowchart describing the study search, selection, and inclusion process is presented in Fig. 1.
Characteristics of included studies
Key characteristics of the included studies are summarized in Table 1. The references of included studies are provided in online supplementary material. A total of 48 RCTs, comprising 55 comparisons between psychological treatments and control groups, involving 2731 patients with OCD, were included in the analysis. Among these comparisons, 38 focused on adults, 11 on children, and 6 on adolescents. Participants' mean age ranged from 6 to 43 years, with an average of 52% female participants. Three studies focused on OCD with co-occurring disorder, including substance abuse (Fals-Stewart & Schafer, Reference Fals-Stewart and Schafer1992), autism spectrum disorder (Russell et al., Reference Russell, Jassi, Fullana, Mack, Johnston, Heyman and Mataix-Cols2013), and autistic symptoms (Wolters, de Haan, Hogendoorn, Boer, & Prins, Reference Wolters, de Haan, Hogendoorn, Boer and Prins2016). Geographically, the majority of comparisons (20) were conducted in North America, followed by Europe (10), Australia (9), the United Kingdom (6), Iran (3), Brazil (2), China (2), Japan (2), and India (1).
Psy, type of psychological treatment; CBT, cognitive-behavioral therapy; CT, cognitive therapy; ERP, exposure and response prevention; 3rd, third-wave cognitive-behavioral therapy; other, other psychological treatment; Ctr, control group; WL, waitlist; CAU, care-as-usual; Psy p, psychological placebo; pill p, pill placebo; Fmt, treatment format; ind, individual; grp, group; FI, family-involved; gsh, guided self-help; TI, time-intensive; other, other/mixed format/not clear; Nsess, number of treatment sessions at post-treatment; Pos, assessment point at post-treatment; FU, assessment point at follow-up; Outcome, outcome measures; Y-BOCS, the Yale-Brown Obsessive-Compulsive Scale; CY-BOCS, the Children's Yale-Brown Obsessive Compulsive Scale; OCI (-R), the Obsessive Compulsive Inventory (-Revised); NIMH-OC, the National Institute of Mental Health Obsessive-Compulsive Scale; NIMH-GOCS, the National Institute of Mental Health Global Obsessive Compulsive Scale; PADUA, the Padua Inventory; DOCS (-SF), the Dimensional Obsessive-Compulsive Scale (- Short form); VOCI, the Vancouver Obsessional Compulsive Inventory; D-YBOCS, the Dimensional-Yale Brown Obsessive-Compulsive Scale; MOCI, the Maudsley Obsessional-Compulsive Inventor; CHOCI-R-C/P, the Children's Obsessional Compulsive Inventory–Revised – children reported/patients reported); N_Psy, number of participants in treatment condition at post-treatment; N_Ctr, number of participants in control condition at post-treatment; D_Psy, number of dropouts in treatment condition at post-treatment; D_Ctr, number of dropouts in control condition at post-treatment. M age, mean age; Age g, age group (child; adol, adolescent; adult); % Women, the proportion of women; Region, region of the trial (North A, North America; Aus, Australia; EU, Europe; UK, United Kingdom; Brazil; Iran; Japan; China; India); Recur, methods of recruiting participants (com, community; clin, clinical setting; other); Comd, co-occurring mental disorder (yes/no); % Med_P, proportion of participants using psychiatric medicine in treatment condition; % Med_C, proportion of participants using psychiatric medicine in control condition; SG, sequence generation (positive or negative [negative includes unclear]); AC, allocation concealment (positive or negative [negative includes unclear]); BA, blinded assessment (positive or negative [negative includes unclear]; sr, self-report); IOD, incomplete outcome data (positive or negative [negative includes unclear]); SOR, selective outcome reporting (positive or negative [negative includes unclear]).
Treatment types: 28 treatments used CBT, 17 used ERP, 4 used CT, 3 used third-wave CBT, and 3 used other psychological treatments (satiation therapy, attachment-based intervention, and mixed intervention consisting of detached mindfulness plus cognitive restructuring). The majority of studies (88%) utilized treatment manuals, 75% conducted fidelity checks, and 85% delivered treatments through professionals (see Appendix C for the detailed information).
Control groups: 29 comparisons used waitlists, 16 used psychological placebos, 7 used CAUs, and 3 used pill placebos.
Treatment formats: 26 treatments were delivered as individual face-to-face, 11 used family-involved, 6 used guided self-help, 4 used group, 4 used time-intensive, and 4 used other treatment delivery formats, including mixed format of individual plus group (Mancebo et al., Reference Mancebo, Yip, Boisseau, Rasmussen and Zlotnick2021), videoconference plus telephone (Vogel et al., Reference Vogel, Solem, Hagen, Moen, Launes, Håland and Himle2014), and two unclear formats (Rezvan et al., Reference Rezvan, Bahrami, Abedi, Macleod, Doost and Ghasemi2013; 符泽娟 & 谢海玲, Reference Fu and Xie2016).
Treatment sessions ranged from 2 to 26, with post-treatment assessments conducted between 3 and 26 weeks and follow-up assessments occurring at 12–58 weeks (approximately 3–13 months) after randomization. Outcome measurements varied across trials, with clinician-rated Y-BOCS (33) and CY-BOCS (16) being the most commonly used for OCD studies on adults and children/adolescent, respectively. Other measures included OCI/OCI-R (11), self-rated Y-BOCS (4), and NIMHOC/NIMHGOCS (7) etc.
Regarding risk of bias, only 53% reported adequate sequence generation, 40% had adequate concealed allocation, 76% employed blinded outcome assessors or self-report outcomes, and 33% used appropriate methods to handle missing data. Moreover, 84% of the RCTs were at risk of selective reporting, either not registered or registered retrospectively. In total, 87% of RCTs were rated as having a high risk of bias. The overall risk of bias assessment is presented in Fig. 2.
Effects of psychological treatments on OCD symptom severity
The combined effect size was g = −1.14 (95% CI [−1.31 to −0.97)) at post-treatment with high between-study heterogeneity (I 2 = 72.23%; 95% CI [63.7–78.75); see Table 2). The prediction interval ranged from −2.20 to −0.08. However, the effects lost significance at 3–6 month (g = −1.45; 95% CI [−3.06 to 0.16]; n = 6) and 6 −12 month follow-ups (g = −1.32; 95% CI [−3.38 to 0.73]; n = 5). The forest plot is presented in Fig. 3.
K, number of comparisons; CI, confidence interval; PI, predication interval; ERP: exposure and response; CBT: cognitive-behavioral therapy; third-wave: third-wave cognitive-behavioral therapy; Y-BOCS, the Yale-Brown Obsessive-Compulsive Scale; CY-BOCS, the Children's Yale-Brown Obsessive Compulsive Scale; OCI (-R): the Obsessive Compulsive Inventory (-Revised); NIMH-OC, the National Institute of Mental Health Obsessive-Compulsive Scale; NIMH-GOCS, the National Institute of Mental Health Global Obsessive Compulsive Scale; PADUA, the Padua Inventory; DOCS (-SF), the Dimensional Obsessive-Compulsive Scale (-Short form); VOCI, the Vancouver Obsessional Compulsive Inventory; D-YBOCS, the Dimensional- Yale Brown Obsessive-Compulsive Scale; MOCI, the Maudsley Obsessional-Compulsive Inventor CHOCI-R-C/P, the Children's Obsessional Compulsive Inventory–Revised – children reported/patients reported.
a According to a random-effects model.
b Barrett (2003); Barrett (2004) cbft(ind); Barrett (2004) cbft(grp); Freeman (2008); Khodarahimi (2009) erp; Khodarahimi (2009) other psy; Kyrios (2018); Russell (2013); van-Balkom (1998)-CT; Whittal (2010); Zhang (2021).
The p values indicate whether the difference between the effect sizes in the subgroups is significant.
There were some indications for publication bias, as suggested by the funnel plot and the Egger's test of the intercept (intercept = −2.57; 95% CI [−0.37 to 0.24]; p = 0.002; see Appendix D). After adjustment for publication bias using Duval and Tweedie's trim-and-fill procedure, which identified 13 missing studies, the effect size remained large and significant (g = −0.88; 95% CI [to 1.09 to −0.67]). The selection model yielded similar results as the trim-and-fill procedure, and the limit meta-analysis revealed smaller but still significant moderate effect size (g = −0.59; 95% CI [−0.93 to −0.24]; see Table 2).
Sensitivity analyses
After removing 11 outliers, the effect size remained comparable to the overall effect size (g = −1.08; 95% CI [−1.2 to −0.97]), and between-study heterogeneity decreased considerably (I 2 = 33.35%; 95% CI [3.35–54.04]). Furthermore, sensitivity analyses for specific instruments showed consistent and robust findings, such as clinician-rated Y-BOCS (g = −1.11; 95% CI [−1.33 to −0.88]; n = 33), CY-BOCS (g = −1.26; 95% CI [−1.65 to −0.87]; n = 16), OCI/OCI-R (g = −0.99; 95% CI [−1.31 to −0.67]; n = 11), and NIMHOC/NIMHGOCS (g = −2.07; 95% CI [−2.79 to −1.34]; n = 7). However, the sensitivity analyses for self-rated Y-BOCS (g = −1.88; 95% CI [−3.8 to 0.04]; n = 4) and DOCS/DOCS-SF (g = −0.98, 95% CI [−2.65 to 0.68]; n = 3) did not show significant findings. Moreover, when limiting the analysis to only studies with low risk of bias (n = 3), the finding lost significance (g = −0.87; 95% CI [−2.25 to 0.5]; see Table 2).
Subgroup analyses
No significant differences in effect sizes were found based on age group, recruitment methods of participants, presence of comorbidity, handling of missing data, or the region of study origin. However, significant differences were observed for treatment type, control group type, and treatment delivery format. Specifically, ERP resulted in the largest effect size (g = −1.25, 95% CI [−1.56 to −0.94]), followed by CBT, which had a slightly smaller, but still comparable effect (g = −1.13, 95% CI [−1.37 to −0.9]) to ERP. The effects of CT and third-wave CBT were not significant. Psychological placebo resulted in the smallest effect (g = −0.82, 95% CI [−1.06 to −0.59]), while pill placebo resulted in the largest effect (g = −1.32, 95% CI [−2.22 to −0.42]) and waitlist (g = −1.29) and CAU (g = −1.27) had comparable effects to pill placebo. All treatment formats had significant large effects in treating OCD, with time-intensive (g = −1.41, 95% CI [−1.98 to −0.84]) and family-involved (g = −1.39, 95% CI [−1.99 to −0.8]) treatments exhibited comparable larger effects than the remaining formats (see Table 2). We excluded the categories of ‘other psychological treatments’ and ‘other treatment delivery formats’ from subgroup analyses, because studies in both categories involved totally different type of treatments or formats, compromising the representativeness of the findings.
Bivariate meta-regression analyses
No significant associations were found for mean age, proportion of women, proportion of participants using psychiatric medicine in the treatment group, and number of treatment sessions. However, a significant negative association was observed between baseline OCD symptom severity and effect size (β = −0.1, p = 0.02; see Appendix E).
Dropout
Study dropout rates in 59 comparisons between psychological treatments and control groups were calculated, revealing no indication of a differential pooled dropout rate (RR = 0.92; 95% CI [0.77–1.09]; p = 0.31).
Discussion
Synthesizing the largest number of RCTs to date, we provided a relatively comprehensive understanding of the overall effectiveness of psychological treatments for OCD. Compared to control groups, psychological treatments significantly reduced OCD symptom severity at post-treatment, and the effect persisted after a series of sensitivity analyses, including adjustment for publication bias.
We found that the type of treatment, control group, treatment format, and baseline OCD symptom severity may influence the treatment effects. Specifically, ERP showed the largest effect size, supporting its gold-standard status in OCD treatment. CBT received the most research attention and demonstrated a comparable effect size to ERP, indicating its widespread use and effectiveness. However, CT and third-wave CBT had smaller and non-significant effects, not confirming the previous studies. One previous meta-analysis (Rosa-Alcázar et al., Reference Rosa-Alcázar, Sánchez-Meca, Gómez-Conesa and Marín-Martínez2008) found similar effects of ERP, CT, and ERP + CT based on studies published between 1980 and 2006, and another one (Başkaya, Özgüç, & Tanrıverdi, Reference Başkaya, Özgüç and Tanrıverdi2021) demonstrated a small but significant effect of MBCT on OCD symptom severity according to five studies containing non-RCTs. Although our meta-analysis contained a larger number of RCTs and provided the most up-to-date and comprehensive evaluation of psychological treatments for OCD, the effects of CT and third-wave CBT in present study should be interpreted cautiously due to the small number of studies with small simple sizes synthesized. Additionally, it may also suggest that the effect of CBT mainly arises from ERP, with CT playing a lesser role in OCD treatment, which is in line with previous research (Olatunji et al., Reference Olatunji, Rosenfield, Tart, Cottraux, Powers and Smits2013). Concerning control groups, psychological placebo yielded the smallest effect size, consistent with findings in a depression meta-analysis (Cuijpers, Quero, Papola, Cristea, & Karyotaki, Reference Cuijpers, Quero, Papola, Cristea and Karyotaki2021). The majority of studies used waitlist as a control group, which is associated with an overestimation of treatment effect (Michopoulos et al., Reference Michopoulos, Furukawa, Noma, Kishimoto, Onishi, Ostinelli and Cuijpers2021). Although effect sizes estimated with CAU and pill placebo comparators were similar to waitlist, the number of studies using such control conditions was very small and their results were probably uncertain. Regarding treatment format, individual treatment received most research attention and exhibited a large effect, showing its extensive use and effectiveness. Although group and guided self-help exhibited smaller effects than individual, time-intensive, and family-involved treatments, their effects could still be considered large, and they holds great advantages for facilitating implementation. While the association between time-intensive and larger effects aligns with prior research (Jónsson et al., Reference Jónsson, Kristensen and Arendt2015), caution is advised due to the limited number of RCTs supporting this conclusion. Additionally, previous meta-analyses have yielded mixed results on the effectiveness of family-involved treatment. One previous meta-analysis (Öst, Riise, Wergeland, Hansen, & Kvale, Reference Öst, Riise, Wergeland, Hansen and Kvale2016) showed that family-involved CBT was not significantly better than individual CBT based on two head-to-head comparisons. On the other hand, other previous meta-analytic studies (Iniesta-Sepúlveda, Rosa-Alcázar, Sánchez-Meca, Parada-Navas, & Rosa-Alcázar, Reference Iniesta-Sepúlveda, Rosa-Alcázar, Sánchez-Meca, Parada-Navas and Rosa-Alcázar2017; McGrath & Abbott, Reference McGrath and Abbott2019) including both RCTs and non-RCTs showed large effects of family-involved CBT based on pre–post effect sizes. While the effect of the family-involved treatment needs further confirmation with more studies, our current findings highlight the potential importance of family function in OCD treatment. Moreover, we observed that more severe baseline OCD symptomatology predicted higher degree of change in the treatment, consistent with a previous study (Andersson et al., Reference Andersson, Ljótsson, Hedman, Enander, Kaldo, Andersson and Rück2015). Future investigations should further explore the relationship between baseline OCD symptomology and treatment effect, ideally by exploring data at the patient level.
We encountered some discrepancies in the exploration of the long-term effects of psychological treatments for OCD, compared to previous related meta-analyses. First, previous meta-analyses on adults (Öst et al., Reference Öst, Havnen, Hansen and Kvale2015) and children/adolescents (Öst et al., Reference Öst, Riise, Wergeland, Hansen and Kvale2016) with OCD reported mean follow-up periods of 15 months and 9 months, respectively. In contrast, our meta-analysis did not find similar follow-up periods. This discrepancy could be attributed to the nature of the comparisons included in our study. In our meta-analysis, we focused on the comparisons between psychological treatments and control groups, with most control groups involving waitlists, whose follow-up data, if available, is often not usable due to participants receiving the experimental intervention after the post-test. In contrast, the previous meta-analyses included both control and active treatment groups, which may potentially extend the overall follow-up periods. Additionally, we did not find significant effect sizes at 3–6 month and 6–12 month follow-up intervals, possibly due to the limited number of included trials. Future studies investigating the long-term effects of psychological treatments for OCD are necessary to provide more robust evidence in this area.
Recognizing the diversity in the assessment of OCD symptoms across studies as a reflection of real-world clinical variations, we aimed to comprehensively evaluate the overall effectiveness of psychological treatments for OCD by combining all available outcomes measuring OCD symptoms. Despite the variability, our findings indicated that, compared to control groups, psychological treatments were consistently effective across most outcome measures, with the exception of self-rated Y-BOCS and DOCS (-SF), where the non-significant results may have been influenced by the limited sample sizes. However, it is crucial to acknowledge that large between-study heterogeneity and high risk of bias observed across included RCTs introduced uncertainty and reduced the validity of the results. When the analysis was restricted to the studies with low risk of bias, the effect lost its statistical significance. Although the result was based on only three studies with low risk of bias, it adds uncertainty to the overall effect size, as these studies had more rigorous methodological designs and may better reflect the true effect estimate. Therefore, caution is necessary when interpreting the results from our meta-analysis. To strengthen the evidence base, conducting well-designed RCTs with appropriate randomization procedures, strict allocation concealment, and comprehensive reporting of outcomes are needed. What's more, this meta-analysis only included the outcomes measuring OCD symptom severity. Future studies should also consider other secondary outcomes, such as the quality of life, as a reduction in OCD symptom severity may not necessarily correlate with an overall improvement in the quality of life for individuals with OCD.
This study holds significant implications for advancing both clinical research and the treatment of OCD. First and foremost, it underscores the overall effectiveness of psychological treatments for OCD. However, the presence of high risk of bias across studies highlights the critical need for standardization in study design and methodology. The call for pre-registration of trials becomes imperative to mitigate publication bias and selective reporting in future studies. Additionally, there is a clear demand for more long-term evidence on the effectiveness of psychological treatments for OCD.
Second, the high between-study heterogeneity indicates varying treatment effects across studies. While subgroup analyses provide some insights into potential causes of differences, further research is necessary to identify potential moderators and predictors. Particularly, the direct comparison between ERP and CT is highly recommended, to further verify the effect of CT in treating OCD and its function in the CBT treatment containing both ERP and CT. Trials of head-to-head comparisons between different formats and studies investigating the effectiveness and acceptability of different formats are also needed, to further explore the most effective treatment format for OCD generally and which format is most beneficial to which population group.
Lastly, the study highlights the need for more rigorous research on psychological treatments for OCD in the Chinese context. Despite the initial identification of numerous Chinese studies on OCD, only one RCT was included, shedding light on existing research gaps. Pure pharmacotherapy was the predominant reason for the exclusion of Chinese studies in the full-text screening. Furthermore, the comparison between psychological nursing plus traditional nursing to traditional nursing was the main focus of Chinese studies in the field of psychological treatment for OCD, incompatible of our conceptualizations of treatment and control groups in present study. Non-RCT designs in numerous studies also played a pivotal role in their exclusion. Future research on psychological treatments with more rigorous methodological designs would be needed to provide more robust evidence regarding the effectiveness of psychological treatments for OCD within the Chinese population.
Conclusion
The current meta-analysis provides evidence for the effectiveness of psychological treatments in alleviating OCD symptom severity, with ERP emerging as the most effective treatment. CT and third-wave CBT were somewhat less effective, although only few trials were available for each treatment type. Thus, further research is needed to draw conclusions on comparative effectiveness. All treatment formats hold large effect sizes, but making treatment more intensive or involving family members could potentially improve effectiveness. Additionally, baseline OCD severity may predict treatment response. More evidence on long-term treatment effects is needed, as well as studies with more rigorous methodology and reporting. Finally, further evidence on diverse cultural contexts could offer valuable insights and would increase the generalizability of the findings.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S0033291724001375.
Acknowledgment
We thank CM, MC, AA, JL, RZ for their kind help during data extraction.
Author contributions
YW, SYS, LMW, and PC designed the study. YW conducted analyses and prepared the draft manuscript. MBT, SYS, LMW, CM, and PC provided critical and intellectual comments on the draft. YW, CM, MC, AA, JL, and RZ contributed to data collection. All authors reviewed and approved the final manuscript.
Funding statement
YW is financially supported by the Chinese Scholarship Council Grant no. 202108330055 for her Ph.D. The funding source had no role in the design of the research.
Competing interests
The authors declare there are no conflicts of interest.