Background
Cognitive impairment is considered a core feature of schizophrenia,Reference Green, Horan and Lee1 and is associated with poor functional outcomes in multiple domains.Reference Green, Kern and Heaton2,Reference Mucci, Galderisi, Gibertoni, Rossi, Rocca and Bertolino3 Meta-analyses and prospective cohort studies have detected a medium-sized (8–9 IQ point) impairment in premorbid IQ in those who go on to develop schizophrenia,Reference Meier, Caspi, Reichenberg, Keefe, Fisher and Harrington4,Reference Woodberry, Giuliano and Seidman5 and an even larger impairment (14–15 IQ points) following symptom onset.Reference Meier, Caspi, Reichenberg, Keefe, Fisher and Harrington4,Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman6 Although impairment affects almost all cognitive domains,Reference Mollon, David, Zammit, Lewis and Reichenberg7 there is evidence of slight variations in the degree of impairment, with replicated findings that memory, executive functions, attention and processing speed domains are the most compromised.Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman6,Reference Nuechterlein, Barch, Gold, Goldberg, Green and Heaton8,Reference Sheffield, Karcher and Barch9 As a result of an historical lack of longitudinal cohort studies, the course of cognitive impairment has been largely estimated by cross-sectional studies of groups at different illness development stages.Reference Woodberry, Giuliano and Seidman5,Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman6,Reference Fioravanti, Carlone, Vitale, Cinti and Clare10–Reference Heinrichs and Zakzanis12
The trajectory of cognitive impairments following a first episode of psychosis has been the centre of debate about whether cognitive impairment is insidious, developing gradually over time, or whether cognitive dysfunction begins early, with critical periods of rapid decline prior to, and in the early years following, onset of symptoms with stability thereafter.Reference Lewandowski, Cohen and Öngur13 The early years following a first episode of psychosis have been proposed as a ‘critical period’ for recovery in which individuals can make the greatest improvement in social functioning.Reference Birchwood, Todd and Jackson14–Reference Luther, Rosen, Cummins and Sharma16 Understanding the course of cognitive functioning therefore has important implications for consideration of difficulties at different illness stages, and for the development and optimisation of intervention strategies.
A previous meta-analysis examined the progression of cognitive deficits within the first 5 years following first episode of psychosis onset, including longitudinal studies up until February 2013.Reference Bora and Murray17 The lack of evidence of continued cognitive decline provided support for a neurodevelopmental risk-factor model of schizophrenia,Reference Weinberger18,Reference Murray, Bhavsar, Tripoli and Howes19 rather than a neurodegenerative model characterised by a chronic active illness progression.Reference DeLisi, Sakuma, Tew, Kushner, Hoff and Grimson20,Reference Lieberman21 However, potentially because of the scarcity of studies, this meta-analysis included randomised controlled studies and those in which cohorts had undergone several cognitive testing visits within the first year, increasing the likelihood of treatment and practice effects playing a role.
Aims of the current study
Almost a decade on, many more cohort studies have reported on cognitive performance in the years following a first episode of psychosis. Some provided data that do not align with the neurodevelopmental risk-factor model as they indicate worsening performance in some domains,Reference Haatveit, Vaskinn, Sundet, Jensen, Andreassen and Melle22–Reference Torgalsbøen, Mohn and Rishovd Rund24 whereas others indicate cognitive stability or improvement across all domains.Reference González-Ortega, de los Mozos, Echeburúa, Mezo, Besga and Ruiz de Azúa25,Reference Sánchez-Torres, Moreno-Izco, Lorente-Omeñaca, Cabrera, Lobo and González-Pinto26 The focus of this meta-analysis is to comprehensively review the literature on the longitudinal trajectory of cognitive deficits in the early years following psychosis onset. This study will only consider longitudinal cohort studies to exclude potential forms of bias such as randomisation to trial arms. Multiple exposure bias will be controlled by including only the results from the most recent assessment after baseline, with a minimum of 12 months between assessments to minimise practice effects. In addition to cognitive domains previously covered in meta-analyses this study will be the first, to our knowledge, to synthesise longitudinal data on social cognition that has been shown to have a stronger relationship with social functioning than cognition.Reference Cowman, Holleran, Lonergan, O'Connor, Birchwood and Donohoe27,Reference Fett, Viechtbauer, Dominguez, Penn, van Os and Krabbendam28 As in Bora & Murray (2014),Reference Bora and Murray17 this meta-analysis will include longitudinal studies examining cognitive changes within the proposed ‘critical period’ 1–5 years following psychosis onset, when confounding effects of longer-term illness (such as medication) are likely to be minimised.Reference Birchwood, Todd and Jackson14 Potential moderators of cognitive change will also be explored.
The findings of this meta-analysis will provide evidence on the evolution of global, domain and task-specific cognitive impairments over time. Relative stability is thought to provide support for the neurodevelopmental theory of schizophrenia, whereas continued cognitive decline is considered to fit with a neurodegenerative model. The findings have implications for the implementation and optimisation of interventions targeting cognitive difficulties such as cognitive remediation therapy, which have been shown to effectively target cognitive processes and improve social functioning.Reference Vita, Barlati, Ceraso, Nibbio, Ariu and Deste29,Reference Wykes, Huddy, Cellard, McGurk and Czobor30
Method
Protocol and registration
This review follows the PRISMA guidelines for reporting systematic reviews.Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow31 The protocol was pre-registered on the PROSPERO registry, ID: CRD42021241525 on 22 March 2021.
Search strategy
The following three electronic databases were searched up to the end of September 2021: ‘PubMed’, ‘PsycINFO’, and ‘Scopus’, using the following search terms: to identify participants: (‘psychosis’ OR ‘schizophrenia’ OR ‘schizo*’ OR ‘FEP’ OR (‘first episode psychosis’) AND to identify cognitive measures: (‘cog*’, ‘neurops*’ OR ‘neurocognit*’ OR ‘memory’ OR ‘attention’, OR ‘executive’ or ‘processing OR ‘social cog’, OR ‘social knowledge’ OR ‘attribution bias’ OR theory of mind’ OR ‘social perception’) AND to identify the study design: (‘longitudinal’ OR ‘follow-up’ OR ‘chang*’ OR ‘course’ OR ‘trajectory’). Article titles and abstracts were reviewed to identify relevance, with full texts then being examined. A backwards reference search of included articles was performed manually to identify additional studies meeting the inclusion criteria.
Inclusion/exclusion criteria
Inclusion criteria were studies that:
(a) are published in an English language peer-reviewed journal;
(b) report longitudinal cognition data;
(c) used at least one standardised test at two time points;
(d) had a minimum follow-up period of 12-months; and
(e) included participants with a first episode of psychosis, defined as being within 5 years of their first psychosis episode at the time of the initial assessment.
Inclusion criteria for follow-up duration was 1–5 years from initial assessment.
Exclusion criteria were:
(a) multiple reports of the same data (in which case the study reporting the earliest cognitive assessment follow-up was included);
(b) unpublished studies, reviews, conference abstracts or case reports;
(c) studies not reporting quantitative data;
(d) studies using a non-standardised test;
(e) controlled intervention studies (i.e. randomised control intervention studies);
(f) studies conducting additional cognitive follow-up testing before the minimum follow-up period (e.g. at 6 months); these were excluded to minimise practice effects; or
(g) studies conducted in early-onset psychosis (age < 16 years).
Screening
After removing duplicates two authors (A.J.W., L.H.) independently screened all titles and abstracts for eligibility. Full texts for eligible papers were also further screened for eligibility by the same authors and disagreements resolved by a third author.
Data extraction
Study sample size, follow-up duration, mean age, gender (% male), sample diagnoses, % on antipsychotic medication, cognitive tasks used, cognitive data (baseline and follow-up) and positive and negative symptom scores (baseline and follow-up) were extracted independently by two authors, for inclusion in main and moderator analyses. For comparison with the psychosis group, the same variables were also extracted for healthy controls. A comprehensive analysis of test–retest scores in healthy controls was beyond the scope of this study, so healthy control data were restricted to data from the papers included in the main analysis
Cognitive measures
Where studies reported cognitive domain summary scores (e.g. processing speed), these were extracted directly from articles. Where only individual tests scores were reported, these were assigned to predefined cognitive domains. As there is little consensus on how individual tests map onto cognitive domains, assignment was guided by the articles and broader literature. For studies with multiple tests measuring the same neuropsychological construct (e.g. WAIS-Digit Symbol Substitution Test and Trail Making Task A), a single domain summary measure (i.e. processing speed) was calculated by averaging effect sizes. We also computed a global cognition score using source data where possible, and where no summary measure was reported, effect sizes of all neuropsychological measures were pooled (where at least two seperate domains were measured). For a full list of domains and allocation of specific cognitive tasks, see Supplementary S1 and Supplementary Table S1, available at https://doi.org/10.1192/bjp.2022.131.
Study quality
The Newcastle–Ottawa ScaleReference Luchini, Stubbs, Solmi and Veronese32 was used to assess the quality of each included study. This is a commonly used tool for non-randomised studies and includes a quality rating for: selection, comparability and exposure/outcome. Studies are given a star rating from zero to nine. Data quality was rated by authors A.J.W. and L.H.
Meta-analytic procedure
Meta-analyses were conducted using the ‘Metafor’ statistical package for R.Reference Viechtbauer33 Using data reported in each study, standardised mean differences were estimated by subtracting average scores at follow-up from average scores at baseline, and dividing the result by the pooled standard deviation, with a small sample correction taking each sample size into consideration (Hedges’ unbiased g).Reference Hedges34 This approach is used to avoid the overestimation of the magnitude of effect in meta-analyses of longitudinal studies.Reference Dunlap, Cortina, Vaslow and Burke35 Where more than three studies reported data for the same cognitive task, individual measure meta-analyses were performed. This has the advantage of identifying changes in specific cognitive processes (e.g. immediate versus delayed recall) and limits heterogeneity. Where samples overlap, the study with the largest overall sample size was included. In addition, to establish the presence of cognitive impairment at baseline, patient and healthy control groups were compared on baseline cognitive function for each domain using the same analytic procedure, but with the inclusion of baseline cognitive scores for each group.
Distributions of effect sizes were expected to be heterogeneous in cognitive studies in this population, so a random-effects model (DerSimonian–Laird estimate) was used, with effect sizes weighted using the inverse variance method. Heterogeneity of effect size distributions was measured using the Q-test, and the extent of heterogeneity reported using the I 2 statistic.Reference Higgins, Thompson, Deeks and Altman36 I 2 quantifies the percentage of total variation across studies considered to be because of heterogeneity rather than chance. I 2 values indicate low (25%), moderate (50%), and (75%) high heterogeneity.Reference Higgins, Thompson, Deeks and Altman36 In analyses with at a minimum of ten studies, publication bias was assessed using inspection of forest plots and Egger's test. All results in the analyses are presented so that positive values reflect improved performance at follow-up.
Subgroup sensitivity analyses
The Q bet-test was used to compare patient versus healthy control change in cognition, and additionally to compare non-affective only versus mixed-affective studies. This method maximises the number of included studies, although it can inflate the variance of the estimates. As such, a sensitivity analysis was also performed calculating patient–control effect size for change (standardised mean difference in change scores from baseline to follow-up) for only studies including both patient and control groups. Results were screened for outliers using the find.outlier function in the {dmetar} package that implements an outlier detection and removal algorithm of studies with confidence intervals that do not overlap with the confidence interval of the pooled effect. Outlier influence on results was investigated using leave-one-out analysis (effect size and I 2). Domain and task outliers are listed in Supplementary Table S2.
Meta-regressions
To identify the influence of potential moderators, meta-regression analyses were conducted for cognitive domains, using a restricted-maximum-likelihood random-effects model. Potential moderators of change were, mean age, percentage of male participants, mean years of education, percentage of participants on antipsychotic medication, study follow-up duration (months), change in positive and negative symptom severity scores (% after adjusting for non-zero floor), and baseline positive and negative symptoms scores. Where there were not enough studies with data to meet the minimum threshold for analysis (n < 10)Reference Higgins, Thompson, Deeks and Altman36 meta-regression analyses were not performed. Studies including only age-adjusted scores, were not included when considering age as a moderator.
Results
Search outcome
After removing duplicates, intervention studies, papers that only measured cognition once, did not use standardised tests or did not fulfil other criteria, 25 full-text papers from the 1780 abstracts were left for the meta-analyses (see Fig. 1 and Supplementary Table S3). Owing in part to the inclusion criteria (e.g. use of standardised tests), all studies scored similarly on the Newcastle–Ottawa scale and were of good quality. Sensitivity analyses by study quality were therefore not performed.
Sample demographics
Included studies had 1480 participants with early psychosis, a mean age 24.85 years, 62.84% of participants were men and they had a mean of 20.76 months between baseline and follow-up assessments. For comparison, 13 studies included data for healthy controls, with 789 participants, a mean age of 25.45 years, an average of 60.82% men, and with 19.60 months between baseline and follow-up assessments. Where data were available, patients showed a mean reduction of 48.9% in positive symptoms and 18.7% in negative symptoms between baseline testing and follow-up. Study characteristics are detailed in Supplementary Table S3.
Patient–control differences at baseline
Large cognitive impairments were present in the patient group compared with healthy controls at baseline, including for global cognition (g = 0.85) (Table 1). See Supplementary S4 for further description.
a. All effect sizes indicate better performance in healthy controls. b. Estimated effect (g) difference in performance between patients and healthy controls at baseline. Q is the measure of the heterogeneity of the distribution of effect size. I2 quantifies the percentage of total variation across studies because of heterogeneity. Bias is the P-value of Egger's test.
Change in cognition
Patient samples showed significant improvement in 7 out of the 11 domains (Table 2), including global cognition (Fig. 2) (see Supplementary S5 for further description). In comparison with healthy controls, effect sizes of change were identical for global cognition (0.25) and the Q bet-test was non-significant for all domains (Table 2). The Q bet tests for verbal learning and memory remained non-significant even after separate analysis with the exclusion of clear outliers (Q bet = 0.23, P = 0.63). Average follow-up period was similar for patients and controls (Supplementary Table S4). Results did not differ in the sensitivity analyses comparing standardised mean difference in change scores from baseline to follow-up, in only studies including both patient and control groups (Supplementary Table S5).
a. Estimated effect (g) improvement in performance from first assessment to follow-up. Q is measure of the heterogeneity of the distribution of effect size. I2 quantifies the percentage of total variation across studies due to heterogeneity. Bias is the P-value of Egger's test. Qbet is the comparison of change in patients versus healthy controls. Results reported before any exclusion of outliers. Forest plots for each domain are available in Supplementary S6.
b. Not enough data.
Moderators
There were no significant moderators, except for working memory, where a higher percentage of men was associated with less improvement between time points (z = 2.14, P = 0.03). In the sensitivity analysis comparing non-affective versus mixed samples, there was significantly less improvement in attention and vigilance in the mixed-affective studies (Q = 5.14, P = 0.02). All other comparisons were non-significant (Supplementary Tables S7 and S8).
Task-specific analyses
Qbet-test comparisons with healthy control samples were non-significant for all tasks and remained non-significant even after the exclusion of clear outliers. Further description of change in patients and healthy controls can be found in Supplementary S9.
Discussion
Main findings
This meta-analysis is the first, to our knowledge, to comprehensively examine cognitive change in the early years following the onset of psychosis across all cognitive domains in studies of good quality and little risk of bias from treatment and practice effects.
The patient groups were impaired on all cognitive domains at baseline, with large effect sizes for global cognition and large or medium effect sizes for all other domains. Over time, only modest improvements were seen in global cognitive performance, with an effect size identical to that in healthy controls, indicating that any improvement may be an artefact of practice. Small improvements were also seen across specific domains and tasks, including social cognition, but improvement did not significantly differ from that in healthy controls despite healthy controls already performing significantly better across all tasks. There were insufficient data to examine subdomains of social cognition (theory of mind, attribution bias, social perception, emotion perception/recognition) with further research needed to discern whether there is stability or change. Lack of significant improvements were in the same domains in both the healthy controls and patient groups (construction and visuospatial skills, and visual learning and memory), and those where patients are thought to be less impaired (motor skills). The exception was verbal fluency, where, in contrast to controls, patients showed no significant improvement. Improvement in symptoms did not moderate change in global cognition, in keeping with findings of other studies.Reference Leeson, Barnes, Hutton, Ron and Joyce40, Reference Hoff, Svetina, Shields, Stewart and DeLisi51 Somewhat surprisingly, sensitivity analysis showed significantly less improvement in attention and vigilance in mixed (affective and non-affective) studies, compared with non-affective-only samples. Results were not undermined by publication bias or high heterogeneity.
Comparison with the existing literature
Overall, these findings support the neurodevelopmental risk-factor hypothesis and earlier longitudinal and cross-sectional meta-analyses.Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman6,Reference Bora and Murray17 Much of the cognitive impairment seen in psychosis occurs prior to the onset of illness and remains relatively stable following symptom onset. Despite the exclusion of controlled trials, effect sizes for domain improvements were comparable with those of Bora & Murray (2014)Reference Bora and Murray17 (0.02–0.31 in this analysis versus 0.13–0.38 in Bora & MurrayReference Bora and Murray17). Both studies showed least improvement compared with controls in processing speed, working memory and verbal fluency. Greatest improvement was seen in executive function, using the Wisconsin Card Sorting Test, potentially indicating greatest malleability in problem-solving skills.
Given that large-scale meta-analyses have shown cognitive deficits in childhood or adolescence prior to detectable symptomsReference Woodberry, Giuliano and Seidman5,Reference Mollon, David, Zammit, Lewis and Reichenberg7,Reference Mollon and Reichenberg53 with greater deviation from controls at the time of first episode,Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman6 there remains the question of when cognitive decline occurs. One explanation accounting for the discrepancy between pre- and post-onset cognitive deficits, is the model of progressive neurodevelopmental impairmentReference Reichenberg, Caspi, Harrington, Houts, Keefe and Murray54 suggesting increasing cognitive ‘lag’ seen following critical developmental periods.Reference Pantelis, Yücel, Bora, Fornito, Testa and Brewer55 Others argue that cognitive worsening takes place during a prodromal period, with some studies showing evidence of cognitive worsening at this stage,Reference Lewandowski, Cohen and Öngur13 while other studies found no evidence.Reference Hawkins, Keefe, Christensen, Addington, Woods and Callahan56,Reference Keefe, Perkins, Gu, Zipursky, Christensen and Lieberman57 HarveyReference Harvey58 argues that inconsistencies may be related to the conceptualisation of when a first episode begins, with sensitive prodromal measures potentially capturing the first stages of psychotic illness, previously considered to be prodromal.
Although our meta-analysis shows no cognitive improvement above that of healthy controls, debate remains around the trajectory of cognitive impairment over longer periods. A study by Zanelli et al,Reference Zanelli, Mollon, Sandin, Morgan, Dazzan and Pilecka59 assessing individuals with a first episode of psychosis at 10-year follow-up, found that compared with healthy participants, patients with schizophrenia showed a significant decline in IQ and in tasks assessing memory and verbal knowledge, even after controlling for gender, age, ethnicity and education. Similarly, an 18-year follow-up study, Fett et al,Reference Fett, Velthorst, Reichenberg, Ruggero, Callahan and Fochtmann60 found continued cognitive decline across multiple, but not all domains, and there is evidence of a slightly larger (15–21 IQ point) deficit in those with long-standing schizophrenia.Reference Fioravanti, Carlone, Vitale, Cinti and Clare10,Reference Heinrichs and Zakzanis12,Reference Reichenberg and Harvey61 These findings raise the question of whether cognitive impairment in schizophrenia results from both abnormal development and later deterioration only detectable over long-periods, with later deterioration differing between cognitive functions. One possible explanation for this is that psychosis may lead to longer periods of restricted activity and opportunities to use cognitive skills, leading to a decline over time. Against this, other studies assessing individuals with psychosis over 10 years found no worsening of deficits after illness onsetReference Albus, Hubmann, Mohr, Tiedemann, Pechler and Drießlein62–Reference Rund, Barder, Evensen, Haahr, Hegelstad and Joa64 with some evidence that any progression may be because of medicationReference Vita, Barlati, Ceraso, Nibbio, Ariu and Deste29,Reference Joshi, Thomas, Braff, Green, Gur and Gur65,Reference Zipursky, Reilly and Murray66 or lifestyle factors.Reference Bora, Akdede and Alptekin67,Reference Stubbs, Ku, Chung and Chen68
Further complexity in assessing cognition over time comes from individual heterogeneity,Reference Joyce and Roiser69 which may be obscured when assessing group means,Reference Sánchez-Torres, Moreno-Izco, Lorente-Omeñaca, Cabrera, Lobo and González-Pinto26 as well as the potential for those who make good recoveries being underrepresented in studies with longer follow-up periods.Reference O'Keeffe, Hannigan, Doyle, Kinsella, Sheridan and Kelly70 Our analysis shows that study quality has remained high, although this is limited by the exclusion of many studies because of multiple testing within the first year (n = 22). Of the studies including non-affective samples, four (out of six) were conducted since the Bora and Murray (2014)Reference Bora and Murray17 analysis, highlighting some studies still include mixed diagnoses in their inclusion criteria, although the proportion of patients with affective diagnoses was low. There remains a need for studies to assess cognition in affective-first-episode psychosis independently. Resolving these issues is important, with implications for the course of illness, as well as consideration of those most suited to benefit from interventional strategies aimed at preserving and improving cognition in people with psychosis.Reference Vita, Barlati, Ceraso, Nibbio, Ariu and Deste29,Reference Wykes, Huddy, Cellard, McGurk and Czobor30,Reference Pantelis, Wannan, Bartholomeusz, Allott and McGorry71 Finally, evidence for comparable improvement in cognitive test performance in healthy controls has implications for assessing cognitive change both clinically and in randomised controlled trials, highlighting the importance of using control groups when drawing conclusions about intervention efficacy.
Limitations
Limited healthy control data meant comparison with patient groups was not possible for all domains or tasks. Similarly, meta-regressions and sensitivity analyses could not be performed for all domains because of inadequate data, including for antipsychotic dose, which may moderate cognitive change. Social cognition tests varied across studies, many of which have since been shown to have poor psychometric properties.Reference Pinkham, Harvey and Penn72 Grouping social cognition constructs prohibits conclusions being made about these subdomains, particularly given the potential for their separate factor structuresReference Mehta, Thirthalli, Subbakrishna, Gangadhar, Eack and Keshavan73 and neural bases.Reference Pinkham, Penn, Perkins and Lieberman74 Categorisation of cognitive tasks also remains a wider issue, with no consensus across studies as to the cognitive domains that tasks are most accurately measuring. It should also not be discounted that ceiling effects may have minimised improvement in healthy control performance, and that this might mask areas of relative deterioration in the patient group. Finally, subgroup analyses have been shown to lack statistical power to detect group differences, which might be observed with more available studies,Reference Cuijpers, Griffin and Furukawa75 however, comparison of confidence intervals and careful interpretation do not change the conclusions of this analysis.
Implications
We found no evidence of cognitive decline in the early years following psychosis onset. Small improvements were observed across individual cognitive domains and tasks, but these were in line with those seen in healthy comparison groups and are likely an artefact of practice effects. Further data are needed to evaluate the course of social cognition subdomains following onset of psychosis. We conclude that there is no evidence of continued cognitive decline or improvement in the early years following psychosis onset, with a need for more studies over longer follow-up periods. Evidence of practice effects highlights the importance of including control samples in longitudinal and intervention studies measuring cognition at multiple time points.
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1192/bjp.2022.131
Data availability
The data that support the findings of this study are available from the corresponding author, A.J.W., upon reasonable request.
Author contributions
Conception and design of the work was by A.J.W., M.C., T.W. and A.P. Data search and extraction were performed by A.J.W. and L.H. Analyses were conducted by A.J.W., with supervision by M.C. and A.P. All authors made significant contributions to drafting and/or revising the manuscript.
Funding
This research received no specific grant from any funding agency, commercial or not-for-profit sectors. T.W. acknowledges the support of the NIHR Maudsley Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London and the NIHR-PGfAR RP-PG-0612-20002. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.
Declaration of interest
None.
eLetters
No eLetters have been published for this article.