INTRODUCTION
Cognitive tasks are used during functional magnetic resonance imaging (fMRI) to probe for signs of aberrant neuronal functioning in patients diagnosed with schizophrenia (SZs). However, since brain responses vary with task performance, and because performance differs between patients and healthy control subjects (HCs), it is often unclear whether different patterns of neuronal response between groups should be attributed to intrinsic differences in brain function, differences in performance, or both (Brown & Eyler, Reference Brown and Eyler2006; Callicott et al., Reference Callicott, Mattay, Verchinski, Marenco, Egan and Weinberger2003; Manoach, Reference Manoach2003).
Typically, one of two approaches is used to understand the relationship between performance and neuronal activity differences between groups. The first approach is to examine neuronal activity as a function of task difficulty (e.g., Cohen et al., Reference Cohen, Perlstein, Braver, Nystrom, Noll, Jonides and Smith1997; Jansma, Reference Jansma2004). This approach is able to characterize different patterns of activity across task difficulty levels but obscures the impact of group performance levels on brain activation. The second approach is to examine neuronal activity as a function of marginal (or average) task performance levels (e.g., Karlsgodt et al., Reference Karlsgodt, Sanz, van Erp, Bearden, Nuechterlein and Cannon2009; Manoach et al., Reference Manoach, Press, Thangaraj, Searl, Goff, Halpern and Warach1999). This approach is able to characterize patterns of activity across task performance levels but obscures the impact of task difficulty levels on neuronal activity. Consequently, neither of these approaches examines differences in both the pattern and level of neuronal activity between patient and control groups. We aimed to resolve this problem using a novel methodology: conditional performance analysis. That is, the examination of neuronal activity as a function of performance within specific difficulty levels.
Neuroimaging of Working Memory in Schizophrenia
Working memory (WM) is defined as the “… mechanisms or processes that are involved in the control, regulation and active maintenance of task-relevant information in the service of complex cognition…” (Miyake & Shah, Reference Miyake, Shah, Akira and Priti1999, p. 450). WM deficits are a core feature of schizophrenia (Barch & Smith, Reference Barch and Smith2008; Kalkstein, Hurford, & Gur, Reference Kalkstein, Hurford and Gur2010; Lee & Park, Reference Lee and Park2005), as well as a predictor of poor functional outcome (Green, Kern, Braff, & Mintz, Reference Green, Kern, Braff and Mintz2000). In the attempt to better understand WM deficits, researchers often rely on neuroimaging methods, especially fMRI (Brown & Eyler, Reference Brown and Eyler2006).
Unfortunately, inconsistencies in the properties of tasks used can complicate interpretations of brain imaging results (Gur, Erwin, & Gur, Reference Gur, Erwin and Gur1992). This is due, in part, to nonlinear neuronal response functions. Early fMRI research demonstrated that increased task difficulty results in increased brain response in cortical regions thought to be involved in WM (Desmond, Gabrieli, Wagner, Ginier, & Glover, Reference Desmond, Gabrieli, Wagner, Ginier and Glover1997; Manoach et al., Reference Manoach, Schlaug, Siewert, Darby, Bly, Benfield and Warach1997). However, an early PET imaging study by Goldberg and colleagues suggested that neuronal activity is not monotonic, but rather can also decrease as demands become excessive (Reference Goldberg, Berman, Fleming, Ostrem, Van Horn, Esposito and Weinberger1998). More specifically, Callicott and colleagues (Reference Callicott1999) found an inverted-U-shaped response pattern in the dorsolateral prefrontal cortex (DLPFC). Presumably, task disengagement—the descending slope of the inverted-U—is related to high computational costs (Westbrook & Braver, Reference Westbrook and Braver2015); that is, if the examinee deems that cognitive effort is unlikely to result in accurate performance (and reward), they might disengage resources.
Studies comparing the DLPFC response between patients and controls suggest that nonlinearity can confuse the comparison of neuronal activity between groups. Specifically, because patients are assumed to have “left-shifted” inverted-U-shaped neuronal response functions—that is, the neuronal response peaks at lower levels of difficulty—using tasks with varying levels of difficulty between studies can produce contradictory patterns. For example, Glahn et al. conducted a meta-analysis of fMRI research using a popular WM task (N-back) and concluded that SZs demonstrate hypoactivation in the DLPFC (Glahn et al., Reference Glahn, Ragland, Abramoff, Barrett, Laird, Bearden and Velligan2005). However, this conclusion was challenged by a significant minority of studies finding hyperactivation (Brown & Thompson, Reference Brown, Thompson and Swerdlow2010). Presumably, evidence that SZs show less neuronal activity during task engagement is an artifact that reflects examinees’ experience of being overwhelmed by task difficulty. Manoach and colleagues, for example, suggested that, because “… DLPFC activation may reflect WM demand only up to the point at which the demands begin to outstrip WM capacity” (Manoach et al., Reference Manoach, Press, Thangaraj, Searl, Goff, Halpern and Warach1999, p. 1135), combined with the fact that patients demonstrate worse performance, “… subjects may [engage in] cognitive and affective processes that [are] unrelated to WM [such as] error monitoring, attempts at compensation, disengaging from the task, feeling overwhelmed, and guessing” (Manoach et al., Reference Manoach, Press, Thangaraj, Searl, Goff, Halpern and Warach1999, p. 1129).
Studies have provided mixed support for the inverted-U model of neuronal activity in both HCs and SZs, particularly as they relate to the theory that, after accounting for the “left-shift,” patients demonstrate inefficient (i.e., greater) activity (Callicott et al., Reference Callicott, Bertolino, Mattay, Langheim, Duyn, Coppola and Weinberger2000; Jansma, Reference Jansma2004; Karlsgodt et al., Reference Karlsgodt, Sanz, van Erp, Bearden, Nuechterlein and Cannon2009; Manoach et al., Reference Manoach, Gollub, Benson, Searl, Goff, Halpern and Rauch2000; Potkin et al., Reference Potkin, Turner, Brown, McCarthy, Greve and Glover2009; Van Snellenberg et al., Reference Van Snellenberg, Girgis, Horga, van de Giessen, Slifstein, Ojeil and Abi-Dargham2016; Zou, Gu, Wang, Gao, & Yang, Reference Zou, Gu, Wang, Gao and Yang2011). The methodological challenge is clear: How should differences in neuronal activity be interpreted between patient and control groups given that task performance also differs? Some researchers focus on fMRI data produced by items or tasks where performance does not differ between groups. This might include selecting subsets of individuals from each group that are matched on performance, parametrically manipulating task conditions on an individual basis, or evaluating only correct trials in event-related designs (Brown & Eyler, Reference Brown and Eyler2006; Jansma, Reference Jansma2004; Potkin et al., Reference Potkin, Turner, Brown, McCarthy, Greve and Glover2009). However, matching performance between groups can artificially create a scenario that does not reflect the original abnormality, thus creating ambiguity when interpreting neurophysiological results (Zhou et al., Reference Zhou, Pu, Wang, Liu, Wu, Liu and Liu2016). More broadly, this problem is known as the performance–activation dilemma (Brown & Eyler, Reference Brown and Eyler2006) and poses a significant obstacle to the development of comprehensive, brain-cognitive models of mental illness.
The Inverted-U: Sources of Misinterpretation
Examining neuronal activity differences between patient and control groups without considering group differences in the match between ability and task difficulty can be counterproductive. To help explain why, Figure 1 presents several possible scenarios based on the inverted-U model of neuronal activity. To simplify the analysis, the model assumes that the relationship between neuronal activity and task difficulty is a normal distribution with equal group variances and with the mode of the schizophrenia group “left-shifted” toward less difficult task conditions compared to controls. Note that in both groups, it is assumed that activity increases with difficulty up to some critical point and then decreases. Not all tasks produce inverted-U-shaped activity; thus, here we are limiting our discussion to tasks where the relationship between difficulty and brain activation follows this theoretical model.
Figure 1 plots neuronal activity as a function of three levels of task difficulty (non-shaded region of each plot): a low difficulty task, a moderate difficulty task, and a high difficulty task. The left panel shows the pattern of hyperactivation among SZs when overall task difficulty is low. In this scenario, the SZs are expected to show greater neuronal activity than HCs, because, whereas SZs are moderately challenged by the task, HCs are only very weakly challenged. In other words, left panel is indicative of a task that is too easy from HCs but appropriately difficult for SZs. The middle panel shows the pattern of no difference in activity between SZs and HCs when overall task difficulty is moderate. SZs are highly challenged and HCs are beginning to be moderately challenged. However, because mirror sides of the inverted-U functions are engaged, the expected result is no overall group differences in neuronal activity. Finally, the right panel shows the pattern of hypoactivation among SZs—which is most often reported in the literature—that occurs when overall task difficulty is high. The task in this scenario is too hard for SZs but appropriately difficult for HCs.
In this paper, we test the assumption that examining neuronal activity as a function of performance conditioned on task difficulty (i.e., WM load) will resolve these interpretative challenges. Conditional performance analysis requires the investigator to estimate individual subjects’ performance (e.g., accuracy) at each level of task difficulty (e.g., each level of WM load) and then to analyze the data as a repeated measures design. The key argument for using this approach is that neuronal activity should be examined as a function of conditional performance and not task difficult or even marginal (i.e., average) performance.
Study Aims
Our study had three aims. The first aim was hypothesis-driven and the second and third aims were exploratory. The first aim was to demonstrate the use of conditional performance analysis for examining and comparing load-response curves between groups of patients and controls. We hypothesized that conditional performance analysis would uniquely and consistently reveal nonlinear (inverted-U) shaped neuronal activity in both groups. The second aim was to compare brain response between SZs and HCs in order to explore the neural inefficiency model of WM deficits in schizophrenia. The third aim was to explore demographic, clinical, and cognitive correlates of inefficient neuronal activity. We also conducted whole-brain, voxel-wise exploratory analyses of group and group-by-conditional performance effects.
METHODS
Participants
Data were collected from 27 SZs (schizophrenia or schizoaffective disorder) and 25 HCs. We aimed to recruit male and female adults aged 18–70 years with adequate hearing and eyesight who were fluent in English and able to perform the cognitive and imaging tasks required. Exclusion criteria were inability to understand or give consent, positive drug toxicology screen (except marijuana), alcohol or substance dependence in the preceding 6 months, pregnancy, contraindications and conditions incompatible with MRI, previous significant head injury, significant extrapyramidal symptoms or tardive dyskinesia, and significant medical or neurological diagnoses (except diabetes). Controls were additionally excluded if they were enrolled in special education courses during school or met diagnostic criteria for any psychosis spectrum disorder or bipolar disorder. Clinical diagnoses were verified using a structured clinical interview administered by a licensed psychologist (First, Williams, Karg, & Spitzer, Reference First, Williams, Karg and Spitzer2015). We did not exclude left-handed participants because left-handedness tends to be higher among people with psychosis (Webb et al., Reference Webb, Schroeder, Chee, Dial, Hana, Jefee and Molitor2013), and thus we did not want to limit the generalizability of our results. The final proportion of left-handed individuals was nearly identical between HCs (3/25; 12%) and SZs (4/27; 15%), and Fisher’s exact test indicated no significant association between handedness and group. Written consent was obtained from all participants. Research procedures were reviewed and approved by the UC San Diego Institutional Review Board.
Experimental and Behavioral Task Design
The experimental design was a two-by-five mixed factorial. The between-subjects factor was population (i.e., HC vs. SZ; two levels). The within-subjects factor was WM load (five levels). To manipulate load, participants were administered the N-back task. The N-back task is a forced-choice measure of WM that requires examinees to monitor a continuous stream of stimuli (pseudowords) and respond each time an item is repeated from N before. This task was chosen because it is commonly used to study brain and cognitive functioning in SZs (Glahn et al., Reference Glahn, Ragland, Abramoff, Barrett, Laird, Bearden and Velligan2005). We created three N-back runs, each consisting of five blocks of trials (i.e., 1- through 5-back load conditions). One run was administered outside of the scanner and the remaining two were administered within the scanner. Blocks were counterbalanced over runs. The task was administered using PsychoPy (Peirce et al., Reference Peirce, Gray, Simpson, MacAskill, Höchenberger, Sogo and Lindeløv2019). Pseudowords were presented in white font on a black background for 2500 ms with a 500 ms inter-item interval. The timing was constrained so that each block would last exactly 60 s. Blocks were separated by 20 s intervals (with a fixation cross). Additional experimental and task design details are provided in the Supplemental Materials.
Clinical and Cognitive Measures
Symptoms were assessed using the Scale for the Assessment of Negative Symptoms (SANS) and Scale for the Assessment of Positive Symptoms (SAPS) (Andreasen, Reference Andreasen1984b, Reference Andreasen1984a). Cognitive measures included Trail Making Test: Part A (TMT), Brief Assessment of Cognition in Schizophrenia Symbol Coding (BASC), Hopkins Verbal Learning Test (HVLT), Letter-Number Sequencing (LNS), and Category (Animal) Fluency from the NIMH MATRICS Consensus Cognitive Battery (MCCB) (Green et al., Reference Green, Nuechterlein, Gold, Barch, Cohen, Essock and Marder2004). We also administered the Wide Range Achievement Test 3rd Edition (WRAT-3) Reading subset. Effort was objectively measured using the Rey 15-Item task with the extended recognition trial (Boone, Salazar, Lu, Warner-Chacon, & Razani, Reference Boone, Salazar, Lu, Warner-Chacon and Razani2002) and subjectively measured using the NASA Task Load Index (Hart & Staveland, Reference Hart and Staveland1988). Functioning was measured using the Role Functioning Scale (Goodman, Sewell, Cooley, & Leavitt, Reference Goodman, Sewell, Cooley and Leavitt1993).
Image Acquisition
Participants were scanned using a General Electric (GE) Discovery MR750 3.0 Tesla whole-body imaging system and a Nova 32-channel head coil. Anatomical scans were based on a T1-weighted spoiled gradient echo sequence with fast and prospective motion correction imaging options (TR = 7.4 ms; TI = 1060 ms; TE = 2.3 ms; flip angle = 8°; FOV = 25.6 cm; matrix size = 320 × 320; in-plane resolution = .8 mm; slice thickness = .8 mm; slices = 204; slice spacing = 0) acquired parallel to the sagittal plane in an interleaved manner. Functional scans sensitive to the T2-weighted blood-oxygen-level-dependent (BOLD) signal were collected using a gradient echo pulse sequence with multiband and echo-planar imaging options (TR = 800 ms; TE = 25 ms; flip angle = 52°; FOV = 20.8 cm; matrix size = 86 × 86; in-plane resolution = 2.42 mm; slice thickness = 2.4 mm; slices = 10 [60 effective]; slice spacing = 0; multiband factor = 6) acquired parallel to the intercommissural (AC-PC) plane in an interleaved manner.
Image Processing and Regions of Interest (ROIs)
We used software from Analysis of Functional NeuroImages (AFNI; Ver. 18.1.14) (Cox, Reference Cox1996) and FMRIB Software Library (FSL; Ver. 5.0.10) (Jenkinson, Beckmann, Behrens, Woolrich, & Smith, Reference Jenkinson, Beckmann, Behrens, Woolrich and Smith2012) to preprocess the structural and functional images. A detailed description of preprocessing steps is included in the Supplemental Materials. Briefly, images were processed using a pipeline that included segmentation, distortion correct, despiking, alignment and co-registration of the functional images to the structural images, detection of outliers, and blurring. To account for physiological motion, respiration and cardiac activity were acquired in parallel with the functional images and converted to sines and cosines of the first- and second-phase cycles modeling of the physiological activity (Glover, Li, & Ress, Reference Glover, Li and Ress2000). Using AFNI’s 3dDeconvolve tool, a general linear model (GLM) was then applied to each participant’s co-registered functional images and movement time series data (ignoring censored values). The GLM analysis incorporated covariates accounting for linear, quadratic, cubic, and quartic drift, six motion parameters, eight physiological noise regressors, and the reference functions. The reference functions were vectors representing the behavioral paradigm convolved with a model of the hemodynamic response using a gamma function.
To prepare for group analyses, individual statistical maps reflecting parameter estimates for each simple contrast (1-, 2-, 3-, 4-, and 5-back vs. low-level fixation baseline) were rescaled to reflect percent signal change. We then removed non-brain tissue from the functional maps using AFNI’s 3dresample tool with masks based on the structural images. AFNI’s adwarp tool was then used to warp the masked individual statistical maps into Talairach space using the ICBM-452 brain template (Rex, Ma, & Toga, Reference Rex, Ma and Toga2003).
Regions of interest (ROIs) were based on results from a quantitative meta-analysis on the N-back task using the activation likelihood estimates (Owen, McMillan, Laird, & Bullmore, Reference Owen, McMillan, Laird and Bullmore2005). Specifically, using the meta-analysis coordinates, we selected ROIs in regions traditionally considered to be part of the DLPFC, ventrolateral prefrontal cortex (VLPFC), parietal association cortex (PAC), and dorsal midcingulate cortex (dMCC). The first two ROIs were in the DLPFC. The first was centered near the anterior portion of the left middle frontal gyrus (aMFG) primarily near Brodmann area 10 (RAI x = 38, y = −44, z = 20; radius = 6 mm) and the second was centered near the posterior portion of the left middle frontal gyrus (pMFG) primarily near Brodmann area 46 (RAI x = 44, y = −18, z = 22; radius = 14.2 mm). The third ROI, a marker of the VLPFC, was located near the opercular part of the inferior frontal gyrus (IFG) primarily near Brodmann area 44 (RAI x = 50, y = −12, z = 5; radius = 11.3 mm). The fourth ROI was centered near the dMCC primarily near Brodmann area 32 (RAI x = 2, y = −12, z = 42; radius = 9 mm). The fifth ROI, a marker of the PAC, was centered near the left inferior parietal lobule (IPL)—mostly the supramarginal gyrus—primarily near Brodmann area 40 (RAI x = 34, y = 48, z = 38; radius = 10.7 mm).
Analyses
Activation differences between groups in the ROIs were analyzed using linear mixed-effect models (Hox, Reference Hox2010) within the R “lme4” package (Bates, Mächler, Bolker, & Walker, Reference Bates, Mächler, Bolker and Walker2015). We regressed percent signal change in the BOLD response within each ROI onto the fixed effects of group, a single type of load performance regressor, and the interaction of group by load performance. Group was contrast-coded. Performance was operationalized as discriminability (d′) from an equal variance signal detection-item response model (Thomas et al., Reference Thomas, Brown, Gur, Moore, Patt, Risbrough and Baker2018). d′ is an index of performance accuracy, but one that has been adjusted for response bias. The load performance regressor was either: (1) N-back load: 1–5; (2) marginal performance: d′ averaged across load; or (3) conditional performance: d′ calculated for each load level. In other words, whereas the N-back load regressor only varied over conditions (e.g., 1-back), the marginal performance regressor varied over subjects (e.g., Subject 1’s d′, Subject 2’s d′, etc.), and the conditional performance regressor varied over subjects within conditions (e.g., Subject 1’s d′ at 1-back, Subject 2’s d′ at 1-back, Subject 1’s d′ at 2-back, Subject 2’s d′ at 2-back, etc.). In all cases, the load performance regressor was coded using orthogonal linear contrasts. Random intercepts and slopes for run and order effects were included in all models. To determine whether nonlinear models fitted the data better than linear models, we compared the overall fit of models with linear or both linear and quadratic terms using the Akaike information criterion (AIC; smaller is better) and chi-square difference tests (Δχ2). Significance values were adjusted for multiple comparisons using the false discovery rate correction (Benjamini & Hochberg, Reference Benjamini and Hochberg1995).
Separate from our ROI analyses, exploratory whole-brain, voxel-wise analyses were performed using the AFNI equitable thresholding and clustering (ETAC) technique to limit the potential false-positive rate to 5% (Cox, Reference Cox2019). ETAC was implemented as part of the 3dttest+ tool with clustering based on voxels touching at faces or edges, two-sided t-tests, p values of .05, .01, and .001 with no added blur. We conducted both one-sample and independent two-sample t-tests. Separate analyses were conducted with the dependent variable defined either as the average BOLD response or as the quadratic BOLD response predicted from conditional performance.
RESULTS
Table 1 reports demographic and clinical characteristics of both samples. SZs and HCs differed significantly on years of education and WRAT scores, but not on age, gender, ethnicity, race, or years of parents’ education. In terms of N-back task accuracy, SZs performed significantly worse overall (t(50) = −3.586, p = .001, Cohen’s d = −1.00), as well at all levels of N-back load, with Cohen’s ds of −.89, −.63, −.58, −.53, and −.48 for 1- through 5-back, respectively. Additionally, a nonparametric (Kolmogorov–Smirnov) test of equality between raw accuracy distributions for SZs and HCs indicated a significant difference (D = .47, p = .002). Estimates of memory discriminability (d′) and bias (C c ) over N-back load by group are plotted in Figure 2.
Note: Means and standard deviations are reported for continuous variables. Counts and percentages are reported for discrete variables. Groups were compared using regression for continuous variables and Fisher’s exact test for categorical variables. Education is in years completed. SANS = Scale for the Assessment of Negative Symptoms reported as total global rating scores; SAPS = Scale for the Assessment of Positive Symptoms reported as total global rating scores; WRAT = Wide Range Achievement Test.
Aim 1: Nonlinear Neuronal Activity
Comparisons of models that included linear versus nonlinear (both linear and quadratic) terms are reported in Table 2. Predictions based on the best-fitting model for each ROI are reported in Figure 3. Corresponding estimates of regression parameters and effect sizes with confidence intervals are reported in Supplemental Table 1.
Note: aMFG = anterior middle frontal gyrus; pMFG = posterior middle frontal gyrus; IFG = inferior frontal gyrus; dMCC = dorsal mid cingulate cortex; IPL = inferior parietal lobule; AIC = Akaike information criterion; p FDR = false discovery rate corrected p-value; ROI = region of interest. Statistically significant effects are highlighted in bold font.
For the prediction of neuronal activity from N-back load, nonlinear activity was only found in the left IPL. All other comparisons indicated linear patterns of activity. In the left IPL, the patterns of activity were mixed for N-back load, with HCs, in blue, showing a negative quadratic trend (i.e., inverted-U-shaped), and SZs, in red, showing a positive quadratic trend (i.e., U-shaped). For the prediction of neuronal activity from marginal performance, none of the nonlinear models fitted the data significantly better than models that only included linear terms. As reported in Figure 3, neuronal activity was positively associated with marginal performance for SZs and showed a mixed pattern of positive and negative trends for HCs. For the prediction of neuronal activity from conditional performance, all but one ROI favored the nonlinear model. The one exception was the left IFG. For all other ROIs, Figure 3 shows clear evidence of negative quadratic trends (i.e., inverted-U-shaped activity) in both groups.
Aim 2: Neuronal Inefficiency in Schizophrenia
Neuronal inefficiency is operationalized as one group—such as patients—needing to produce greater neuronal activity to achieve a similar level of performance as another group—such as controls. Thus, neuronal inefficiency should be evidenced by large group main or interaction effects in the conditional performance domain. In Figure 3, neuronal inefficiency should be evidenced by diverging patterns of neuronal activity (and also by meaningful group effect sizes for conditional performance in Supplemental Table 1).
Contrary to the neuronal inefficiency model, the patterns of activity are mostly overlapping, with the possible exception of the left aMFG. That is, as reported in Supplemental Table 1, most group and group interaction effect size parameters were small, and their confidence intervals included zero. However, there was a medium-sized group-by-quadratic interaction effect with a nonzero confidence interval in the left aMFG. Specifically, SZs showed a more negative quadratic slope. In Figure 3, it can be seen that SZs produced more peaked activity in the aMFG in comparison to HCs, who produced a more muted, flatter pattern of activity.
Aim 3: Correlates of Inefficient Neuronal Activity
To better understand the meaning of aMFG activity, we explored the demographic, cognitive, and clinical correlates of BOLD response within the ROI in comparison to activity observed within the pMFG. We operationalized “inefficient” activity as SZ BOLD signal that was in excess to the signal observed among HCs at individually matched levels of N-back load.
Specifically, within the conditional performance domain (bottom row of Figure 3), we noted that peak BOLD activity generally occurred between discriminability (d′) values of 2–3. This is equivalent to a performance accuracy range for target items of approximately 69%–84% (after accounting for response bias). Matched load was defined as the N-back condition that was likely to produce discriminability (d′) values of 2.5 (˜77% performance accuracy) for each subject.
Figure 4 presents correlations (with 95% confidence intervals) of “inefficient” BOLD activity in the pMFG and aMFG with demographic, clinical, and cognitive variables. Only the group correlation (first line) includes HCs; all other correlations are based on the SZ group alone. The general descriptive pattern suggests that while greater BOLD activity was broadly associated with better health (e.g., younger, fewer symptoms, and better cognitive functioning), the pattern of association is stronger for the pMFG relative to the aMFG.
Whole-Brain, Voxel-Wise Analyses
Results of whole-brain, voxel-wise analyses indicated significant group differences in BOLD activity overall, but no significant group-by-quadratic conditional performance effects. Panel A of Figure 5 plots results from whole-brain, voxel-wise analyses comparing average activity collapsed over groups across all WM load conditions to the low-level baseline. Colors indicate effect sizes presented as Cohen’s d. Redder colors indicate greater BOLD signal relative to baseline. Bluer colors indicate less BOLD signal relative to baseline. Only voxels surviving false-positive rate corrections (ETAC) are shown in color. The results indicate bilateral increases in BOLD signal within regions generally considered to be part of the task-positive network (e.g., frontal eye fields, DLPFC and VLPFC, dMCC, and IPL). Concurrently, there are bilateral decreases in BOLD signal within regions considered to be part of the task-negative or default mode network (e.g., posterior cingulate, medial frontal gyrus, and middle temporal gyrus). Panel B of Figure 5 plots results from whole-brain, voxel-wise analyses comparing group differences. Nearly, all of the significant voxels are within default mode brain regions, with SZs showing greater activity.
DISCUSSION
We largely confirmed our hypothesis that conditional performance analysis would uniquely reveal inverted-U-shaped neuronal activity in both patients and controls. Although analyses based on task difficulty alone revealed one ROI with inverted-U-shaped neuronal activity for HCs, results suggested the oppositive pattern for SZs. These results are consistent with the “left-shift” in that the neuronal response curves for SZs appears to peak at lower levels of difficulty. Thus, differences in neuronal activity between groups as a function of load were largely determined by (or reflected in) differences in cognitive performance.
Our second aim was to explore whether patients would show greater activity in comparison to controls (i.e., neural inefficiency) for similar levels of performance. In most ROIs, SZs demonstrated a very similar pattern of neuronal activity to HCs (bottom panel of Figure 3). Only the aMFG differed; specifically, SZs demonstrated a more peaked inverted-U-shaped load-response curve in the aMFG. Interpretively, it appears that HCs exhibited flat, modest activity in this region, and SZs exhibited highly peaked activity (i.e., both high and low activity depending on performance level).
Our third aim was to explore demographic, clinical, and cognitive correlates of inefficient neuronal activity. Perhaps surprisingly, “inefficient” BOLD activity in the aMFG appears to be a sign of relatively better health in SZs. Changes in PFC activity in both schizophrenia and aging research are sometimes interpreted as evidence of compensatory activation (Cabeza et al., Reference Cabeza, Albert, Belleville, Craik, Duarte, Grady and Rajah2018; Manoach et al., Reference Manoach, Press, Thangaraj, Searl, Goff, Halpern and Warach1999). However, our results suggest that greater pMFG activity relative to the aMFG is a stronger marker of cognitive health in schizophrenia. Thus, whether aberrant aMFG activity should be interpreted as compensation, a deficit, or both, is unclear. Ciesklik el al. (Reference Cieslik, Zilles, Caspers, Roski, Kellermann, Jakobs and Eickhoff2013) argue that that there is an anterior–ventral DLPFC that is more strongly connected to the anterior cingulate, and a posterior–dorsal DLPFC that is more strongly correlated with bilateral intraparietal sulci. Whereas the anterior–ventral DLPFC is more strongly related to attention and inhibition, the posterior network is more strongly related to action execution and WM. Interestingly, in Figure 3, it can be seen that conditional performance predicted a similarly elevated pattern of activity in the aMFG and dMCC for SZs relative to HCs. More peaked activity in the aMFG among SZs could be indicative of greater need to exert control over attention and inhibition during task engagement. Nonetheless, within patients, the suggestion from these findings is that exertion of greater cognitive control is associated with better cognitive health overall. Thus, there is at least some support for the idea of compensation. It is also possible that more diffuse activation topography reflects dedifferentiation or failure to specialize (Koen & Rugg, Reference Koen and Rugg2019).
Whole-brain, voxel-wise analyses suggested that the greatest differences in neuronal activity between SZs and HCs were in activity within the task-negative or default mode network (Raichle et al., Reference Raichle, MacLeod, Snyder, Powers, Gusnard and Shulman2001). These findings support prior work suggesting that patients fail to suppress default mode network activity during task engagement, particularly among cognitively impaired individuals (Zhou et al., Reference Zhou, Pu, Wang, Liu, Wu, Liu and Liu2016). Notably, our results do not suggest that patients were simply disengaged with the task overall. We found no significant differences in activity within the task-positive network, including the specific ROIs targeted, and no differences in inverted-U-shaped (quadratic) activity.
Limitations
A counterargument to the proposed conditional performance analysis strategy might claim that different levels of task difficulty are qualitatively different. That is, although load manipulations are common in clinical studies, some view distinct load conditions as measures of different and unique subconstructs (Ragland et al., Reference Ragland, Turetsky, Gur, Gunning-Dixon, Turner, Schroeder and Gur2002). In previous work, we have shown that the psychometric dimensionality of the N-back task is well explained by the signal detection model (Thomas, Brown, Patt, & Duffy, Reference Thomas, Brown, Patt and Duffy2020). But even so, psychometric dimensionality and neuronal response dimensionality are not one and the same (Brown, Thomas, & Patt, Reference Brown, Thomas and Patt2017). Shifts in strategy and learning over time are additional concerns (Juvina & Taatgen, Reference Juvina and Taatgen2007); although these can sometimes be managed with proper counterbalancing of load conditions, changing in strategy and learning are likely to add error to brain and cognitive measures. Another concern is that our results demonstrated the use of conditional performance analysis using a single behavioral paradigm. Additional studies are needed to determine whether the utility of this approach applies to other tasks and constructs. Also, although all participants denied recent marijuana use, we did not record positive urine screens for marijuana, and thus it is possible that marijuana use could have impacted group differences in cognitive performance or neuronal activity. Finally, it should be noted that there is a growing concern about the replicability of findings from small sample studies in the fMRI literature (Turner, Paul, Miller, & Barbey, Reference Turner, Paul, Miller and Barbey2018). To counter this concern in the current study, we chose a hypothesis-based ROI approach, performed multiple runs of our task, and used a cognitive paradigm with high individual differences reliability; all strategies that should improve the replicability of our findings (Nee, Reference Nee2019; Poldrack, Mumford, & Nichols, Reference Poldrack, Mumford and Nichols2011). Given these choices, we ultimately had enough power to detect differences in model fit that were equivalent to an increase in variance explained of just .02 (i.e., small). Additionally, as shown in Supplemental Figure 2, the false discovery rate-corrected p values in the current study followed a bimodal distribution, where the majority of nonsignificant values were far above the nominal cutoff of .05.
Summary
Results confirm the inverted-U-shaped theory of brain activation as a function of task difficulty. They further suggest that interpretations of neuronal activity differences between clinical and healthy comparison groups can be flawed when not considered within the contexts of individual and group differences in ability and task difficulty. While future research is needed to determine whether conditional performance analysis is applicable across cognitive domains, the results of this study—at a minimum—suggest that researchers should consider the use of conditional performance analysis to completement more traditional analysis strategies used in task-based fMRI research. The approach is particularly relevant to populations that differ in cognitive functioning. As demonstrated here, conditional performance analysis can be implemented for both ROI and whole-brain, voxel-wise analyses. Moreover, the results underscore the importance of cautious interpretation of brain activation differences between groups that differ on ability and call into question previous theories that use performance data to interpret the meaning of more or less neuronal activity in individuals with cognitive disorders.
SUPPLEMENTARY MATERIAL
To view supplementary material for this article, please visit https://doi.org/10.1017/S1355617721000424
FINANCIAL SUPPORT
Research reported in this publication was supported, in part, by the National Institute of Mental Health of the National Institutes of Health under award number K23 MH102420. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
CONFLICT OF INTEREST
The authors report no conflicts of interest related to this research.