Defining the p-factor: an empirical test of five leading theories

Matthew W. Southward; Jennifer S. Cheavens; Emil F. Coccaro

doi:10.1017/S0033291722001635

Defining the p-factor: an empirical test of five leading theories

Published online by Cambridge University Press: 17 June 2022

and

Matthew W. Southward*: Affiliation:
Department of Psychology, University of Kentucky, Lexington, KY, USA
Jennifer S. Cheavens: Affiliation:
Department of Psychology, The Ohio State University, Columbus, OH, USA
Emil F. Coccaro: Affiliation:
Department of Psychiatry and Behavioral Health, The Ohio State University Wexner Medical Center, Columbus, OH, USA
*: Author for correspondence: Matthew W. Southward, E-mail: [email protected]

Article contents

Rights & Permissions

Abstract

Background

Despite statistical evidence of a general factor of psychopathology (i.e., p-factor), there is little agreement about what the p-factor represents. Researchers have proposed five theories: dispositional negative emotionality (neuroticism), impulsive responsivity to emotions (impulsivity), thought dysfunction, low cognitive functioning, and impairment. These theories have primarily been inferred from patterns of loadings of diagnoses on p-factors with different sets of diagnoses included in different studies. Researchers who have directly examined these theories of p have examined a subset of the theories in any single sample, limiting the ability to compare the size of their associations with a p-factor.

Methods

In a sample of adults (N = 1833, Mage = 34.20, 54.4% female, 53.3% white) who completed diagnostic assessments, self-report measures, and cognitive tests, we evaluated statistical p-factor structures across modeling approaches and compared the strength of associations among the p-factor and indicators of each of these five theories.

Results

We found consistent evidence of the p-factor's unidimensionality across one-factor and bifactor models. The p-factor was most strongly and similarly associated with neuroticism (r = .88), impairment (r = .88), and impulsivity (r = .87), χ2(1)s < .15, ps > .70, and less strongly associated with thought dysfunction (r = .78), χ2(1)s > 3.92, ps < .05, and cognitive functioning (r = −.25), χ2(1)s > 189.56, ps < .01.

Conclusions

We discuss a tripartite definition of p that involves the transaction of impulsive responses to frequent negative emotions leading to impairment that extends and synthesizes previous theories of psychopathology.

Keywords

General factor of psychopathology impairment impulsivity neuroticism p-factor

Type: Original Article
Information: Psychological Medicine , Volume 53 , Issue 7 , May 2023 , pp. 2732 - 2743

DOI: https://doi.org/10.1017/S0033291722001635 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2022. Published by Cambridge University Press

One of the most striking and replicable findings in psychiatric epidemiology is the high rate of comorbidity among psychiatric disorders, with up to two-thirds of people who meet criteria for one disorder meeting criteria for a second (Caspi & Moffitt, Reference Caspi and Moffitt2018). This pattern of comorbidity suggests the presence of broader dimensions of psychopathology, such as internalizing (e.g., depressive, anxiety disorders), externalizing (e.g., substance use, antisocial disorders), and thought disorder (e.g., schizophrenia, paranoid personality disorder (PD), bipolar disorder; Kotov et al., Reference Kotov, Jonas, Carpenter, Dretsch, Eaton and Forbes2020; Krueger & Markon, Reference Krueger and Markon2006). However, these broader dimensions are themselves relatively highly correlated (rs: .33–.85; Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014; Krueger & Markon, Reference Krueger and Markon2006; Lahey et al., Reference Lahey, Applegate, Hakes, Zald, Hariri and Rathouz2012; cf. Wright & Simms, Reference Wright and Simms2015). Based on these correlations, researchers proposed that a single overarching dimension, or general factor, of psychopathology may give rise to psychiatric conditions (Caspi & Moffitt, Reference Caspi and Moffitt2018) or provide a more complete model of the general features of psychopathology (Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017).

Statistical evidence for a general factor of psychopathology, or p-factor, encompassing internalizing and externalizing disorders was first provided by Lahey et al. (Reference Lahey, Applegate, Hakes, Zald, Hariri and Rathouz2012) and extended by Caspi et al. (Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014) to also include psychosis. These studies generated substantial interest in verifying statistical p-factors across samples, timeframes, and measures (Smith, Atkinson, Davis, Riley, & Oltmanns, Reference Smith, Atkinson, Davis, Riley and Oltmanns2020), culminating in a recent meta-analytic factor analysis in which all specific diagnoses demonstrated loadings between .30 and .70 on a p-factor (Ringwald, Forbes, & Wright, Reference Ringwald, Forbes and Wright2021).

Despite this relatively consistent evidence demonstrating the existence of statistical p-factors, there remains little agreement about what exactly these p-factors represent (Fried, Greene, & Eaton, Reference Fried, Greene and Eaton2021). Some researchers have argued that the p-factor is a statistical, rather than a substantive, construct resulting from positively correlated components (i.e., a positive manifold; van Bork, Epskamp, Rhemtulla, Borsboom, & van der Maas, Reference van Bork, Epskamp, Rhemtulla, Borsboom and van der Maas2017) without a strong theoretical account of how it is related to psychopathology (Fried, Reference Fried2020; Murray, Eisner, & Ribeaud, Reference Murray, Eisner and Ribeaud2016). Moving to a substantive understanding of p requires tests of discriminant validity between falsifiable theories of what the p-factor is and is not related to (Fried, Reference Fried2020). However, few plausible candidates can adequately characterize such a broad construct. Smith et al. (Reference Smith, Atkinson, Davis, Riley and Oltmanns2020) identified four substantive theories of p (i.e., dispositional negative emotionality, impulsive responsivity to emotions, thought dysfunction, low cognitive functioning) and proposed a further nonspecific theory (i.e., functional impairment), which we review below.

Dispositional negative emotionality

Dispositional negative emotionality, or neuroticism, is the tendency to experience frequent and intense negative emotions in response to stressors (Barlow, Sauer-Zavala, Carl, Bullis, & Ellard, Reference Barlow, Sauer-Zavala, Carl, Bullis and Ellard2014). In factor analyses of personality dimensions, neuroticism is often the first factor extracted, explaining the most unique variability among items (Tackett et al., Reference Tackett, Lahey, van Hulle, Waldman, Krueger and Rathouz2013). Neuroticism has demonstrated consistent, medium-to-large-sized associations with mood, anxiety, substance use, eating, psychotic, somatoform, and PDs (Malouff, Thorsteinsson, & Schutte, Reference Malouff, Thorsteinsson and Schutte2005; Saulsman & Page, Reference Saulsman and Page2004) and p-factors (rs: .40–.99; Brandes, Herzhoff, Smack, & Tackett, Reference Brandes, Herzhoff, Smack and Tackett2019; Caspi et al. Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014; Levin-Aspenson, Khoo, & Kotelnikova, Reference Levin-Aspenson, Khoo and Kotelnikova2019). Furthermore, among child and adolescent twins, neuroticism was more strongly related to a p-factor than dimensions of prosociality (i.e., empathy and remorse) or daringness (i.e., sensation-seeking and risk-taking) and the genetic component of neuroticism was more strongly related to a p-factor than to either internalizing or externalizing dimensions (rs: .20–.71; Tackett et al., Reference Tackett, Lahey, van Hulle, Waldman, Krueger and Rathouz2013). Frequent experiences of negative emotions characterize nearly all psychiatric disorders; even in the case of ego-syntonic disorders (e.g., bipolar disorder, anorexia nervosa), frequent experiences of negative emotions may result from interpersonal or functional consequences of behaviors characteristic of the disorder. Thus, it is plausible that the general factor of psychopathology indexes the frequency and intensity of negative emotions.

Impulsive responsivity to emotions

Alternatively, impulsive, maladaptive responses to negative emotions may define p (Carver, Johnson, & Timpano, Reference Carver, Johnson and Timpano2017). Impulsive responses may include impulsive inaction (e.g., passive avoidance or rumination) or action (e.g., aggressive behaviors), occur without much planning, and be maladaptively overreactive in the context used. p-factors have been associated with indicators of impulsivity, such as low conscientiousness (r = −.31; Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014) and poor response inhibition (rs: −.34 to −.14; Castellanos-Ryan et al., Reference Castellanos-Ryan, Brière, O'Leary-Barrett, Banaschewski, Bokde and Bromberg2016; Martel et al., Reference Martel, Pan, Hoffmann, Gadelha, do Rosário, Mari and Salum2017). Impulsivity has predicted a range of behaviors characteristic of psychopathology including non-suicidal self-injury (Riley, Combs, Jordan, & Smith, Reference Riley, Combs, Jordan and Smith2015), posttraumatic stress disorder symptoms (Gaher et al., Reference Gaher, Simons, Hahn, Hofman, Hansen and Buchkoski2014), and substance use (Riley, Rukavina, & Smith, Reference Riley, Rukavina and Smith2016), above and beyond negative emotionality (Settles et al., Reference Settles, Fischer, Cyders, Combs, Gunn and Smith2012), suggesting that impulsive responsivity to emotions may be related to p regardless of the frequency or intensity of negative emotions.

Low cognitive functioning

Complementing these affective (i.e., negative emotionality) and behavioral (i.e., impulsivity) theories, some researchers have argued low cognitive functioning best characterizes p. p-factors have been negatively associated with IQ (rs: −.19 to −.10; Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014; Castellanos-Ryan et al., Reference Castellanos-Ryan, Brière, O'Leary-Barrett, Banaschewski, Bokde and Bromberg2016), executive functioning (rs: −.24 to −.07; e.g., attention, processing speed, visual-motor coordination; Castellanos-Ryan et al., Reference Castellanos-Ryan, Brière, O'Leary-Barrett, Banaschewski, Bokde and Bromberg2016; Martel et al., Reference Martel, Pan, Hoffmann, Gadelha, do Rosário, Mari and Salum2017), and positively associated with cognitive problems in everyday life (rs: .20–.30; e.g., concentration problems, forgetfulness, difficulties organizing tasks; Caspi & Moffitt, Reference Caspi and Moffitt2018). Low cognitive functioning may predispose people to develop psychopathology because (a) low cognitive functioning indicates neuroanatomical abnormalities that increase a person's risk for developing psychopathology; (b) low cognitive functioning increases the risk and exposure to stressors that increase the likelihood of developing psychopathology; or (c) low cognitive functioning impairs treatment-seeking and -engagement, resulting in an increased burden of psychopathology (Caspi & Moffitt, Reference Caspi and Moffitt2018). However, the magnitude of the relations between Caspi et al.'s (Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014) p-factor and IQ scores was about half as large as those between the p-factor and negative emotionality.

Thought dysfunction

By contrast, a p-factor has been shown to be almost identical to a ‘thought disorder’ factor composed of schizophrenia, mania, and obsessive-compulsive disorder (OCD; r = 0.997; Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014). Thought dysfunction includes ‘illogical, unfiltered, tangential, and reality-distorted and -distorting cognitions’ encompassing delusional beliefs, suicidal thoughts, obsessions, and difficulties making decisions (Caspi & Moffitt, Reference Caspi and Moffitt2018). Thought dysfunction frequently, but not always, demonstrates the highest loading on p-factors (λs: .26–.97; Caspi et al. Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014; Laceulle, Vollebergh, & Ormel, Reference Laceulle, Vollebergh and Ormel2015; Levin-Aspenson, Watson, Clark, & Zimmerman, Reference Levin-Aspenson, Watson, Clark and Zimmerman2020; Oltmanns, Smith, Oltmanns, & Widiger, Reference Oltmanns, Smith, Oltmanns and Widiger2018; cf. Forbes et al. Reference Forbes, Kotov, Ruggero, Watson, Zimmerman and Krueger2017; Reference Forbes, Sunderland, Rapee, Batterham, Calear, Carragher and Krueger2021b; Martel et al. Reference Martel, Pan, Hoffmann, Gadelha, do Rosário, Mari and Salum2017; Ringwald et al. Reference Ringwald, Forbes and Wright2021; Stochl et al. Reference Stochl, Khandaker, Lewis, Perez, Goodyer, Zammit and Jones2015), and p-factors have been uniquely associated with prospective suicide attempts (Hoertel et al., Reference Hoertel, Franco, Wall, Oquendo, Kerridge, Limosin and Blanco2015) and manic episodes (Lahey, Krueger, Rathouz, Waldman, & Zald, Reference Lahey, Krueger, Rathouz, Waldman and Zald2017). Although low cognitive control refers to a lack of resources to make efficient or adaptive decisions, thought dysfunction refers to the degree to which thought patterns correspond with reality. In this theory, p indicates how well a person's thought processes align with their environmental context, with more impairing thought processes (e.g., delusions, suicidal thoughts) indicating the highest elevations in p (Lahey et al., Reference Lahey, Krueger, Rathouz, Waldman and Zald2017).

Impairment

In response to concerns about whether any substantive definition could appropriately capture the range of specific dysfunctions constitutive of psychopathology (e.g., hallucinations, a lack of pleasure, talking excessively), Smith et al. (Reference Smith, Atkinson, Davis, Riley and Oltmanns2020) conceptualize the p-factor as an index of impairment (Oltmanns et al., Reference Oltmanns, Smith, Oltmanns and Widiger2018; Widiger & Oltmanns, Reference Widiger and Oltmanns2017). Conceptualizing the p-factor as impairment resolves how seemingly contrasting responses (e.g., sluggishness vs. mania) may positively load onto the same factor because high levels of each symptom can lead to functional impairment (Widiger & Oltmanns, Reference Widiger and Oltmanns2017). Lahey et al.'s (Reference Lahey, Applegate, Hakes, Zald, Hariri and Rathouz2012) p-factor, for instance, was more strongly predictive of functionally impairing outcomes (e.g., suicide attempts, psychiatric hospitalization, convictions for violence) than internalizing or externalizing dimensions.

Limitations

To date, no single study has simultaneously modeled and compared the strength of the relations between a p-factor and indicators of these five theories. Thus, researchers are left to compare the strength of associations between different studies with different samples, which may lead to biased conclusions. Furthermore, studies that do include measures of multiple theories have generally included observed correlations with a single indicator of each theory (e.g., Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014), which may lead to attenuated and less reliable estimates compared to associations among multi-indicator latent variables (Bollen, Reference Bollen1989). Directly comparing these five theories in a single sample would offer the strongest comparative test of the strength of their association with a p-factor, providing an initial evaluation of discriminant validity to strengthen the empirical foundations of the theory of the general factor of psychopathology (Fried, Reference Fried2020).

Current study

In a secondary data analysis of participants recruited to represent a range of psychopathology with a focus on intermittent explosive disorder, we explored two primary hypotheses. First, we examined three potential factor structures of a p-factor using diagnoses, symptom counts, and self-report measures of psychopathology to examine (a) which indicators demonstrated the highest loadings on the p-factor and (b) whether the pattern of loadings differed by the factor structure tested. Second, we added indicators of the five theories of p to each of these models to compare the strength of the associations between each indicator and the p-factor.

Materials and methods

Participants

The sample included 1833 community participants (M _age = 34.20 years, s.d. = 10.73) from the American Midwest. Roughly half of participants identified as female (54.4%; n = 997), with a similar number identifying as white (53.3%; n = 977). The median reported annual income was $35 000–70 000. A plurality of participants (42.1%; n = 772) had earned at least a college degree. Participants were recruited to be in either a clinical or non-clinical group. Inclusion criteria for the clinical group were: being 18 years old or older and meeting criteria for a DSM-5 [American Psychiatric Association (APA), 2013] current or lifetime syndromal (formerly Axis I) or personality disorder. Exclusion criteria involved the presence of a medical illness requiring chronic treatment (e.g., hypertension, heart disease, diabetes, cancer), active mania or substance use disorder, a lifetime diagnosis of a psychotic disorder, or intellectual disability. Inclusion criteria for the non-clinical group involved being 18 years old or older, the absence of a current or lifetime DSM-5 disorder, and the absence of a medical illness requiring chronic treatment. All participants provided informed consent before engaging in study procedures, and the study was approved by the local university Institutional Review Board.

Measures

Indicators of the p-factor

Diagnostic assessments

Participants completed the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I; First, Spitzer, Miriam, & Williams, Reference First, Spitzer, Miriam and Williams2002) to assess mood, anxiety, OCD, stress and trauma-related, eating, bipolar, substance use (including alcohol, cannabis, stimulants, and opioids), intermittent explosive, impulse-control, attention-deficit/hyperactivity (ADHD), conduct, and oppositional defiant disorders. Diagnoses were recorded as present or absent.

Participants also completed the Structured Interview for the Diagnosis of Personality Disorders (SIDP-IV; Pfohl, Blum, & Zimmerman, Reference Pfohl, Blum and Zimmerman1997) to assess PDs. Continuous symptom severity scores (items rated 0–3) were used in all models.

Interviews were conducted by master's or doctoral level clinical psychology students who exhibited good-to-excellent inter-rater reliability (average κ = .84; range: .79–.93) across diagnoses of mood, anxiety, substance use, impulse control, and PDs. Because data were originally collected using DSM-IV-TR (APA, 2000) criteria, assessors consulted information gathered during separate clinical interviews with a study psychiatrist and conducted chart reviews to update diagnoses to DSM-5 criteria. Final diagnoses were determined using best-estimate consensus procedures involving research psychiatrists and clinical psychologists (Kosten & Rounsaville, Reference Kosten and Rounsaville1992).

Self-reported psychopathology.

Depression: Participants reported on the intensity of depressive symptoms in the prior 2 weeks using the Beck Depression Inventory-II (21 items rated 0–3; Beck, Steer, & Brown, Reference Beck, Steer and Brown1996).

Anxiety: Participants reported on the intensity of anxiety symptoms in the prior month using the Beck Anxiety Inventory (21 items rated 0–3; Beck & Steer, Reference Beck and Steer1990).

Anger: Participants reported on the intensity and frequency of anger experiences in general using the State-Trait Anger Expression Inventory (10 items rated 0–4; Spielberger, Reference Spielberger1999).

Attention-deficit/hyperactivity: Participants reported on the current frequency of difficulties with attention and/or hyperactivity using a modified version of the Wender Utah Rating Scale (25 items rated 0–4; Ward, Wender, & Reimherr, Reference Ward, Wender and Reimherr1993).

Mania: Participants reported on the intensity of lifetime hypomanic, mixed mania, and depressive symptoms using the General Behavior Inventory-Biphasic subscale (28 items rated 0–3; Depue & Klein, Reference Depue, Klein, Dunner, Gershon and Barrett1988).

Psychosis: Participants reported the extent to which they had experienced ideas of reference and ideas of persecution in the past month using the Green et al. Paranoid Thoughts Scale (32 items rated 1–5; Green et al., Reference Green, Freeman, Kuipers, Bebbington, Fowler, Dunn and Garety2008).

Trauma: Participants rated how frequently they had experienced physical, sexual, and emotional abuse and neglect as children and adolescents using the Childhood Trauma Questionnaire-Short Form (28 items rated 1–5; Bernstein et al., Reference Bernstein, Stein, Newcomb, Walker, Pogge, Ahluvalia and Zule2003).

Indicators of the five theories of p

Dispositional negative emotionality: Participants reported their levels of neuroticism using two subscales from the NEO-Five Factor Inventory (Costa & McCrae, Reference Costa and McCrae1992) distinguished by Saucier (Reference Saucier1998): Self-Reproach (seven items rated 1–5) and Negative Affect (five items rated 1–5). Participants also completed the Eysenck Personality Questionnaire-Revised-Neuroticism scale (12 items rated 0–2; Eysenck & Eysenck, Reference Eysenck and Eysenck1991).

Impulsive responsivity to emotions: Participants characterized their impulsive responsivity to emotions using the five subscales of the UPPS-P Impulsive Behavior Scale (59 items rated 1–4; Lynam, Smith, Whiteside, & Cyders, Reference Lynam, Smith, Whiteside and Cyders2006): sensation-seeking, lack of premeditation, lack of perseverance, negative urgency, and positive urgency. Participants also responded to the Barratt Impulsiveness Scale-11 (30 items rated 1–4; Patton, Stanford, & Barratt, Reference Patton, Stanford and Barratt1995) and the Eysenck Personality Questionnaire-Impulsiveness scale (19 items rated 1–3; Eysenck, Pearson, Easting, & Allsopp, Reference Eysenck, Pearson, Easting and Allsopp1985).

Cognitive functioning: Assessors administered the Wechsler Abbreviated Scale of Intelligence-II (WASI-II; Wechsler, Reference Wechsler2011), a brief screen of intelligence consisting of the Vocabulary, Similarities, Block Design, and Matrix Reasoning tests from the Wechsler Adult Intelligence Scale. Responses result in Verbal and Performance IQ scores.

Thought dysfunction: Participants reported the degree of general thought dysfunction using the Eysenck Personality Questionnaire-Psychoticism scale (12 items rated 0–2; Eysenck & Eysenck, Reference Eysenck and Eysenck1991).

Impairment: Assessors who administered the diagnostic assessments documented global assessment of functioning (GAF) scores for each participant (one item rated 0–100; APA, 2013). Lower scores indicate greater impairments in functioning.

Data analytic method

Little's Missing Completely At Random (MCAR) test suggested the data were not missing completely at random, χ²(4433) = 7034.73, p < .01. Given the lack of systematic bias in the administration and completion of measures and the small correlations among observed variables and patterns of missingness (rs: .05–.20), the data may be considered missing at random (MAR). Thus, we created 100 multiply imputed datasets after 40 000 iterations using Bayesian estimation in Mplus Version 7.0 (Muthén & Muthén, Reference Muthén and Muthén1998–2012) which asymptotically produces the same estimates as maximum likelihood estimation under MAR. We examined descriptive statistics of the frequency of diagnoses and distributions of continuous variables.

Because models of the p-factor have included different combinations of disorders and measures, we first examined the fit of the p-factor using confirmatory factor analysis with weighted least square mean and variance adjusted estimation to account for the binary diagnostic data. We examined three solutions to test the stability and generalizability of these models based on previous specifications of the p-factor: a one-factor solution and two bifactor solutions.Footnote *Footnote ¹ In the first bifactor solution, we allowed all items to load onto a higher-order p-factor and one of three lower-order factors representing internalizing, externalizing, or thought disorders, which were restricted to be orthogonal to each other and the p-factor. In the second bifactor solution, the three lower-order factors were allowed to intercorrelate. Given the current literature on the hierarchical structure of psychopathology (HiTOP; Forbes, Reference Forbes2021; Kotov et al., Reference Kotov, Krueger, Watson, Achenbach, Althoff, Bagby and Zimmerman2017), we only allowed the somatoform disorder indicator to load onto the p-factor and allowed the borderline personality disorder (BPD) indicator to load onto both internalizing and externalizing factors.

Given concerns about the ability of fit indices to accurately distinguish factor analytic models (Bonifay & Cai, Reference Bonifay and Cai2017; Greene et al., Reference Greene, Eaton, Li, Forbes, Krueger, Markon and Kotov2019; Stanton et al., Reference Stanton, Watts, Levin-Aspenson, Carpenter, Emery and Zimmermann2021), we followed Forbes et al.'s (Reference Forbes, Greene, Levin-Aspenson, Watts, Hallquist, Lahey and Krueger2021a) recommendations to supplement standard model fit indices [root-mean-square error of approximation (RMSEA; acceptable fit ⩽.10; good fit ⩽ .06; Hu & Bentler, Reference Hu and Bentler1999), comparative fit index (CFI) and Tucker–Lewis index (TLI; acceptable fit ⩾.90; excellent fit ⩾.95; Hu & Bentler, Reference Hu and Bentler1999), weighted root-mean-square residual (WRMR; good fit <1.00; DiStefano, Lui, Jiang, & Shi, Reference DiStefano, Lui, Jiang and Shi2017)] with statistics to evaluate the unidimensionality of these models. Because residual variances are not identified with binary indicators, we estimated the reliability of the p-factor and lower-order factors with omega hierarchical (ω _h; McDonald, Reference McDonald1999; Zinbarg, Revelle, Yovel, & Li, Reference Zinbarg, Revelle, Yovel and Li2005) using the available continuous symptom count and self-report measures. We use ω _h* to denote omega hierarchical for the p-factor and ω _h,specific* to denote omega hierarchical for the lower-order factors to indicate these ω _h's do not include all variables in the model. ω _h* > .75 indicates sufficient reliability (Reise, Bonifay, & Haviland, Reference Reise, Bonifay and Haviland2013a). We calculated explained common variance (ECV; Reise, Scheines, Widaman, & Haviland, Reference Reise, Scheines, Widaman and Haviland2013b) to assess the proportion of variance across all indicators explained by the p-factor relative to the specific factors (ECV > .85 indicates likely unidimensionality; Stucky & Edelen, Reference Stucky, Edelen, Reise and Revicki2014). We also calculated ECV_S (Forbes et al. Reference Forbes, Greene, Levin-Aspenson, Watts, Hallquist, Lahey and Krueger2021a) for each specific factor to estimate the proportion of variance these factors explained. We calculated the percentage of uncontaminated correlations (PUC; Reise et al., Reference Reise, Scheines, Widaman and Haviland2013b), representing the proportion of correlations that only reflect variance from the p-factor (PUC > .70 indicates likely unidimensionality). Finally, we calculated the average parameter bias (APB), representing the difference between item loadings in the one-factor model and the bifactor model (10–15% is deemed acceptable; Muthén, Kaplan, & Hollis, Reference Muthén, Kaplan and Hollis1987) to assess the similarity of loadings between models.

We then added factors representing the five theories of the p-factor to these models. When multiple observed indicators were available (i.e., neuroticism, impulsivity, cognitive functioning), we allowed them to load onto a latent factor to represent the construct. When only single indicators were available (i.e., thought dysfunction, impairment), we created single-indicator latent variables by fixing the residual variance of the indicator to 0. We modeled the covariances between the p-factor and factors representing the five theories to test the convergent and discriminant validity of the p-factor, while simultaneously modeling the covariances among indicators of the five theories to account for their intercorrelations. We tested for differences in the strength of the absolute value of the standardized associations between the p-factor and indicators of each of the five theories using Wald tests. We examined fully standardized results in all models to enhance interpretability. All code is available at https://doi.org/10.17605/osf.io/hs8cp. Because participants did not consent to the open sharing of their data, we provide the raw correlation matrix (Table S1, Online Supplemental Materials), with raw data and measures available upon reasonable request.

Results

Descriptive statistics

The most frequently diagnosed conditions were intermittent explosive disorder (30.8%), alcohol use disorder (21.2%), any anxiety disorder (17.9%), and BPD (16.2%; Table 1). Mean scores on self-report measures of depression (Roelofs et al., Reference Roelofs, van Breukelen, de Graaf, Beck, Arntz and Huibers2013), anxiety (Gillis, Haaga, & Ford, Reference Gillis, Haaga and Ford1995), trauma (Bernstein et al., Reference Bernstein, Stein, Newcomb, Walker, Pogge, Ahluvalia and Zule2003), anger (Spielberger, Reference Spielberger1999), ADHD (Ward et al., Reference Ward, Wender and Reimherr1993), mania (Chmielewski, Fernandes, Yee, & Miller, Reference Chmielewski, Fernandes, Yee and Miller1995), and psychosis (Green et al., Reference Green, Freeman, Kuipers, Bebbington, Fowler, Dunn and Garety2008) were in line with community norms.

Table 1. Descriptive statistics for primary observed indicators

α, Cronbach's alpha; Min, observed minimum score; Max, observed maximum score; OCD, obsessive-compulsive disorder; PD, personality disorder; BDI-II, Beck Depression Inventory-II; BAI, Beck Anxiety Inventory; CTQ-SF, Childhood Trauma Questionnaire-SF; STAXI, State-Trait Anger Expression Inventory-2; WRS, Wender Utah Rating Scale; GBI, General Behavior Inventory; GPTS, Green et al. Paranoid Thoughts Scale; NEO-FFI, NEO Five Factor Inventory; EPQ, Eysenck Personality Questionnaire; BIS, Barratt Impulsiveness Scale; WASI, Wechsler Abbreviated Intelligence Scale; GAF, global assessment of functioning.

^a Borderline PD used as an indicator for internalizing and externalizing disorders.

Comparing models of the p-factor

A one-factor solution of the p-factor demonstrated relatively poor fit across imputed datasets, χ²(527) = 3813.40, RMSEA = .058, CFI = .857, TLI = .848, WRMR = 2.643, ω* = .92, although all but two indicators demonstrated loadings ⩾.35 (Table 2). The highest loading indicators were BPD symptoms, paranoid PD symptoms, and mania (Table 2).

Table 2. Fully standardized loadings of indicators on three models of the p-factor

Int, internalizing; Ext, externalizing; TD, thought disorder; PD, personality disorder; Depression, Beck Depression Inventory-II; Trauma, Childhood Trauma Questionnaire-Short Form; Anxiety, Beck Anxiety Inventory; Dx, disorder; OCD, obsessive-compulsive disorder; ADHD, Wender Utah Rating Scale; Anger, State-Trait Anger Expression Inventory-2; Mania, General Behavior Inventory–Biphasic subscale; Ideas of Persecution, Green et al. Paranoid Thoughts Scale-Form B; Ideas of Reference, Green et al. Paranoid Thoughts Scale-Form A.

Note. All loadings significant, ps < .05, except for those in italics. Indicators ordered largest to smallest by their loading on p in the bifactor, correlated lower-order factors model.

By contrast, a bifactor solution of the p-factor with orthogonal lower-order factors demonstrated acceptable-to-good fit, χ²(493) = 2314.02, RMSEA = .045, CFI = .921, TLI = .910, WRMR = 1.968, with all items loading positively and significantly on the p-factor (Table 2).Footnote ² The p-factor was highly reliable, ω _h* = .92, whereas the specific factors were substantially less so, ω _{h,Internalizing}* = .33, ω _{h,Externalizing}* = .23, ω _{h,Thought Disorder}* = .04. The p-factor also explained nearly all common variance among all items, ECV = .92, unlike the specific factors: ECV_S_{Internalizing} = .55, ECV_S_{Externalizing} = .24, ECV_S_{Thought Disorder} = .21. Just over 70% of correlations were uncontaminated, PUC = .71, and these loadings were not substantially different from the one-factor model, APB = 5.0%. Again, the highest loading indicators on the p-factor were BPD symptoms, mania, and paranoid PD symptoms (Table 2).

Finally, a bifactor solution of the p-factor with correlated lower-order factors also demonstrated acceptable-to-good fit, χ²(490) = 2238.15, RMSEA = .044, CFI = .924, TLI = .913, WRMR = 1.915, with all items loading positively and significantly on the p-factor (Table 2), and ω _h* = .92, ω _{h,Internalizing}* = .28, ω _{h,Externalizing}* = .11, ω _{h,Thought Disorder}* = .04; ECV = .95, ECV_S_{Internalizing} = .63, ECV_S_{Externalizing} = .29, ECV_S_{Thought Disorder} = .09; PUC = .71; and APB = 3.4%, together providing good evidence of unidimensionality.Footnote ³ Again, the highest loading indicators on the p-factor were BPD symptoms, mania, and paranoid PD symptoms (Table 2). Lower-order internalizing was negatively associated with externalizing, r = −.28, p < .01, and positively associated with thought disorder, r = .32, p < .01; however, externalizing was unrelated to thought disorder, r = .05, p = .68.

Testing five theories of p

When adding indicators of and factors representing the five theories of p to each of the three models of the p-factor above, no model demonstrated good fit across indices. We re-fit the models based on theory and modification indices, most notably removing UPPS-Sensation Seeking because it exhibited standardized loadings >1. The bifactor model with correlated lower-order factors was the best-fitting model with acceptable fit by RMSEA, χ²(973) = 6253.55, RMSEA = .054, CFI = .848, TLI = .832, WRMR = 2.390, and largely similar loadings on (Δλs: .01–.11) and associations with (Δrs: .01–.06) the p-factor (Tables S4a–S5).

Of the five theory indicators, the p-factor was most strongly and similarly associated with impairment (r = −.89, p < .01), impulsivity (r = .87, p < .01), and neuroticism (r = .86, p < .01), χ²(1)s < 1.57, ps > .20. However, the p-factor was more strongly associated with each of these three constructs than with thought dysfunction (r = .62, p < .01), χ²(1)s > 50.41, ps < .01, and most weakly associated with cognitive functioning (r = −.24, p < .01), χ²(1)s > 195.92, ps < .01.Footnote ⁴

Of note, the EPQ-Psychoticism scale reflects Eysenck's conceptualization of psychoticism as antisocial, creative, egocentric, impulsive, tough-minded, and unempathic (Eysenck, Reference Eysenck1987). These characteristics are relatively distinct from Caspi and Moffitt's (Reference Caspi and Moffitt2018) conceptualization of thought dysfunction as ‘illogical… and reality-distorted and -distorting cognitions’ with delusional beliefs at the most extreme end of this continuum. Thus, we re-ran the bifactor model of the p-factor with correlated lower-order factors, replacing EPQ-Psychoticism as the indicator of thought dysfunction with the two GPTS subscale scores (representing ideas of reference and ideas of persecution), and removing EPQ-Psychoticism based on modification indices. This model demonstrated numerically better fit, χ²(926) = 5276.95, RMSEA = .051, CFI = .871, TLI = .856, WRMR = 2.227.Footnote ⁵ In this model (Fig. 1), the p-factor again was most strongly and similarly associated with neuroticism, impairment, and impulsivity, χ²(1)s < .20, ps > .65. The p-factor was more strongly associated with each of these three constructs than with the revised thought dysfunction factor, χ²(1)s > 3.92, ps < .05, and the weakest association was again with cognitive functioning, χ²(1)s > 189.56, ps < .01.

Fig. 1. Confirmatory factor analysis comparing the strength of the associations of five theories of p with the p-factor.

Discussion

In this study, we compared the structure of the p-factor among three candidate models in a sample with a range of psychopathology and compared the strength of the associations between the p-factor and indicators of five leading theories of p to test the convergent and discriminant validity of these theories. Across indices, the p-factor was reliable and unidimensional, with similar patterns of factor loadings regardless of model specifications. The p-factor was nearly identical to factors representing neuroticism, impulsivity, and impairment, strongly associated with thought dysfunction, and relatively weakly associated with cognitive functioning.

Using Forbes et al.'s (Reference Forbes, Greene, Levin-Aspenson, Watts, Hallquist, Lahey and Krueger2021a) recommendations, we replicated their findings regarding the high reliability and unidimensionality of the p-factor in an independent sample with unique indicators. The consistency of these results across models and samples provides stronger evidence for the existence of a p-factor than relying solely on model fit indices (Stanton et al., Reference Stanton, Watts, Levin-Aspenson, Carpenter, Emery and Zimmermann2021). The highest loading items on the p-factor were BPD, paranoid PD, and mania in line with meta-analytic (Ringwald et al., Reference Ringwald, Forbes and Wright2021) and longitudinal (Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014) research.

However, each diagnostic indicator contains multiple symptoms, so using them to infer the definition of the p-factor is less direct and may exhibit more sample-to-sample variability (Levin-Aspenson et al., Reference Levin-Aspenson, Watson, Clark and Zimmerman2020) than empirically testing the relations between the p-factor and specific theories. The p-factor was nearly identical to indicators of neuroticism, impulsivity, and impairment. The p-factor was also strongly associated with thought dysfunction, but less strongly related to cognitive functioning. These results suggest a model of p that extends Barlow et al.'s (Reference Barlow, Sauer-Zavala, Carl, Bullis and Ellard2014) model of emotional disorders and synthesizes it with Smith et al.'s (Reference Smith, Atkinson, Davis, Riley and Oltmanns2020) nonspecific impairment interpretation of the p-factor. In Barlow et al.'s (Reference Barlow, Sauer-Zavala, Carl, Bullis and Ellard2014) model, emotional disorders (e.g., mood disorders, anxiety and related disorders, BPD) are characterized by the transaction between frequent, intense experiences of negative emotions (i.e., neuroticism) and aversive, impulsive reactions to reduce the short-term intensity of those emotions (i.e., impulsivity). However, positive urgency, or the tendency to act rashly in response to positive emotions, demonstrated one of the highest loadings on the impulsivity factor. This suggests the possibility of extending Barlow et al.'s (Reference Barlow, Sauer-Zavala, Carl, Bullis and Ellard2014) theory to include impulsive responses to positive emotions that lead to maladaptive consequences, which could in turn prompt frequent negative emotions (i.e., neuroticism). Smith et al.'s (Reference Smith, Atkinson, Davis, Riley and Oltmanns2020) interpretation would add that the transaction between impulsivity and negative and/or positive emotions is necessary but not sufficient to define general psychopathology because it must lead to some level of impairment.

This tripartite definition of the p-factor can address Smith et al.'s (Reference Smith, Atkinson, Davis, Riley and Oltmanns2020) challenge that an appropriate definition of the p-factor should explain variance in all items loading on the p-factor while providing a more falsifiable theory of p (Watts, Lane, Bonifay, Steinely, & Meyer, Reference Watts, Lane, Bonifay, Steinely and Meyer2020). For instance, hallucinations may only indicate psychopathology if they are hostile or otherwise prompt negative emotions and impulsive attempts to stop them. Strong positive emotions in mania may prompt impulsive and impairing behaviors that may, in turn, lead to negative interpersonal consequences or other dysfunction and thus prompt negative emotions. Restrictive eating behaviors may be an avoidant response to strong negative emotions that can have impairing consequences for a person, despite promoting a temporary feeling of control.

Although we have focused on the relations between the p-factor and neuroticism, impulsivity, and impairment, the p-factor also demonstrated strong associations with thought dysfunction that varied by the measure of thought dysfunction used. These results, combined with the high loading of paranoid PD symptoms and mania on p, suggests the need for further study of the role of thought dysfunction in p. In particular, excluding participants with active psychosis and active mania may have restricted the range of thought disorder and thought dysfunction, attenuating the strength of their relations with the p-factor. Alternatively, our measures of thought dysfunction may not capture the breadth of Caspi and Moffitt's (Reference Caspi and Moffitt2018) definition. We encourage future researchers to include explicit measures of these thought processes in studies of the p-factor to more specifically test this theory.

Finally, the relatively small relation between the p-factor and cognitive functioning suggests this theory may be less tenable than the others. This finding is in line with Caspi et al. (Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014) in which the relation between childhood IQ and the p-factor was less than half as large as the relation between the p-factor and neuroticism. Cognitive functioning may exert a more distal, developmental effect on the p-factor, rather than reflecting psychopathology per se (Caspi et al., Reference Caspi, Houts, Belsky, Goldman-Mellor, Harrington, Israel and Moffitt2014; Caspi & Moffitt, Reference Caspi and Moffitt2018). Alternatively, method effects may reduce the strength of this association, given that cognitive functioning was measured by a behavioral task whereas indicators of the p-factor came from interview assessments and self-reports.

The results of this study should be considered in light of its limitations. Our sample was relatively small compared to other studies of the p-factor (e.g., Forbes et al. Reference Forbes, Greene, Levin-Aspenson, Watts, Hallquist, Lahey and Krueger2021a), and the exclusion of people with very low cognitive functioning may have reduced the generalizability of our results and attenuated the strength of the relations between the p-factor and cognitive functioning. The cross-sectional, between-person nature of the design restricts our ability to draw causal conclusions (Fried, Reference Fried2020). Although GAF scores characterize functional impairment and are easily implemented in clinical practice, they are only one indicator of impairment that can have low reliability in practice (Vatnaland, Vatnaland, Friis, & Opjordsmoen, Reference Vatnaland, Vatnaland, Friis and Opjordsmoen2007). Neither full model of the p-factor with indicators of its theories demonstrated good fit across indices, despite the two bifactor models of the p-factor alone demonstrating good fit. The high correlations among indicators may suggest this misfit is a result of parsing similar constructs into too many categories (Watts, Boness, Loeffelman, Steinley, & Sher, Reference Watts, Boness, Loeffelman, Steinley and Sher2021). Although model fit indices alone may not adequately distinguish models from each other (Greene et al., Reference Greene, Eaton, Li, Forbes, Krueger, Markon and Kotov2019), relatively low model fit suggests our results should be interpreted cautiously until replicated.

Conceptually, the constructs represented by the theories of p may be considered embedded in the diagnostic indicators of the p-factor, either through item content or diagnostic criteria, producing circular results. We do not dispute that these constructs are embedded in the diagnostic indicators. However, we note that diagnostic indicators include heterogeneous criteria that vary in how closely they align with the theories of p (e.g., most syndromal disorders include an impairment criterion but PDs do not). Rather than indirectly inferring the association between the p-factor and theories of p based on how heterogeneous diagnostic indicators load onto the p-factor, we believe that directly modeling these associations provides a stronger and more straightforward test of these theories, which can contribute to the formalization of theories of p by allowing for direct comparisons among these associations. Similarly, the high correlations among indicators may also result from item content overlap. When possible, we excluded scales with direct item overlap (e.g., the NEO-FFI scales include no impulsiveness items). Furthermore, previous researchers have found similarly sized associations between personality and psychopathology factors with and without overlapping items (Walton, Pantoja, & McDermot, Reference Walton, Pantoja and McDermot2017). Finally, the tripartite model of p we discuss risks defining p in terms of the primary characteristics of the lower-order factors. We echo calls from Fried (Reference Fried2020) and others to test these theories in longitudinal studies to examine whether these theories contribute to the development of the p-factor (e.g., Williams, Craske, Mineka, & Zinbarg, Reference Williams, Craske, Mineka and Zinbarg2021).

Despite these limitations, we found evidence of a unidimensional p-factor in a relatively diverse sample of adults with a range of measures of psychopathology. BPD symptoms, paranoid PD symptoms, and mania loaded most strongly on the p-factor regardless of the specific model used. The p-factor was most strongly related to neuroticism, impulsivity, and impairment, followed by thought dysfunction, and, to a much lesser degree, low cognitive functioning. We suggest a tripartite definition of the p-factor that incorporates transactions between neuroticism and impulsivity leading to impairment, and we encourage future research to test these theories longitudinally.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291722001635.

Author contributions

M. W. S. and J. S. C. developed the study concept. All authors contributed to the study design. Data collection was performed by E. F. C. M. W. S. performed the data analysis and interpretation. M. W. S. drafted the paper, and J. S. C. and E. F. C. provided critical revisions. All authors approved the final version of the paper for submission.

Financial support

This work was supported by the National Institute of Health (E. F. C., grant numbers R01MH60836 and R01MH66984).

Conflict of interest

None.

Ethical standards

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.

Footnotes

* The notes appear after the main text.

1 We focused on these models because the p-factor does not share variance with lower-order factors, making the p-factor and associations with it more directly comparable across models (Bornovalova, Choate, Fatimah, Petersen, & Wiernik, Reference Bornovalova, Choate, Fatimah, Petersen and Wiernik2020; Moore et al., Reference Moore, Kaczkurkin, Durham, Jeong, McDowell, Dupont and Lahey2020). However, to contextualize our results among alternative conceptualizations of psychopathology, we also examined a correlated factors model and a hierarchical model with individual indicators loading onto specific second-order factors which themselves loaded onto a higher-order p-factor.

2 Because multiply imputed models cannot be directly compared in Mplus, we do not report model comparison statistics.

3 A correlated factors model demonstrated worse fit across indices, χ²(491) = 3392.27, RMSEA = .057, CFI = .872, TLI = .862, WRMR = 2.481, and very high correlations among all factors: r _{Internalizing-Externalizing} = .77, p < .01; r _{Internalizing-Thought Disorder} = .87, p < .01; r _{Externalizing-Thought Disorder} = .88, p < .01 (Table S2). A hierarchical model demonstrated similarly worse fit across indices, χ²(490) = 3376.29, RMSEA = .057, CFI = .873, TLI = .863, WRMR = 2.475, with very high loadings of each specific factor on the p-factor: λ _{Internalizing} = .87, p < .01; λ _{Externalizing} = .89, p < .01; λ _{Thought Disorder} = .97, p < .01 (Table S2).

4 Given concerns that some lower-order constructs were overrepresented in the p-factor, we re-ran this model including only the three highest-loading diagnoses and three highest-loading self-report measures from each lower-order domain. This model demonstrated relatively poor fit, χ²(331) = 5767.68, RMSEA = .095, CFI = .762, TLI = .708, WRMR = 2.413, and nearly identical associations between the p-factor and theories of p, Δrs: .00–.05 (Figure S1, Online Supplemental Materials).

5 See online Tables S6–S8 for fit statistics, correlations, and item loadings of models with this revised thought dysfunction factor.

References

American Psychiatric Association. (2000). Diagnostic and statistical manual of mental disorders (4th ed., text rev.). Washington, DC: American Psychiatric Association. Retrieved from https://doi.org/10.1176/appi.books.9780890423349.Google Scholar

American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). Washington, DC: American Psychiatric Association. Retrieved from https://doi.org/10.1176/appi.books.9780890425596.Google Scholar

Barlow, D. H., Sauer-Zavala, S., Carl, J. R., Bullis, J. R., & Ellard, K. K. (2014). The nature, diagnosis, and treatment of neuroticism: Back to the future. Clinical Psychological Science, 2(3), 344–365. https://doi.org/10.1177/2167702613505532.CrossRef Google Scholar

Beck, A. T., & Steer, R. A. (1990). Manual for the Beck Anxiety Inventory. San Antonio, TX: Psychological Corporation.Google Scholar

Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Manual for the Beck Depression Inventory (2nd ed.). San Antonio, TX: Psychological Corporation.Google Scholar

Bernstein, D. P., Stein, J. A., Newcomb, M. D., Walker, E., Pogge, D., Ahluvalia, T., … Zule, W. (2003). Development and validation of a brief screening version of the Childhood Trauma Questionnaire. Child Abuse & Neglect, 27(2), 169–190. https://doi.org/10.1016/s0145-2134(02)00541-0.CrossRef Google Scholar PubMed

Bollen, K. A. (1989). Structural equations with latent variables. New York, NY: Wiley.CrossRef Google Scholar

Bonifay, W., & Cai, L. (2017). On the complexity of item response theory models. Multivariate Behavioral Research, 52(4), 465–484. https://doi.org/10.1080/00273171.2017.1309262.CrossRef Google Scholar PubMed

Bornovalova, M. A., Choate, A. M., Fatimah, H., Petersen, K. J., & Wiernik, B. M. (2020). Appropriate use of bifactor analysis in psychopathology research: Appreciating benefits and limitations. Biological Psychiatry, 88(1), 18–27. https://doi.org/10.1016/j.biopsych.2020.01.013.CrossRef Google Scholar PubMed

Brandes, C. M., Herzhoff, K., Smack, A. J., & Tackett, J. L. (2019). The p factor and the n factor: Associations between the general factors of psychopathology and neuroticism in children. Clinical Psychological Science, 7(6), 1266–1284. https://doi.org/10.1177/2167702619859332.CrossRef Google Scholar

Carver, C. S., Johnson, S. L., & Timpano, K. R. (2017). Toward a functional view of the p factor in psychopathology. Clinical Psychological Science, 5(5), 880–889. https://doi.org/10.1177/2167702617710037.CrossRef Google Scholar

Caspi, A., Houts, R. M., Belsky, D. W., Goldman-Mellor, S. J., Harrington, H., Israel, S., … Moffitt, T. E. (2014). The p factor: One general psychopathology factor in the structure of psychiatric disorders? Clinical Psychological Science, 2(2), 119–137. https://doi.org/10.1177/2167702613497473.CrossRef Google Scholar

Caspi, A., & Moffitt, T. E. (2018). All for one and one for all: Mental disorders in one dimension. American Journal of Psychiatry, 175(9), 831–844. https://doi.org/10.1176/appi.ajp.2018.17121383.CrossRef Google Scholar PubMed

Castellanos-Ryan, N., Brière, F. N., O'Leary-Barrett, M., Banaschewski, T., Bokde, A., & Bromberg, U., … IMAGEN Consortium. (2016). The structure of psychopathology in adolescence and its common personality and cognitive correlates. Journal of Abnormal Psychology, 125(8), 1039–1052. https://doi.org/10.1037/abn0000193.CrossRef Google Scholar PubMed

Chmielewski, P. M., Fernandes, L. O. L., Yee, C. M., & Miller, G. A. (1995). Ethnicity and gender in scales of psychosis proneness and mood disorders. Journal of Abnormal Psychology, 104(3), 464–470. https://doi.org/10.1037/0021-843X.104.3.464.CrossRef Google Scholar PubMed

Costa, P. T., & McCrae, R. R. (1992). Revised NEO personality inventory (NEO-PI-R) and NEO five factor inventory (NEO-FFI) professional manual. Odessa, FL: Psychological Assessment Resources.Google Scholar

Depue, R. A., & Klein, D. (1988). Identification of unipolarnd bipolar affective conditions by the general behavior inventory. In Dunner, D., Gershon, E. & Barrett, J. (Eds.), Relatives at risk for mental disorder (pp. 257–282). New York, NY: Raven Press.Google Scholar

DiStefano, C., Lui, J., Jiang, N., & Shi, D. (2017). Examination of the weighted root mean square residual: Evidence for trustworthiness? Structural Equation Modeling, 25(3), 453–466. https://doi.org/10.1080/10705511.2017.1390394.CrossRef Google Scholar

Eysenck, H. J. (1987). The definition of personality disorders and the criteria appropriate for their description. Journal of Personality Disorders, 1(3), 211–219. https://doi.org/10.1521/pedi.1987.1.3.211.CrossRef Google Scholar

Eysenck, H. J., & Eysenck, S. B. G. (1991). Manual of the Eysenck personality scale (adults). London, UK: Hodder & Stoughton.Google Scholar

Eysenck, S. B. G., Pearson, P. R., Easting, G., & Allsopp, J. F. (1985). Age norms for impulsiveness, venturesomeness and empathy in adults. Personality and Individual Differences, 6(5), 613–619. https://doi.org/10.1016/0191-8869(85)90011-X.CrossRef Google Scholar

First, M. B., Spitzer, R. L., Miriam, G., & Williams, J. B. W. (2002). Structured clinical interview for DSM-IV-TR axis I disorders, research version, patient edition (SCID-I/P). New York, NY: Biometrics Research, New York State Psychiatric Institute.Google Scholar

Forbes, M. K., Greene, A. L., Levin-Aspenson, H. F., Watts, A. L., Hallquist, M., Lahey, B. B., … Krueger, R. F. (2021a). Three recommendations based on a comparison of the reliability and validity of the predominant models used in research on the empirical structure of psychopathology. Journal of Abnormal Psychology, 130(3), 297–317. https://doi.org/10.1037/abn0000533.CrossRef Google Scholar PubMed

Forbes, M. K., Kotov, R., Ruggero, C. J., Watson, D., Zimmerman, M., & Krueger, R. F. (2017). Delineating the joint hierarchical structure of clinical and personality disorders in an outpatient psychiatric sample. Comprehensive Psychiatry, 79, 19–30. https://doi.org/10.1016/j.comppsych.2017.04.006.CrossRef Google Scholar

Forbes, M. K. [@MiriForbes]. (2021). Our closing session of @HiTOP_system 2021 was ‘HiTOP 2.0 (priorities for revisions)’. This is a basic (incomplete) summary of the [Tweet]. Twitter. Retrieved from https://twitter.com/MiriForbes/status/1377402715416891393?s=20.Google Scholar

Forbes, M. K., Sunderland, M., Rapee, R. M., Batterham, P. J., Calear, A. L., Carragher, N., … Krueger, R. F. (2021b). A detailed hierarchical model of psychopathology: From individual symptoms up to the general factor of psychopathology. Clinical Psychological Science, 9(2), 139–168. https://doi.org/10.1177/2167702620954799.CrossRef Google Scholar PubMed

Fried, E. I. (2020). Lack of theory building and testing impedes progress in the factor and network literature. Psychological Inquiry, 31(4), 271–288. https://doi.org/10.1080/1047840X.2020.1853461.CrossRef Google Scholar

Fried, E. I., Greene, A. L., & Eaton, N. R. (2021). The p factor is the sum of its parts, for now. World Psychiatry, 20(1), 69–70. https://doi.org/10.1002/wps.20814.CrossRef Google Scholar

Gaher, R. M., Simons, J. S., Hahn, A. M., Hofman, N. L., Hansen, J., & Buchkoski, J. (2014). An experience sampling study of PTSD and alcohol-related problems. Psychology of Addictive Behaviors, 28(4), 1013–1025. https://doi.org/10.1037/a0037257.CrossRef Google Scholar PubMed

Gillis, M. M., Haaga, D. A. F., & Ford, G. T. (1995). Normative values for the Beck Anxiety Inventory, Fear Questionnaire, Penn State Worry Questionnaire, and Social Phobia and Anxiety Inventory. Psychological Assessment, 7(4), 450–455. https://doi.org/10.1037/1040-3590.7.4.450.CrossRef Google Scholar

Green, C. E., Freeman, D., Kuipers, E., Bebbington, P., Fowler, D., Dunn, G., Garety, P. A. (2008). Measuring ideas of persecution and social reference: The Green et al. paranoid thought scales (GPTS). Psychological Medicine, 38(1), 101–111. https://doi.org/10.1017/S0033291707001638.CrossRef Google Scholar

Greene, A. L., Eaton, N. R., Li, K., Forbes, M. K., Krueger, R. F., Markon, K. E., … Kotov, R. (2019). Are fit indices used to test psychopathology structure biased? A simulation study. Journal of Abnormal Psychology, 128(7), 740–746. https://doi.org/10.1037/abn0000434.CrossRef Google Scholar PubMed

Hoertel, N., Franco, S., Wall, M. M., Oquendo, M. A., Kerridge, B. T., Limosin, F., & Blanco, C. (2015). Mental disorders and risk of suicide attempt: A national prospective study. Molecular Psychiatry, 20(6), 718–726. https://doi.org/10.1038/mp.2015.19.CrossRef Google Scholar PubMed

Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6(1), 1–55. https://doi.org/10.1080/10705519909540118.CrossRef Google Scholar

Kosten, T. A., & Rounsaville, B. J. (1992). Sensitivity of psychiatric diagnoses based on the best estimate procedure. American Journal of Psychiatry, 149(9), 1225–1227. https://doi.org/10.1176/ajp.149.9.1225.Google Scholar PubMed

Kotov, R., Jonas, K. G., Carpenter, W. T., Dretsch, M. N., Eaton, N. R., & Forbes, M. K., … HiTOP Utility Workgroup. (2020). Validity and utility of hierarchical taxonomy of psychopathology (HiTOP): I. Psychosis superspectrum. World Psychiatry, 19(2), 151–172. https://doi.org/10.1002/wps.20730.CrossRef Google Scholar PubMed

Kotov, R., Krueger, R. F., Watson, D., Achenbach, T. M., Althoff, R. R., Bagby, R. M., … Zimmerman, M. (2017). The hierarchical taxonomy of psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology, 126(4), 454–477. https://doi.org/10.1037/abn0000258.CrossRef Google Scholar PubMed

Krueger, R. F., & Markon, K. E. (2006). Reinterpreting comorbidity: A model-based approach to understanding and classifying psychopathology. Annual Review of Clinical Psychology, 2, 111–133. https://doi.org/10.1146/annurev.clinpsy.2.022305.095213.CrossRef Google Scholar PubMed

Laceulle, O. M., Vollebergh, W. A. M., & Ormel, J. (2015). The structure of psychopathology in adolescence: Replication of a general psychopathology factor in the trails study. Clinical Psychological Science, 3(6), 850–860. https://doi.org/10.1177/2167702614560750.CrossRef Google Scholar

Lahey, B. B., Applegate, B., Hakes, J. K., Zald, D. H., Hariri, A. R., & Rathouz, P. J. (2012). Is there a general factor of prevalent psychopathology during adulthood? Journal of Abnormal Psychology, 121(4), 971–977. https://doi.org/10.1037/a0028355.CrossRef Google Scholar

Lahey, B. B., Krueger, R. F., Rathouz, P. J., Waldman, I. D., & Zald, D. H. (2017). A hierarchical causal taxonomy of psychopathology across the life span. Psychological Bulletin, 143(2), 142–186. https://doi.org/10.1037/bul0000069.CrossRef Google Scholar PubMed

Levin-Aspenson, H. F., Khoo, S., & Kotelnikova, Y. (2019). Hierarchical taxonomy of psychopathology across development: Associations with personality. Journal of Research in Personality, 81, 72–78. https://doi.org/10.1016/j.jrp.2019.05.006.CrossRef Google Scholar

Levin-Aspenson, H. F., Watson, D., Clark, L. A., & Zimmerman, M. (2020). What is the general factor of psychopathology? Consistency of the p factor across samples. Assessment, 28(4), 1035–1049. https://doi.org/10.1177/1073191120954921.CrossRef Google Scholar

Lynam, D. R., Smith, G. T., Whiteside, S. P., & Cyders, M. A. (2006). The UPPS-P: Assessing five personality pathways to impulsive behavior (technical report). West Lafayette, IN: Purdue University.Google Scholar

Malouff, J. M., Thorsteinsson, E. B., & Schutte, N. S. (2005). The relationship between the five-factor model of personality and symptoms of clinical disorders: A meta-analysis. Journal of Psychopathology and Behavioral Assessment, 27(2), 101–114. https://doi.org/10.1007/s10862-005-5384-y.CrossRef Google Scholar

Martel, M. M., Pan, P. M., Hoffmann, M. S., Gadelha, A., do Rosário, M. C., Mari, J. J., … Salum, G. A. (2017). A general psychopathology factor (p factor) in children: Structural model analysis and external validation through familial risk and child global executive function. Journal of Abnormal Psychology, 126(1), 137–148. https://doi.org/10.1037/abn0000205.CrossRef Google Scholar PubMed

McDonald, R. P. (1999). Test theory: A unified treatment. New York, NY: Lawrence Erlbaum.Google Scholar

Moore, T. M., Kaczkurkin, A. N., Durham, E. L., Jeong, H. J., McDowell, M. G., Dupont, R. M., … Lahey, B. B. (2020). Criterion validity and relationships between alternative hierarchical dimensional models of general and specific psychopathology. Journal of Abnormal Psychology, 129(7), 677–688. https://doi.org/10.1037/abn0000601.CrossRef Google Scholar PubMed

Murray, A. L., Eisner, M., & Ribeaud, D. (2016). The development of the general factor of psychopathology ‘p factor’ through childhood and adolescence. Journal of Abnormal Child Psychology, 44(8), 1573–1586. https://doi.org/10.1007/s10802-016-0132-1.CrossRef Google Scholar PubMed

Muthén, B., Kaplan, D., & Hollis, M. (1987). On structural equation modeling with data that are not missing completely at random. Psychometrika, 52(3), 431–462. https://doi.org/10.1007/BF02294365.CrossRef Google Scholar

Muthén, L. K., & Muthén, B. O. (1998–2012). Mplus user's guide (7th ed.). Los Angeles, CA: Muthén & Muthén.Google Scholar

Oltmanns, J. R., Smith, G. T., Oltmanns, T. F., & Widiger, T. A. (2018). General factors of psychopathology, personality, and personality disorder: Across domain comparisons. Clinical Psychological Science, 6(4), 581–589. https://doi.org/10.1177/2167702617750150.CrossRef Google Scholar PubMed

Patton, J. H., Stanford, M. S., & Barratt, E. S. (1995). Factor structure of the Barratt impulsiveness scale. Journal of Clinical Psychology, 51(6), 768–774. https://doi.org/10.1002/1097-4679(199511)51:6<768::aid-jclp2270510607>3.0.co;2-1.3.0.CO;2-1>CrossRef Google Scholar PubMed

Pfohl, B., Blum, N., & Zimmerman, M. (1997). Structured interview for DSM-IV personality (SIDP-IV). Washington, DC: American Psychiatric Association.Google Scholar

Reise, S. P., Bonifay, W. E., & Haviland, M. G. (2013a). Scoring and modeling psychological measures in the presence of multidimensionality. Journal of Personality Assessment, 95(2), 129–140. https://doi.org/10.1080/00223891.2012.725437.CrossRef Google Scholar PubMed

Reise, S. P., Scheines, R., Widaman, K. F., & Haviland, M. G. (2013b). Multidimensionality and structural coefficient bias in structural equation modeling: A bifactor perspective. Educational and Psychological Measurement, 73(1), 5–26. https://doi.org/10.1177/0013164412449831.CrossRef Google Scholar

Riley, E. N., Combs, J. L., Jordan, C. E., & Smith, G. T. (2015). Negative urgency and lack of perseverance: Identification of differential pathways of onset and maintenance risk in the longitudinal prediction of nonsuicidal self-injury. Behavior Therapy, 46(4), 439–448. https://doi.org/10.1016/j.beth.2015.03.002.CrossRef Google Scholar PubMed

Riley, E. N., Rukavina, M., & Smith, G. T. (2016). The reciprocal predictive relationship between high-risk personality and drinking: An 8-wave longitudinal study in early adolescents. Journal of Abnormal Psychology, 125(6), 798–804. https://doi.org/10.1037/abn0000189.CrossRef Google Scholar PubMed

Ringwald, W. R., Forbes, M. K., & Wright, A. (2021). Meta-analysis of structural evidence for the hierarchical taxonomy of psychopathology (HiTOP) model. Psychological Medicine. Advance online publication. https://doi.org/10.1017/S0033291721001902.CrossRef Google Scholar PubMed

Roelofs, J., van Breukelen, G., de Graaf, L. E., Beck, A. T., Arntz, A., & Huibers, M. J. H. (2013). Norms for the Beck Depression Inventory (BDI-II) in a large Dutch community sample. Journal of Psychopathology and Behavioral Assessment, 35(1), 93–98. https://doi.org/10.1007/s10862-012-9309-2.CrossRef Google Scholar

Saucier, G. (1998). Replicable item-cluster subcomponents in the NEO five-factor inventory. Journal of Personality Assessment, 70(2), 263–276. https://doi.org/10.1207/s15327752jpa7002_6.CrossRef Google Scholar PubMed

Saulsman, L. M., & Page, A. C. (2004). The five-factor model and personality disorder empirical literature: A meta-analytic review. Clinical Psychology Review, 23(8), 1055–1085. https://doi.org/10.1016/j.cpr.2002.09.001.CrossRef Google Scholar PubMed

Settles, R. E., Fischer, S., Cyders, M. A., Combs, J. L., Gunn, R. L., & Smith, G. T. (2012). Negative urgency: A personality predictor of externalizing behavior characterized by neuroticism, low conscientiousness, and disagreeableness. Journal of Abnormal Psychology, 121(1), 160–172. https://doi.org/10.1037/a0024948.CrossRef Google Scholar PubMed

Smith, G. T., Atkinson, E. A., Davis, H. A., Riley, E. N., & Oltmanns, J. R. (2020). The general factor of psychopathology. Annual Review of Clinical Psychology, 16, 75–98. https://doi.org/10.1146/annurev-clinpsy-071119-115848.CrossRef Google Scholar PubMed

Spielberger, C. D. (1999). STAXI-2: State-trait anger expression inventory-2 professional manual. Odessa, FL: Psychological Assessment Resources.Google Scholar

Stanton, K., Watts, A. L., Levin-Aspenson, H. F., Carpenter, R. W., Emery, N. N., & Zimmermann, M. (2021). Is adequate model fit indicative of an adequate factor analytic model? Recognizing construct heterogeneity and model misspecification in factor analytic research. https://doi.org/10.31219/osf.io/6te2b.CrossRef Google Scholar

Stochl, J., Khandaker, G. M., Lewis, G., Perez, J., Goodyer, I. M., Zammit, S., … Jones, P. B. (2015). Mood, anxiety and psychotic phenomena measure a common psychopathological factor. Psychological Medicine, 45(7), 1483–1493. https://doi.org/10.1017/S003329171400261X.CrossRef Google Scholar

Stucky, B. D., & Edelen, M. O. (2014). Using hierarchical IRT models to create unidimensional measures from multidimensional data. In Reise, S. P. & Revicki, D. A. (Eds.), Handbook of item response theory modeling: Applications to typical performance assessment (pp. 183–206). New York, NY: Taylor & Francis.Google Scholar

Tackett, J. L., Lahey, B. B., van Hulle, C., Waldman, I., Krueger, R. F., & Rathouz, P. J. (2013). Common genetic influences on negative emotionality and a general psychopathology factor in childhood and adolescence. Journal of Abnormal Psychology, 122(4), 1142–1153. https://doi.org/10.1037/a0034151.CrossRef Google Scholar

van Bork, R., Epskamp, S., Rhemtulla, M., Borsboom, D., & van der Maas, H. L. J. (2017). What is the p-factor of psychopathology? Some risks of general factor modeling. Theory & Psychology, 27(6), 759–773. https://doi.org/10.1177/0959354317737185.CrossRef Google Scholar

Vatnaland, T., Vatnaland, J., Friis, S., & Opjordsmoen, S. (2007). Are GAF scores reliable in routine clinical use? Acta Psychiatrica Scandinavica, 115(4), 326–330. https://doi.org/10.1111/j.1600-0447.2006.00925.x.CrossRef Google Scholar PubMed

Walton, K. E., Pantoja, G., & McDermot, W. (2017). Associations between lower order facets of personality and dimensions of mental disorder. Journal of Psychopathology and Behavioral Assessment, 40(3), 465–475. https://doi.org/10.1007/s10862-017-9633-7.CrossRef Google Scholar

Ward, M. F., Wender, P. H., & Reimherr, F. W. (1993). The Wender Utah Rating Scale: An aid in the retrospective diagnosis of childhood attention deficit hyperactivity disorder. The American Journal of Psychiatry, 150(6), 885–890. https://doi.org/10.1176/ajp.150.6.885.Google Scholar PubMed

Watts, A. L., Boness, C. L., Loeffelman, J. E., Steinley, D., & Sher, K. J. (2021). Does crude measurement contribute to observed unidimensionality of psychological constructs? A demonstration with DSM-5 alcohol use disorder. Journal of Abnormal Psychology, 130(5), 512–524. https://doi.org/10.1037/abn0000678.CrossRef Google Scholar PubMed

Watts, A. L., Lane, S. P., Bonifay, W., Steinely, D., & Meyer, F. A. C. (2020). Building theories on top of, and not independent of, statistical models: The case of the p-factor. Psychological Inquiry, 31(4), 310–320. https://doi.org/10.1080/1047840x.2020.1853476.CrossRef Google Scholar

Wechsler, D. (2011). Wechsler abbreviated scale of intelligence (2nd ed.). New York, NY: NCS Pearson.Google Scholar

Widiger, T. A., & Oltmanns, J. R. (2017). The general factor of psychopathology and personality. Clinical Psychological Science, 5(1), 182–183. https://doi.org/10.1177/2167702616657042.CrossRef Google Scholar PubMed

Williams, A. L., Craske, M. G., Mineka, S., & Zinbarg, R. E. (2021). Reciprocal effects of personality and general distress: Neuroticism vulnerability is stronger than scarring. Journal of Abnormal Psychology, 130(1), 34–46. https://doi.org/10.1037/abn0000635.CrossRef Google Scholar PubMed

Wright, A. G., & Simms, L. J. (2015). A metastructural model of mental disorders and pathological personality traits. Psychological Medicine, 45(11), 2309–2319. https://doi.org/10.1017/S0033291715000252.CrossRef Google Scholar PubMed

Zinbarg, R. E., Revelle, W., Yovel, I., & Li, W. (2005). Cronbach's α, Revelle's β, and Mcdonald's ω _H: Their relations with each other and two alternative conceptualizations of reliability. Psychometrika, 70(1), 1–11. https://doi.org/10.1007/s11336-003-0974-7.CrossRef Google Scholar

Table 1. Descriptive statistics for primary observed indicators

Table 2. Fully standardized loadings of indicators on three models of the p-factor

Fig. 1. Confirmatory factor analysis comparing the strength of the associations of five theories of p with the p-factor.

Southward et al. supplementary material

File 751.5 KB

Article contents

Defining the p-factor: an empirical test of five leading theories

Abstract

Keywords

Dispositional negative emotionality

Impulsive responsivity to emotions

Low cognitive functioning

Thought dysfunction

Impairment

Limitations

Current study

Materials and methods

Participants

Measures

Indicators of the p-factor

Diagnostic assessments

Self-reported psychopathology.

Indicators of the five theories of p

Data analytic method

Results

Descriptive statistics

Comparing models of the p-factor

Testing five theories of p

Discussion

Supplementary material

Author contributions

Financial support

Conflict of interest

Ethical standards

Footnotes

References

Southward et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests