Unraveling the genetic architecture of major depressive disorder: merits and pitfalls of the approaches used in genome-wide association studies

I. Schwabe; Y. Milaneschi; Z. Gerring; P. F. Sullivan; E. Schulte; N. P. Suppli; J. G. Thorp; E. M. Derks; C. M. Middeldorp

doi:10.1017/S0033291719002502

Published online by Cambridge University Press: 27 September 2019

I. Schwabe*: Affiliation:
Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands; Translational Neurogenomics Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, Australia
Y. Milaneschi: Affiliation:
Department of Psychiatry, Amsterdam Neuroscience and Amsterdam Public Health Research Institute, Amsterdam University Medical Center, Amsterdam, The Netherlands
Z. Gerring: Affiliation:
Translational Neurogenomics Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, Australia
P. F. Sullivan: Affiliation:
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; Department of Genetics, University of North Carolina, Chapel Hill, NC, USA; Department of Psychiatry, University of North Carolina, Chapel Hill, NC, USA
E. Schulte: Affiliation:
Medical Centre of the University of Munich, Munich, Germany
N. P. Suppli: Affiliation:
Mental Health Centre Copenhagen, Copenhagen University Hospital, Copenhagen, Denmark
J. G. Thorp: Affiliation:
Translational Neurogenomics Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, Australia
E. M. Derks: Affiliation:
Translational Neurogenomics Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, Australia
C. M. Middeldorp: Affiliation:
Child Health Research Centre, University of Queensland, Brisbane, Australia; Child and Youth Mental Health Service, Children's Health Queensland Hospital and Health Service, Brisbane, Australia; Department of Biological Psychology, VU University Amsterdam, Amsterdam, The Netherlands
*: Author for correspondence: I. Schwabe, E-mail: I.Schwabe@uvt.nl

Abstract

To identify genetic risk loci for major depressive disorder (MDD), two broad study design approaches have been applied: (1) to maximize sample size by combining data from different phenotype assessment modalities (e.g. clinical interview, self-report questionnaires) and (2) to reduce phenotypic heterogeneity through selecting more homogenous MDD subtypes. The value of these strategies has been debated. In this review, we summarize the most recent findings of large genomic studies that applied these approaches, and we highlight the merits and pitfalls of both approaches with particular attention to methodological and psychometric issues. We also discuss the results of analyses that investigated the heterogeneity of MDD. We conclude that both study designs are essential for further research. So far, increasing sample size has led to the identification of a relatively high number of genomic loci linked to depression. However, part of the identified variants may be related to a phenotype common to internalizing disorders and related traits. As such, samples containing detailed clinical information are needed to dissect depression heterogeneity and enable the potential identification of variants specific to a more restricted MDD phenotype. A balanced portfolio reconciling both study design approaches is the optimal approach to progress further in unraveling the genetic architecture of depression.

Keywords

Depression; GWAS; MDD; phenotypic heterogeneity; power; PRS; psychometrics

Type: Review Article
Information: Psychological Medicine, Volume 49, Issue 16, December 2019, pp. 2646-2656

DOI: https://doi.org/10.1017/S0033291719002502
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s) 2019

Introduction

The advent of the genomic era represents a turning point in unraveling the biological underpinnings of depression. The term depression, if not otherwise specified, is used throughout this review in its broadest meaning, from relevant symptoms assessed via self-report methods to clinical diagnosis ascertained by psychiatric interview. Specific definitions adopted in different studies will be described when discussing the related results. The heritability estimate for major depressive disorder (MDD), defined as a psychiatric diagnosis established according to the criteria of the Diagnostic and Statistical Manual of Mental Disorders (DSM, American Psychiatric Association, 2000, 2013) or the International Statistical Classification of Diseases and Related Health Problems (ICD, World Health Organization, 2018), is around 40% (Sullivan et al., 2000). Initial failures to reliably detect associations of single genetic variants with MDD were attributed to underpowered studies with small sample sizes and to the clinical heterogeneity of the psychiatric trait, which further compromised the power of association studies. Levinson et al. (2014) summarized the challenges facing the initial genome-wide association studies (GWAS) of MDD and proposed two, non-mutually exclusive, strategies to overcome them.

The first suggestion was to maximize the number of cases and controls, a strategy that has been successfully applied to schizophrenia (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014) and many other complex diseases (Visscher et al., 2017). Due to the higher prevalence and lower heritability of MDD, it was estimated that three to five times as many cases would be required to detect the same number of genome-wide significant single-nucleotide polymorphisms (SNPs) as compared to schizophrenia (Wray et al., 2012). The second proposed strategy was to enhance GWAS statistical power by reducing heterogeneity through selecting clinically more homogenous depression phenotypes.

In recent years, both strategies have been applied. To increase power by increasing sample size, subjects with self-reported diagnoses of depression and/or continuous measures of the whole phenotypic range of depression were included in the analyses (e.g. Wray et al., 2018). An advantage of this approach is that it increases sample size without the logistical and financial challenges of collecting large clinical MDD samples. However, the validity of this procedure has been criticized (Abbasi, 2017). For example, it has been argued that a self-reported clinical diagnosis is not equivalent to a psychiatric assessment, resulting in the inclusion of misclassified and/or clinically non-relevant cases.

To increase power by decreasing heterogeneity, studies have been specifically designed to recruit depression cases with more severe profiles (e.g. with recurrent MDD or with diagnosis made in hospital settings) (CONVERGE consortium, 2015; Pedersen et al., 2018). Other researchers have tried to decrease heterogeneity by stratifying MDD patients according to relevant clinical features, such as age of onset, symptom profiles, or postpartum depression (Viktorin et al., 2016; Milaneschi et al., 2017; Power et al., 2017). A relatively small number of replicated loci have been identified in the studies that applied these procedures.

Greater awareness of the merits and pitfalls of both strategies is instrumental for their effective application in the next generation of studies aimed at advancing our understanding of the genetics of depression. This review summarizes the evidence emerging from the application of the two strategies in the context of large genomic studies. After reporting the most recent findings, we highlight the main strengths supporting their rationale and the major points of criticism, with particular attention to methodological and psychometric issues. The review concludes with a discussion of future opportunities and challenges.

GWAS in depression

Table 1 gives an overview of all GWAS on depression that have been published so far. The table is restricted to GWAS with a sample size of at least 10 000, but a complete list can be found in the online Supplementary material. For every study, details of the study population and the assessment instrument(s) used are provided. Different definitions of depression were used across different studies, based on clinical diagnosis, self-reported clinical diagnosis, or self-reported symptom/questionnaire data. For example, the first study that successfully identified genetic variants for depression selected a population with a relatively homogeneous phenotype. The CONVERGE consortium restricted the phenotype to recurrent severe MDD (patients from clinical settings with at least two episodes) in Han Chinese women (5303 cases and 5337 controls). Two independent and replicable genetic risk loci were significantly associated with this phenotype (CONVERGE consortium, 2015). Secondary analyses were restricted to cases who met the DSM-IV criteria for melancholia (4509 cases and 5337 controls), a more severe subtype of MDD (Kendler, 1997). Although this sample was smaller, the association with the two significant variants was found to be stronger. So far, these associations have not been identified in European samples (Major depression Working Group of the Psychiatric GWAS consortium et al., 2013; Wray et al., 2018), possibly because these variants occur at low frequency in individuals of European ancestry.

Table 1. Overview of the number of significant loci and $h^2_{\rm SNP}$ in genome-wide association studies on depression (sample size >10 000 subjects)

MARS, Munich Antidepressant Response Signature project; PGC-MDD, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium; CHARGE, Cohorts for Heart and Aging Research in Genomic Epidemiology consortium; CONVERGE, China, Oxford and Virginia Commonwealth University Research on Genetic Epidemiology consortium; HCHS/SOL, Hispanic Community Health Study/Study of Latinos.

*Not reported.

^a ‘MDD’: ascertainment by clinical diagnosis or diagnostic interview fulfilling the criteria for major depressive disorder; ‘major depression’: ascertainment by self-reported diagnosis or treatment for major depressive disorder; ‘depressive symptoms’: phenotypes that utilize self-reported symptoms of major depression.

^b N total includes the number of individuals in cohorts with continuous measures as well as the total number of cases and controls.

Another group of studies tried to capitalize on maximizing sample size to identify significant genetic variants. For example, in order to assemble very large samples to increase statistical power, a number of collaborative studies combined data that rely on instruments based on self-report, potentially resulting in large differences in phenotypic depth among the different combined samples. At the time of writing, one of the largest published GWAS meta-analyses consisted of 135 458 cases and 344 901 controls (Wray et al., 2018). The data structure included an ‘anchor PGC29 cohort’ from the PGC combining 29 samples using mostly standard methods for assessing lifetime MDD (i.e. personal interviews by trained interviewers using structured diagnostic methods). These data were combined with those of six ‘expanded’ cohorts that used different methods to identify clinically significant depression: deCODE, GERA, and iPSYCH used electronic medical records; Generation Scotland used a structured diagnostic interview for MDD; UK Biobank used both self-report of symptoms or help-seeking and electronic records; and 23andMe used self-reported clinical diagnosis or treatment by a medical professional. Subjects meeting formal clinical criteria for MDD and those self-identified with minimal phenotyping were classified as cases of a broader phenotype labeled ‘major depression’ (MD). The GWAS identified 44 genome-wide significant independent loci (Wray et al., 2018). More recently, this GWAS was combined with the latest data released from UK Biobank, which included a phenotype labeled ‘broad depression’ that was based on two self-report questions on help-seeking for mental health difficulties, totaling 246 363 cases and 561 190 controls (Howard et al., 2019). This resulted in the identification of 101 independently associated loci at a genome-wide significant level.

Studies on heterogeneity

With the exception of CONVERGE, no large GWAS has investigated genetic risk factors of a more homogenous MDD definition, but various studies have aimed at dissecting depression heterogeneity by analyzing existing large-scale genomics datasets. One of the approaches used so far is to stratify cases along relevant clinical features, which are then compared in terms of their genetic profile to establish whether they form more homogenous subgroups.

The selection of clinical features relevant for stratification, so far, has been based on the results of family and twin studies. Early-onset MDD, defined as MDD with the first episode taking place before the age of 30 years, has been observed to be associated with a higher risk of MDD in relatives, while late onset has been related to a higher risk of vascular diseases in relatives (Kendler et al., 2009). Furthermore, two major twin studies have found a higher heritability for MDD in women than in men (approximately 40% v. 30% and 42% v. 29%, respectively) and clear evidence for sex-specific genetic effects with an estimated genetic correlation in liability to MD in men and women at approximately 0.55 and 0.63, respectively (Kendler et al., 2001, 2006). Moreover, unaffected co-twins of patients endorsing atypical ‘reversed vegetative symptoms’ (e.g. hyperphagia, weight gain, and hypersomnia) had a higher body mass index (BMI) (Kendler et al., 2009). Lastly, the differential impact of environmental risk factors has been considered to be another major source of heterogeneity, as the variance in liability to MDD has a comparatively large environmental component (Sullivan et al., 2000).

Analyses based on GWAS results showed moderate to high genetic correlations between subgroups of cases stratified according to the clinical features listed above. A genetic correlation, commonly denoted as r_g, represents a correlation between the true effect sizes of SNPs affecting the different subgroups (or two different traits). A high correlation implies that, on average, SNPs have directionally similar effects on the two subgroups (see, e.g. Maier et al., 2018 for more details). In data from the anchor cohort of the PGC, r_g between early-onset and late-onset MDD was approximately 0.99 (Power et al., 2017), and around 0.82 between MDD patients with atypical symptoms of increased appetite and/or weight and those with more typical symptoms of decreased appetite and/or weight (Milaneschi et al., 2017). In data from CONVERGE, r_g between MDD cases exposed to early stressful life events and childhood sexual abuse and unexposed cases was around 0.62 (Peterson et al., 2018).
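The definition of a genetic correlation given above can be illustrated with a minimal simulation sketch. It draws hypothetical per-SNP effect sizes for two subgroups with a chosen correlation (0.62 is borrowed from the text purely as an example value), then recovers r_g as the Pearson correlation of the effects and reports how often the two effects point in the same direction. The number of SNPs, the seed, and all function names are assumptions made for illustration; real analyses only have noisy effect estimates from GWAS summary statistics and use dedicated estimation methods.

```python
import math
import random


def simulate_effects(n_snps, r_g, seed=1):
    """Draw standardized per-SNP effects for two subgroups with correlation r_g."""
    rng = random.Random(seed)
    effects_a, effects_b = [], []
    for _ in range(n_snps):
        shared = rng.gauss(0.0, 1.0)
        unique = rng.gauss(0.0, 1.0)
        effects_a.append(shared)
        # Mixing a shared and a unique part gives correlation r_g with effects_a.
        effects_b.append(r_g * shared + math.sqrt(1.0 - r_g ** 2) * unique)
    return effects_a, effects_b


def pearson(x, y):
    """Plain Pearson correlation between two equally long lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def sign_agreement(x, y):
    """Fraction of SNPs whose effects have the same sign in both subgroups."""
    return sum(1 for a, b in zip(x, y) if a * b > 0) / len(x)


# Hypothetical example: 20 000 SNPs, true r_g = 0.62 (illustrative value).
effects_a, effects_b = simulate_effects(n_snps=20000, r_g=0.62)
r_hat = pearson(effects_a, effects_b)
same_direction = sign_agreement(effects_a, effects_b)
print(f"estimated r_g: {r_hat:.2f}")
print(f"share of SNPs with same-direction effects: {same_direction:.2f}")
```

The sketch makes the "on average" qualifier concrete: even with a sizeable r_g, a substantial minority of SNPs can have effects in opposite directions in the two subgroups.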

However, the results of polygenic risk score analyses also indicated genetic heterogeneity. In PGC data, cases with earlier-onset MDD had a higher polygenic risk for schizophrenia and bipolar disorder (Power et al., 2017). After stratification by age of onset quantiles, one replicated genome-wide significant locus for the oldest quartile (adult-onset, >27 years) was identified. Furthermore, also in PGC data, MDD patients with atypical increased or typical decreased appetite and/or weight were divergent in the extent of overlap with genetic variants for immune-metabolic features: only MDD with atypical symptoms showed a specific overlap (r_g = 0.53) with BMI (Milaneschi et al., 2017). Compared to the control group, only depressed patients with atypical symptoms carried a significantly higher polygenic risk burden for increased BMI and for higher circulating levels of CRP, leptin, and BMI-adjusted leptin. In stratified GWAS analyses, the direct comparison between the two subgroups of cases yielded one genome-wide significant association. Lastly, to estimate the degree of heterogeneity due to exposure to adversity, the genomic relationship matrix (GRM) of the CONVERGE data was extended to include an interaction with a measure of adversity exposure (e.g. early stressful life events and childhood sexual abuse) in the SNP-based heritability ($h^2_{\rm SNP}$) estimation. The GRM estimates the genetic relationship between individuals based on SNP information, and $h^2_{\rm SNP}$ denotes the variance explained by all SNPs used in a GWAS in conventionally unrelated individuals (Yang et al., 2017). Results suggested that 13.2% of MDD liability could be attributed to genome-wide interaction with adversity exposure. Furthermore, an adversity exposure-stratified GWAS comparing MDD cases against controls detected three associated loci only in participants with no history of adversities. Appropriate replication was, however, not feasible due to the unavailability of samples with similar features (Peterson et al., 2018). Sex-stratified analyses in the UK Biobank and Generation Scotland: Scottish Family Health Study (GS:SFHS) yielded a similar $h^2_{\rm SNP}$ estimate (~0.2) across males and females and showed no detectable discrepancies in genetic overlap with health-correlated traits, but stratified GWAS analysis revealed one, non-replicated, genome-wide significant locus for MDD in male patients (Hall et al., 2018).
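To make the GRM described above more concrete, the following sketch computes a GRM from simulated genotypes using the widely used standardization in which each SNP is centered at twice its allele frequency and scaled by its expected standard deviation. It then builds an interaction matrix by keeping GRM entries only for pairs of individuals with the same exposure status, which is one construction used in the literature for genome-by-environment models. Whether this matches the exact specification of the cited CONVERGE analysis is an assumption; the genotypes, the exposure vector, and all function names are hypothetical.

```python
import math
import random


def simulate_genotypes(n_individuals, n_snps, seed=3):
    """Simulate 0/1/2 allele counts for unrelated individuals (hypothetical data)."""
    rng = random.Random(seed)
    freqs = [rng.uniform(0.1, 0.9) for _ in range(n_snps)]
    return [[(rng.random() < p) + (rng.random() < p) for p in freqs]
            for _ in range(n_individuals)]


def standardized_columns(genotypes):
    """Center each SNP at 2p and scale by sqrt(2p(1-p)); drop monomorphic SNPs."""
    n = len(genotypes)
    columns = []
    for i in range(len(genotypes[0])):
        p = sum(row[i] for row in genotypes) / (2.0 * n)
        if 0.0 < p < 1.0:
            sd = math.sqrt(2.0 * p * (1.0 - p))
            columns.append([(row[i] - 2.0 * p) / sd for row in genotypes])
    return columns


def grm(genotypes):
    """GRM entry (j, k): average over SNPs of the product of standardized genotypes."""
    columns = standardized_columns(genotypes)
    n, m = len(genotypes), len(columns)
    return [[sum(col[j] * col[k] for col in columns) / m for k in range(n)]
            for j in range(n)]


def interaction_grm(a, exposure):
    """Keep GRM entries only for pairs sharing the same exposure status (assumed model)."""
    n = len(a)
    return [[a[j][k] if exposure[j] == exposure[k] else 0.0 for k in range(n)]
            for j in range(n)]


genotypes = simulate_genotypes(n_individuals=20, n_snps=500)
A = grm(genotypes)
exposure = [j % 2 for j in range(len(genotypes))]  # hypothetical adversity indicator
A_gxe = interaction_grm(A, exposure)
mean_diagonal = sum(A[j][j] for j in range(len(A))) / len(A)
print(f"mean diagonal of GRM: {mean_diagonal:.2f}")
```

In a variance-component analysis, both matrices would enter a mixed model, and the share of liability variance attached to the interaction matrix is the quantity the text refers to as attributable to genome-wide interaction with adversity.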

Another strategy to investigate heterogeneity is to apply an analytical technique called BUHMBOX, which aims to verify whether the genetic correlation between two traits can be explained by the presence of a subgroup in the first trait (i.e. heterogeneity) that is genetically similar to the second trait (Han et al., 2016). This method was applied in preliminary analyses on >30 000 samples from UK Biobank and GS:SFHS, showing that genetic correlations of MDD with high triglycerides, cholesterol, and blood pressure might be explained by heterogeneity among MDD cases. This provides further evidence for the presence of a subgroup of MDD cases in which metabolic alterations may represent a specific pathophysiological pathway (Howard et al., 2018).

Merits and pitfalls of the two approaches

The strategy of maximizing sample size by expanding the phenotype of MDD as defined by the DSM or ICD to include self-reported diagnosis of depression and/or continuous measures of depression is based on the assumption that the underlying liability to depression is a normally distributed severity continuum in the population, with MDD representing the extreme tail of this distribution. The demarcation line that delineates MDD is thus somewhat arbitrary, and MDD is not an entirely separate entity. This is in accordance with the ‘multiple-threshold model’ (Reich et al., 1972), which asserts that different syndromes reflect only different levels of severity on a single dimension, not distinct etiologies. Under this conceptualization, combining different measures of depression, even when ‘sampled’ at different points in the underlying distribution (in the normal and in the clinical range), substantially increases the statistical power to detect common risk variants. Evidence for the multiple-threshold model was found in a recent twin study showing that MDD and minor depression (characterized by at least two but fewer than five of the symptoms of MDD) lie on the same single dimension of liability with different levels of severity (Corfield et al., 2017). It is important to note that the study by Corfield et al. (2017) considered two similar clinically ascertained syndromes mainly differentiated by the number of endorsed symptoms.
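The multiple-threshold idea described above can be sketched numerically: on a standard normal liability scale, each syndrome corresponds to a cut-off chosen so that the upper tail matches its prevalence. The prevalence values below (15% for MDD, 25% for MDD or minor depression combined) are illustrative assumptions, not figures from the studies cited, and the function names are hypothetical.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist(0.0, 1.0)


def liability_threshold(prevalence):
    """Cut-off on a standard normal liability with upper-tail area = prevalence."""
    return STANDARD_NORMAL.inv_cdf(1.0 - prevalence)


def classify(liability, t_minor, t_mdd):
    """Assign a category from a single liability value and two ordered thresholds."""
    if liability >= t_mdd:
        return "MDD"
    if liability >= t_minor:
        return "minor depression"
    return "unaffected"


# Assumed, illustrative prevalences (not taken from the cited studies).
PREVALENCE_MDD = 0.15
PREVALENCE_MDD_OR_MINOR = 0.25

t_mdd = liability_threshold(PREVALENCE_MDD)
t_minor = liability_threshold(PREVALENCE_MDD_OR_MINOR)
print(f"threshold for MDD: {t_mdd:.2f} SD above the mean")
print(f"threshold for minor depression: {t_minor:.2f} SD above the mean")
print(classify(0.8, t_minor, t_mdd))
```

Under this model the two syndromes differ only in where the cut-off falls on one dimension, which is why moving the threshold (for example by using a broader phenotype definition) changes who counts as a case without changing the underlying liability.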

In contrast, it has been argued that depression phenotypes obtained with clinical v. self-report assessment represent different entities rather than different thresholds on the same liability, and that this is indicated by the difference in $h^2_{\rm SNP}$ estimates for the different phenotypes. In particular, lower $h^2_{\rm SNP}$ estimates obtained using self-report phenotyping have been considered a result of misclassification of subjects with conditions other than depression (Cai et al., 2018). In the PGC GWAS (Wray et al., 2018), $h^2_{\rm SNP}$ estimated on the liability scale for depression varied from 0.26 in the GenScot cohort (clinically ascertained) to 0.08 in 23andMe data (self-report), although the confidence intervals largely overlapped. In a preliminary analysis of UK Biobank data, Cai et al. (2018) reported an SNP-heritability of 26% for a DSM-based diagnosis of lifetime MDD derived from an online Mental Health questionnaire, and lower $h^2_{\rm SNP}$ estimates (<15%) for alternative definitions based on minimal phenotyping. Previous results of the UK Biobank study (Howard et al., 2018), on the other hand, did not reveal highly divergent $h^2_{\rm SNP}$ between different depression phenotypes, with estimates of ~10% for self-report broad depression and ICD-9 or ICD-10 coded MDD based on hospital records. Note that the findings of lower heritability estimates of MDD based on self-report symptom/questionnaire data could also be explained by an attenuation in heritability estimates due to random noise (e.g. higher unsystematic measurement error) resulting from a noisier phenotype (van den Berg et al., 2007; van der Sluis et al., 2010; Schwabe et al., 2014, 2019) rather than from measuring a different phenotype.
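The attenuation argument above can be sketched under classical test theory for a continuous phenotype: if the observed score equals the true score plus random error that is independent of genotype, the heritability of the observed score is the true heritability multiplied by the reliability (the share of observed variance that is true-score variance). The reliability of 0.5 and the true heritability of 0.26 below are illustrative assumptions, not estimates for any cohort, and the function names are hypothetical.

```python
import math
import random


def attenuated_h2(h2_true, reliability):
    """Heritability of a noisy measure under classical test theory."""
    if not (0.0 <= h2_true <= 1.0 and 0.0 < reliability <= 1.0):
        raise ValueError("h2_true must be in [0, 1] and reliability in (0, 1]")
    return h2_true * reliability


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def simulated_h2(h2_true, reliability, n=50000, seed=7):
    """Check the formula by simulation: genetic variance / observed variance."""
    rng = random.Random(seed)
    # True score has variance 1; noise variance is set so that
    # var(true) / var(observed) equals the requested reliability.
    noise_sd = math.sqrt((1.0 - reliability) / reliability)
    genetic, observed = [], []
    for _ in range(n):
        g = rng.gauss(0.0, math.sqrt(h2_true))
        e = rng.gauss(0.0, math.sqrt(1.0 - h2_true))
        noise = rng.gauss(0.0, noise_sd)
        genetic.append(g)
        observed.append(g + e + noise)
    return variance(genetic) / variance(observed)


# Illustrative values only: true h2 = 0.26, reliability = 0.5.
expected = attenuated_h2(0.26, 0.5)
simulated = simulated_h2(0.26, 0.5)
print(f"formula: {expected:.3f}, simulation: {simulated:.3f}")
```

The point of the sketch is that a lower heritability estimate for a self-report phenotype does not, by itself, distinguish between "a different phenotype" and "the same phenotype measured with more noise".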

Those in favor of the strategy of combining different depression phenotypes consider the presence of a strong genetic correlation between depression phenotypes measured with different instruments as empirical evidence of the validity of this approach. For example, in the PGC GWAS meta-analysis of major depression (Wray et al., 2018), the genetic correlations for the clinically ascertained PGC29 cohort varied from 0.97 in the deCODE sample (including cases from electronic medical records) to 0.67 in the 23andMe sample (self-report assessment), and the weighted mean of all pairwise genetic correlations between cohorts was high (weighted mean r_g = 0.76, s.e. = 0.03). A formal test of heterogeneity between r_g estimates was found to be statistically non-significant, and the authors interpreted this result as an indication of a strong overlap in the common genetic architecture of the different phenotypes, supporting the comparability of assessment via different measurement methods. Furthermore, Wray et al. (2018) found very high genetic correlations (close to +1) between the MD phenotype of their GWAS meta-analysis and two previous GWAS that focused on current depressive symptoms measured with self-report-based questionnaires. Similarly high genetic correlations between clinically defined cases and current symptoms in the general population have been reported for other disorders such as ASD, ADHD, and OCD (Middeldorp et al., 2016; Martin et al., 2018).

Nevertheless, others argue that the reported genetic correlations may arise mainly from the overlap across different depression phenotypes of a large portion of non-specific liability to poor mental health (Cai et al., 2018). The finding of high genetic correlations with different traits such as neuroticism and anxiety (Okbay et al., 2016; Cai et al., 2018; Wray et al., 2018) may be interpreted as emerging from shared non-specific vulnerability, while the multicomponent construct of MDD may be also influenced by specific genetic risk factors. As a consequence, the broad phenotype of MDD used in recent GWAS (Wray et al., 2018; Howard et al., 2019) may result in a substantive increase of power to detect shared genetic risk variants, but not be suitable to detect specific risk factors for MDD (McIntosh et al., 2019). Furthermore, the validity of interpreting the genetic correlation as a parameter to establish equivalency between phenotypes has been questioned. Some have highlighted the need to, instead, focus on the more conservative alternative of using the squared value (i.e. r_g²) reflecting the percentage of SNP effects on one phenotype that can be explained by the SNP effects on other phenotypes (Cai et al., 2018).
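As a worked illustration of the squared-correlation argument, the short sketch below squares the genetic correlations from Wray et al. (2018) that are quoted earlier in this section. The dictionary labels and variable names are chosen here for illustration; only the three r_g values come from the text.

```python
# Genetic correlations quoted in the text from Wray et al. (2018).
reported_rg = {
    "PGC29 v. deCODE": 0.97,
    "PGC29 v. 23andMe": 0.67,
    "weighted mean across cohorts": 0.76,
}

# Squaring r_g gives the share of SNP effects on one phenotype
# that is accounted for by SNP effects on the other.
squared_rg = {label: round(rg ** 2, 2) for label, rg in reported_rg.items()}

for label, value in squared_rg.items():
    print(f"{label}: r_g = {reported_rg[label]:.2f}, r_g squared = {value:.2f}")
```

On this more conservative reading, a correlation of 0.67 corresponds to less than half of the SNP effects being shared, which is why the two sides of the debate can interpret the same estimates differently.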

Overall, the major point of concern raised about combining different depression diagnosis phenotypes (e.g. clinically ascertained diagnoses, self-reported diagnoses/treatment, or self-report questionnaires) is that they might not capture the same quantitative or qualitative psychopathological entity (see Fig. 1 for an illustration of this issue).

Fig. 1. A major point of criticism of combining different depression diagnosis phenotypes is that different assessment methods might identify different parts of the ‘latent depression’ population.

Instead, MDD might constitute a distinct construct, and misclassification can occur when participants answer ‘yes’ to the question of whether they have ever experienced depression without ever having fulfilled the criteria for MDD. Their symptoms, for instance, may better fit conditions with overlapping clinical features, such as dysthymia, anxiety disorders, somatic illnesses, substance use, and even normal bereavement. An in-depth clinical assessment, through detailed probing of differential diagnoses, will provide different results than a self-reported clinical diagnosis (e.g. endorsing the question ‘Have you ever been diagnosed for depression’) or a diagnosis based on a questionnaire (e.g. a self-reported symptom score based on a cut-off). Similarly, the self-report of the indication for antidepressant use may represent a poor proxy for actual MDD. These concerns are strengthened by the results of some studies showing, for instance, poor agreement between self-report and psychiatrist-led interviews (Eaton et al., 2000), or that the majority of US adults who are or were treated with antidepressants did not actually screen positive for MDD (Olfson et al., 2016). Furthermore, differences between clinical interview and self-report measures may emerge from the different timeframes of the assessments: while lifetime is the timeframe for clinical diagnosis, measures based on self-report often focus on current symptoms.

One final issue closely related to the possibility of misclassification is that combining phenotypic data from multiple consortia generally comes with a number of psychometric issues at the phenotypic level. For example, aggregating questionnaire data from different consortia will most likely result in a violation of measurement invariance, meaning that the perceived severity of the symptoms might depend on factors other than the severity of the illness, such as properties of the questionnaire (e.g. choice of items or wording) or characteristics of the respondent that are not relevant for the disease (e.g. cultural or language background) (van den Berg et al., 2014).

The strategy focused on selecting a more homogeneous depression phenotype is based on the assumption that the clinical heterogeneity in depression may emerge from an aggregation of different underlying liabilities expressed through partially distinct biological pathways (see Fig. 2 for an illustration).

Fig. 2. MDD is likely caused by multiple different etiopathological mechanisms. Studies investigating distinct subtypes of depression aim at reducing the underlying pathophysiological heterogeneity.

Researchers who have applied this strategy argue that narrowing the phenotype and increasing phenotypic homogeneity may tag higher underlying genetic homogeneity and, hence, higher heritability, compensating for the related drop in sample size. This approach proved effective in the CONVERGE study, which focused on recurrent severe depression in Han Chinese women and yielded two significant hits. Furthermore, polygenic analyses provide evidence for the existence of heterogeneity in the depression phenotype. Nevertheless, applying this strategy can be challenging. First, the best criterion for stratification, one that accurately identifies more homogeneous subtypes, is a matter of debate. Second, few large genomic datasets include complete measures of the potential features of interest, such as specific symptom profiles. Overall, different studies showed an increase in point estimates for $h_{{\rm SNP}}^2 $, in particular when moving from self-report-based instruments to clinically ascertained depression and in some instances when stratifying MDD cases for certain features, but the confidence intervals surrounding those estimates were too wide to formally confirm that the differences in $h_{{\rm SNP}}^2 $ were statistically significant. An under-investigated area in this respect is the use of repeated measures. Twin studies have shown that the stability in depression is largely explained by genetic factors (see, e.g. Nivard et al., 2015). Moreover, the factor reflecting the stability or agreement over measures has been reported to have a higher heritability than the individual measures (Foley et al., 1998; Lubke et al., 2016). Cheesman et al. (2018) estimated twin heritability and $h_{{\rm SNP}}^2 $ of a stable emotional problems phenotype that was constructed from 12 measures across three ages and three raters using confirmatory factor analysis and item response theory modeling. They found that SNP heritability rose from 5% (not significant) on average for individual measures to 14% (s.e. = 0.049; p = 0.002) when focusing on stable trait variance.
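The role of standard errors in these comparisons can be illustrated with a small calculation. The sketch below is not the procedure used in any of the cited studies; it assumes a simple normal (Wald) approximation, and the function names `wald_test` and `difference_test` are illustrative. The second function additionally assumes that the two estimates are independent, which does not hold when both are derived from the same sample, so the two-estimate example uses hypothetical numbers only.

```python
import math


def wald_test(estimate, standard_error):
    """Return (z, one_sided_p, two_sided_p) under a normal approximation."""
    z = estimate / standard_error
    one_sided_p = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z >= z)
    two_sided_p = math.erfc(abs(z) / math.sqrt(2))   # P(|Z| >= |z|)
    return z, one_sided_p, two_sided_p


def difference_test(est_a, se_a, est_b, se_b):
    """Wald test for est_b - est_a, ASSUMING the estimates are independent."""
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    return wald_test(est_b - est_a, se_diff)


# Estimate and standard error as reported in the text (14%, s.e. = 0.049).
z, p_one, p_two = wald_test(0.14, 0.049)
print(f"z = {z:.2f}, one-sided p = {p_one:.4f}, two-sided p = {p_two:.4f}")

# Hypothetical heritability estimates, chosen only to illustrate the point
# that a higher point estimate need not differ significantly.
z_d, _, p_d = difference_test(0.10, 0.03, 0.15, 0.04)
print(f"difference: z = {z_d:.2f}, two-sided p = {p_d:.3f}")
```

Under this approximation the reported estimate gives a z of about 2.86 and a one-sided p close to 0.002. In the hypothetical comparison, a rise from 10% to 15% with standard errors of 0.03 and 0.04 gives z = 1, far from conventional significance, which is the situation described above for stratified $h_{{\rm SNP}}^2 $ estimates.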

Discussion

Genetic discoveries in depression have lagged behind for a long time, due to several challenges posed by this phenotype (e.g. modest heritability, a high prevalence, the role of environmental influences, and phenotypic heterogeneity). To advance the field and increase power to find meaningful genetic associations, Levinson et al. (2014) proposed two non-mutually exclusive strategies: (1) to substantially increase the sample size of GWAS and (2) to reduce phenotypic heterogeneity by selecting clinically more homogeneous subgroups of cases. The present review summarized the main findings, strengths, and pitfalls of these two strategies.

In light of the debate about the extent to which the different depression phenotypes measure the same entity, it is important to acknowledge that we cannot directly observe MDD as we can other human characteristics (such as a person's height or hair color). Consequently, to make MDD measurable, we need to specify an underlying (psychometric) model to operationalize it. As for every other latent (unobservable) psychiatric trait, the ‘correct’ model is unknown and we can adopt different frameworks. Those in favor of broadening the phenotype adopt a model in which differences in MDD among people are a matter of degree (e.g. MDD being represented by a dimension on which people can be ordered). In contrast, those in favor of decreasing heterogeneity adopt a model in which the difference is a matter of the ‘type’ one does or does not belong to (e.g. MDD being a different entity from other depressive disorders or subthreshold depression). This is an enduring issue in psychology (are psychological attributes best represented as dimensions or as categories?) that further complicates choosing between the two strategies. The findings discussed in this review provide evidence for both frameworks: strong and consistent genetic correlations across studies using different depression phenotypes provide empirical support for a common underlying internalizing trait, whereas specific polygenic signatures for subgroups of depressed patients provide empirical support for the presence of distinct pathophysiological processes acting under the same diagnostic label. Overall, results from large-scale genetic studies draw a composite picture of the underlying liability of depression.

The aforementioned debate is closely related to what Kendler (2014) has referred to as the ‘reification problem’: diagnostic criteria have been misinterpreted as the actual dimension they are designed to assess. The definition of MDD is not based on a fundamental biomarker or pathophysiology. Consequently, the diagnostic criteria currently used to evaluate the presence of MDD or, for that matter, any other psychiatric disorder, are based on descriptive signs, often selected from clinical tradition rather than from empirical evidence. In the absence of biological markers reflective of etiopathological mechanisms, major updates of psychiatric nosology have, so far, revolved around the debate whether psychiatric disorders should be conceptualized as fewer broad categories or as more fine-grained categories. As Kendler (2014) highlighted, the absolute reliance on these criteria and the loss of awareness of their indexical function lead to reification and diagnostic literalism, confusing diagnostic criteria with the actual trait they are designed to assess.

Despite indications of detectable heterogeneity, the studies that aimed at maximizing sample size have so far identified a higher number of genome-wide replicated loci than those that focused on increasing homogeneity. The number of significantly associated genetic variants has risen steadily with increasing sample sizes, indicating that combining different depression phenotypes increases power to detect a large number of common genetic variants shared across similar but distinct phenotypes. In contrast, with the exception of CONVERGE, no GWAS has so far identified specific genetic variants reliably associated with more strictly defined MDD phenotypes. Given the high co-morbidity between MDD and other psychiatric disorders, such as anxiety disorders (Kessler et al., 2003), and personality traits such as neuroticism, and given the prediction of MDD by subthreshold depression (Lee et al., 2018), these identified genetic variants may not be specific for MDD, but may be shared with the vulnerability for anxiety disorders, for other depressive disorders such as dysthymia, and for subthreshold symptoms. We think that identifying these variants is as useful as identifying variants underlying specific, more homogeneous MDD subtypes. This discovery base may subsequently be leveraged to disentangle divergent genetic effects for specific traits.

The availability of increasing numbers of samples with genotype–phenotype data in expanding cohorts and biobanks provides the opportunity to apply different approaches without decreasing, or even while increasing, power, and also to do justice to the heterogeneity. These approaches make the most efficient use of all data collected in the samples. For instance, the results of recent papers leveraging UK Biobank data suggest that specific genetic variants differentially influence individual items used to assess neuroticism (Nagel et al., 2018) and depressive symptoms (Thorp et al., in press). Population-based birth, child, and adolescent cohorts also provide excellent resources to make optimal use of the already available data by investigating genetic variants that influence stability over time (Middeldorp et al., 2019).

Recent initiatives introduced efficient and scalable electronic instruments to assess psychiatric disorders, which may reconcile the need for both a larger number of persons screened and a more refined phenotyping of the trait of interest. For example, the comprehensive online mental-health questionnaire used in UK Biobank identifies operationally defined syndromes such as lifetime depression, mania, anxiety disorder, psychotic-like experiences and self-harm, post-traumatic stress disorder, and substance use disorders. Another example is the Lifetime Depression Assessment Self-report (LIDAS; Bot et al., 2017), which is largely based on the Composite International Diagnostic Interview (CIDI) short form for lifetime depression (CIDI-SF; Kessler et al., 1998; Hamilton et al., 2011) and assesses lifetime history of MDD according to DSM criteria. In feasibility studies, the LIDAS has shown adequate sensitivity (0.85) and specificity (0.80), and has a short median completion time (Bot et al., 2017). These tools hold the potential to provide reliable measures of depression at low cost in large (existing) cohorts and biobanks with genetic data. Furthermore, such instruments may provide information on specific clinical features along which cases can be stratified to identify more homogeneous subgroups, such as endorsement of single symptoms or specific combinations of symptoms.
An important application of genetic stratification may also involve trials testing the efficacy of different treatments for depression: identifying genomic risk profiles of various traits that interact with only a certain class of treatments may provide insights into the complex underlying pathophysiological mechanisms active in depression.
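To make the sensitivity and specificity reported above for the LIDAS more concrete, the sketch below converts them into predictive values with Bayes' rule. This calculation is not part of the cited validation study: the prevalence of 0.15 is a hypothetical value chosen for illustration, the function name `predictive_values` is illustrative, and the sketch assumes that sensitivity and specificity carry over unchanged to the cohort in which the instrument is applied.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a binary instrument via Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)  # P(case | screened positive)
    npv = true_neg / (true_neg + false_neg)  # P(non-case | screened negative)
    return ppv, npv


# Sensitivity and specificity as reported in the text; the prevalence is a
# HYPOTHETICAL lifetime prevalence, not a figure taken from the source.
ppv, npv = predictive_values(sensitivity=0.85, specificity=0.80, prevalence=0.15)
print(f"PPV = {ppv:.2f}, NPV = {npv:.2f}")
```

Under these assumptions fewer than half of the screen-positive individuals would be true cases, while most screen-negative individuals would be true non-cases. The numbers change with the assumed prevalence, so they should be read as an illustration of how misclassification can enter case definitions in large cohorts rather than as a property of the instrument.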

Finally, further developments in analytical methods and knowledge on depression heterogeneity may enhance the strengths of the two main strategies reviewed here. For instance, by leveraging the shared genetic liability identified across different phenotypes, the newly developed multi-trait analysis of GWAS (MTAG) has been shown to substantially increase statistical power to detect genetic associations for each trait in a joint analysis of multiple traits using GWAS summary statistics (Turley et al., 2018). Pooling data on depression, neuroticism, and subjective well-being, the application of MTAG substantially increased the number of significantly associated loci for each trait compared to single-trait analyses, with depression-associated loci increasing from 32 to 64. Recently, multivariate methods were proposed that allow the identification of variants with effects on common cross-trait liability and of variants that cause divergence, such as genomic structural equation modeling (GenomicSEM; Grotzinger et al., 2019) and genome-wide association meta-analysis (GWAMA; Baselmans et al., 2019).

To conclude, increasing the sample size by adding samples with non-traditional diagnostic approaches has so far enabled the identification of a large number of genetic variants for depression, although such samples must be added with great care. Some of these variants probably also influence other internalizing disorders and related traits. As sample sizes increase, further increases in the number of genetic loci robustly associated with depression will likely be achieved.

At the same time, it is also important to collect samples with genotype data and detailed clinical information. These samples will be essential to dissect clinical and biological heterogeneity using genetic instruments. Eventually, sample sizes larger than those available today may allow the reliable identification of genetic loci specifically linked to more strictly defined depression phenotypes. Adoption of a balanced portfolio reconciling the two main strategies discussed in the present review is probably the optimal approach to make further progress in unraveling the genetic architecture of depression.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291719002502.

Footnotes

I. Schwabe and Y. Milaneschi are the co-first authors.

References

Abbasi, J (2017) 23andMe, big data, and the genetics of depression. Journal of the American Medical Association 317, 14–16.

American Psychiatric Association (2000) Diagnostic and Statistical Manual of Mental Disorders, 4th ed., text rev. Washington, DC.

American Psychiatric Association (2013) Diagnostic and Statistical Manual of Mental Disorders, 5th ed., text rev. Washington, DC.

Baselmans, BML, Jansen, R, Ip, HF, van Dongen, J, Abdellaoui, A, van de Weijer, MP, Bao, Y, Smart, M, Kumari, M, Willemsen, G, Hottenga, J, BIOS consortium, Social Science Genetic Association Consortium, Boomsma, DI, de Geus, EJC, Nivard, MG and Bartels, M (2019) Multivariate genome-wide analyses of the well-being spectrum. Nature Genetics 51, doi:10.1038/s41588-018-0320-8.

Bot, M, Middeldorp, CM, de Geus, EJC, Lau, HM, van Nieuwenhuizen, B, Smit, JH, Boomsma, DI and Penninx, BWJH (2017) Validity of LIDAS (Lifetime Depression Assessment Self-report): a self-report online assessment of lifetime major depressive disorder. Psychological Medicine 47, 279–289.

Cai, N, et al. (2015) Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature 523, 588.

Cai, N, Kendler, K and Flint, J (2018) Minimal phenotyping yields GWAS hits of low specificity for major depression. bioRxiv https://doi.org/10.1101/440735.

Cheesman, R, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, Purves, KL, Pingault, J, Breen, G, Rijsdijk, F, Plomin, R and Eley, TC (2018) Extracting stability increases the SNP heritability of emotional problems in young people. Translational Psychiatry 8, e223.

CONVERGE consortium (2015) Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature 523, 588–591.

Corfield, EC, Yang, Y, Martin, NG and Nyholt, DR (2017) A continuum of genetic liability for minor and major depression. Translational Psychiatry 16, e1131.

Direk, N, et al. (2016) An analysis of two genome-wide association meta-analyses identifies a new locus for broad depression phenotype. Biological Psychiatry 82, 322–329.

Dunn, EC, Sofer, T, Wang, MJ, Soare, TW, Gallo, LC, Gogarten, SM, et al. (2018) Genome-wide association study of depressive symptoms in the Hispanic Community Health Study/Study of Latinos. Journal of Psychiatric Research 99, 167–176.

Eaton, WW, Neufeld, K, Chen, LS and Cai, G (2000) A comparison of self-report and clinical diagnostic interviews for depression: diagnostic interview schedule and schedules for clinical assessment in neuropsychiatry in the Baltimore epidemiologic catchment area follow-up. Archives of General Psychiatry 57, 217–222.

Foley, DL, Neale, MC and Kendler, KS (1998) Reliability of a lifetime history of major depression: implications for heritability and co-morbidity. Psychological Medicine 28, 857–870.

Grotzinger, AD, Rhemtulla, M, de Vlaming, R, Ritchie, SJ, Mallard, TT, Hill, D, Ip, HF, Marioni, RE, McIntosh, AM, Deary, IJ, Koellinger, PD, Harden, P, Nivard, MG and Tucker-Drob, EM (2019) Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nature Human Behaviour 3, 513–525.

Hall, SL, Adams, MJ, Arnau-Soler, A, Clarke, TK, Howard, DM, Zeng, Y, Davies, G, Hagenaars, SP, Fernandez-Pujals, AM, Gibson, J, Wigmore, EM, Boutin, TS, Hayward, C, Generation Scotland, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, Porteous, DJ, Deary, IJ, Thomson, PA, Haley, CS and McIntosh, AM (2018) Genome-wide meta-analyses of stratified depression in Generation Scotland and UK Biobank. Translational Psychiatry 8, 9.

Hamilton, CM, Strader, LC, Pratt, JG, Maiese, D, Hendershot, T, Kwok, RK, Hammond, JA, Huggins, W, Jackman, D, Pan, HQ, Nettles, DS, Beaty, TH, Farrer, LA, Kraft, P, Marazita, ML, Ordovas, JM, Pato, CN, Spitz, MR, Wagener, D, Williams, M, Junkins, HA, Harlan, WR, Ramos, EM and Haines, J (2011) The PhenX toolkit: get the most from your measures. American Journal of Epidemiology 174, 253–260.

Han, B, Pouget, JG, Slowikowski, K, Stahl, E, Lee, CH, Diogo, D, Hu, X, Park, YR, Kim, E, Gregersen, PK, Dahlqvist, SR, Worthington, J, Martin, J, Eyre, S, Klareskog, L, Huizinga, T, Chen, WM, Onengut-Gumuscu, S, Rich, SS, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, Wray, NR and Raychaudhuri, S (2016) A method to decipher pleiotropy by detecting underlying heterogeneity driven by hidden subgroups applied to autoimmune and neuropsychiatric diseases. Nature Genetics 48, 803–810.

Hek, K, et al. (2013) A genome-wide association study of depressive symptoms. Biological Psychiatry 73, 667–678.

Howard, DM, Adams, MJ, Shirali, M, Clarke, T, Marioni, RE, Davies, G, Coleman, JRI, Alloza, C, Shen, X, Barbu, MC, Wigmore, EM, Gibson, J, 23andMe Research Team, Hagenaars, SP, Lewis, CM, Ward, J, Smith, DJ, Sullivan, PF, Haley, CS, Breen, G, Deary, I and McIntosh, AM (2018) Genome-wide association study of depression phenotypes in UK Biobank identifies variants in excitatory synaptic pathways. Nature Communications 9, 1470.

Howard, DM, Adams, MJ, Clarke, T, Hafferty, JD, Gibson, J, Shirali, M, Coleman, JRI, Ward, J, Wigmore, EM, Alloza, C, Shen, X, Barbu, MC, Xu, EY, Whalley, HC, Marioni, RE, Porteous, DJ, Davies, G, Deary, IJ, Hemani, G, Tian, C, Hinds, DA, 23andMe Research Team, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, Trzaskowski, MT, Byrne, EM, Ripke, S, Smith, DJ, Sullivan, PF, Wray, NR, Breen, G, Lewis, CM and McIntosh, AM (2019) Genome-wide meta-analysis of depression in 807,553 individuals identifies 102 independent variants with replication in a further 1,507,153 individuals. Nature Neuroscience 22, 343–352.

Hyde, CL, Nagle, MW, Tian, C, Chen, X, Paciga, SA, Wendland, JR, Tung, JY, Hinds, DA, Perlis, RH and Winslow, AR (2016) Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nature Genetics 48, 1031–1036.

Kendler, KS (1997) The diagnostic validity of melancholic major depression in a population-based sample of female twins. Archives of General Psychiatry 54, 299–304.

Kendler, KS (2014) DSM issues: incorporation of biological tests, avoidance of reification, and an approach to the ‘box canyon problem’. American Journal of Psychiatry 171, 1248–1250.

Kendler, KS, Gardner, CO, Neale, MC and Prescott, CA (2001) Genetic risk factors for major depression in men and women: similar or different heritabilities and same or partly distinct genes? Psychological Medicine 31, 605–616.

Kendler, KS, Gatz, M, Gardner, CO and Pedersen, NL (2006) A Swedish national twin study of lifetime major depression. American Journal of Psychiatry 163, 109–114.

Kendler, KS, Fiske, A, Gardner, CO and Gatz, M (2009) Delineation of two genetic pathways to major depression. Biological Psychiatry 65, 808–811.

Kessler, RC, Andrews, G, Mroczek, D, Ustun, B and Wittchen, HU (1998) The World Health Organization Composite International Diagnostic Interview short-form (CIDI-SF). International Journal of Methods in Psychiatric Research 7, 171–185.

Kessler, RC, Berglund, P, Demler, O, Jin, R, Koretz, D, Merikangas, KR, Rush, AJ, Walters, EE, Wang, PS and National Comorbidity Survey Replication (2003) The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). Journal of the American Medical Association 289, 3095–3105.

Kohli, MA, et al. (2011) The neuronal transporter gene SLC6A15 confers risk to major depression. Neuron 70, 252–265.

Lee, YY, Stockings, EA, Harris, MG, Doi, SAR, Page, IS, Davidson, SK and Barendregt, JJ (2018) The risk of developing major depression among individuals with subthreshold depression: a systematic review and meta-analysis of longitudinal cohort studies. Psychological Medicine 13, 1–11.

Levinson, DF, Mostafavi, S, Milaneschi, Y, Rivera, M, Ripke, S, Wray, NR and Sullivan, PF (2014) Genetic studies of major depressive disorder: Why are there no genome-wide association study findings and what can we do about it? Biological Psychiatry 76, 510–512.

Li, X, Luo, Z, Gu, C, Hall, LS, McIntosh, AM, Zeng, Y, et al. (2018) Common variants on 6q16.2, 12q24.31 and 16p13.3 are associated with major depressive disorder. Neuropsychopharmacology 43, 2146–2153.

Lubke, GH, Miller, PJ, Verhulst, B, Bartels, M, van Beijsterveldt, T, Willemsen, G, Boomsma, DI and Middeldorp, CM (2016) A powerful phenotype for gene-finding studies derived from trajectory analyses of symptoms of anxiety and depression between age seven and 18. American Journal of Medical Genetics Part B Neuropsychiatric Genetics 171, 948–957.

Maier, RM, Visscher, PM, Robinson, MR and Wray, NR (2018) Embracing polygenicity: a review of methods and tools for psychiatric genetics research. Psychological Medicine 48, 1055–1067.

Major Depressive Disorder Working Group of the Psychiatric GWAS Consortium (2013) A mega-analysis of genome-wide association studies for major depressive disorder. Molecular Psychiatry 18, 497–511.

Martin, J, Taylor, MJ and Lichtenstein, P (2018) Assessing the evidence for shared genetic risks across psychiatric disorders and traits. Psychological Medicine 48, 1759–1774.

McIntosh, AM, Sullivan, PF and Lewis, CM (2019) Uncovering the genetic architecture of major depression. Neuron 3, 91–103.

Middeldorp, CM, et al. (2016) A genome-wide association meta-analysis of attention-deficit/hyperactivity disorder symptoms in population-based paediatric cohorts. Journal of the American Academy of Child & Adolescent Psychiatry 55, 896–905.

Middeldorp, CM, Felix, JF, Mahajan, A, EArly Genetics Lifecourse Epidemiology (EAGLE) consortium, Early Growth Genetics (EGG) consortium and McCarthy, MI (2019) The Early Growth Genetics (EGG) and EArly Genetics and Lifecourse Epidemiology (EAGLE) consortia: design, results and future prospects. European Journal of Epidemiology 34, 279–300.

Milaneschi, Y, Lamers, F, Peyrot, WJ, Baune, BT, Breen, G, Dehghan, A, Forstner, AJ, Grabe, HJ, Homuth, G, Kan, C, Lewis, C, Mullins, N, Nauck, M, Pistis, G, Preisig, M, Rivera, M, Rietschel, M, Streit, F, Strohmaier, J, Teumer, A, Van der Auwera, S, Wray, NR, Boomsma, DI, Penninx, BWJH and CHARGE Inflammation Working Group and the Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium (2017) Genetic association of major depression with atypical features and obesity-related immunometabolic dysregulations. JAMA Psychiatry 74, 1214–1225.

Nagel, M, Watanabe, K, Stringer, S, Posthuma, D and van der Sluis, S (2018) Item-level analyses reveal genetic heterogeneity in neuroticism. Nature Communications 9, e905.

Nivard, MG, Dolan, CV, Kendler, KS, Kan, KJ, Willemsen, G, van Beijsterveldt, CE, Lindauer, RJ, van Beek, JH, Geels, LM, Bartels, M, Middeldorp, CM and Boomsma, DI (2015) Stability in symptoms of anxiety and depression as a function of genotype and environment: a longitudinal twin study from ages 3 to 63 years. Psychological Medicine 45, 1039–1049.

Okbay, A, Baselmans, BM, De Neve, JE, et al. (2016) Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nature Genetics 48, 624–633.

Olfson, M, Blanco, C and Marcus, SC (2016) Treatment of adult depression in the United States. JAMA Internal Medicine 176, 1482–1491.

Pedersen, CB, Bybjerg-Grauholm, J, Pedersen, MG, Grove, J, Agerbo, E, Bækvad-Hansen, M, Poulsen, JB, Hansen, CS, McGrath, JJ, Als, TD, Goldstein, JI, Neale, BM, Daly, MJ, Hougaard, DM, Mors, O, Nordentoft, M, Børglum, AD, Werge, T and Mortensen, PB (2018) The iPSYCH2012 case-cohort sample: new directions for unravelling genetic and environmental architectures of severe mental disorders. Molecular Psychiatry 23, 6–14.

Peterson, RE, Cai, N, Dahl, AW, Bigdeli, TB, Edwards, AC, Webb, BT, Bacanu, S, Zaitlen, N, Flint, J and Kendler, KS (2018) Molecular genetic analysis subdivided by adversity exposure suggests etiologic heterogeneity in major depression. American Journal of Psychiatry 175, 545–545.

Power, RA, et al. (2017) Genome-wide association for major depression through age at onset stratification: major depressive disorder working group of the psychiatric genomics consortium. Biological Psychiatry 81, 325–335.

Reich, T, James, JW and Morris, CA (1972) The use of multiple thresholds in determining the mode of transmission of semi-continuous traits. Annals of Human Genetics 36, 163–184.

Ripke, S, et al. (2013) Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nature Genetics 45, 1150–1159.

Schizophrenia Working Group of the Psychiatric Genomics Consortium (2014) Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427.

Schwabe, I and van den Berg, SM (2014) Assessing genotype by environment interaction in case of heterogeneous measurement error. Behavior Genetics 44, 394–406.

Schwabe, I, Go, Z, Tijmstra, J and Pohl, S (2019) Psychometric modelling of longitudinal genetically-informative twin data. Frontiers in Genetics 10, doi:10.3389/fgene.2019.00837.

Sullivan, PF, Neale, MC and Kendler, KS (2000) Genetic epidemiology of major depression: review and meta-analysis. American Journal of Psychiatry 157, 1552–1562.

Thorp, JG, Marees, A, Ong, J, An, J, MacGregor, S and Derks, EM (in press) Investigating genetic heterogeneity in major depression through item-level genetic analyses of the PHQ-9. Psychological Medicine.

Turley, P, Walters, RK, Maghzian, O, Okbay, A, Lee, JJ, Fontana, MA, Nguyen-Viet, TA, Wedow, R, Zacher, M, Furlotte, NA, Magnusson, P, Oskarsson, S, Johannesson, M, Visscher, PM, Laibson, D, Cesarini, D, Neale, BM, Benjamin, DJ and 23andMe Research Team, Social Science Genetic Association Consortium (2018) Multi-trait analysis of genome-wide association summary statistics using MTAG. Nature Genetics 50, 229–237.

van den Berg, SM, Glas, CA and Boomsma, DI (2007) Variance decomposition using an IRT measurement model. Behavior Genetics 37, 604–616.

van den Berg, SM, et al. (2014) Harmonization of neuroticism and extraversion phenotypes across inventories and cohorts in the genetics of personality consortium: an application of item response theory. Behavior Genetics 14, 295–313.

Van der Sluis, S, Verhage, M, Posthuma, D and Dolan, CV (2010) Phenotypic complexity, measurement bias and poor phenotypic resolution contribute to the missing heritability problem in genetic association studies. PLoS ONE 5, e13929.

Viktorin, A, Meltzer-Brody, S, Kuja-Halkola, R, Sullivan, PF, Landén, M, Lichtenstein, P and Magnusson, PK (2016) Heritability of perinatal depression and genetic overlap with nonperinatal depression. American Journal of Psychiatry 173, 158–165.

Visscher, PM, Wray, NR, Zhang, Q, Sklar, P, McCarthy, MI, Brown, MA and Yang, J (2017) 10 years of GWAS discovery: biology, function, and translation. American Journal of Human Genetics 101, 5–22.

World Health Organization (2018) International Statistical Classification of Diseases and Related Health Problems, 11th ed., text rev. Geneva, Switzerland.

Wray, NR, et al. (2012) Genome-wide association study of major depressive disorder: new results, meta-analysis, and lessons learned. Molecular Psychiatry 17, 36–48.

Wray, NR, et al. (2018) Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nature Genetics 50, 668.

Yang, J, Zeng, J, Goddard, ME, Wray, NR and Visscher, PM (2017) Concepts, estimation and interpretation of SNP-based heritability. Nature Genetics 49, 1304–1310.

Table 1. Overview of the number of significant loci and $h_{{\rm SNP}}^2 $ in genome-wide association studies on depression (sample size >10 000 subjects)
