Hostname: page-component-586b7cd67f-t7fkt Total loading time: 0 Render date: 2024-11-26T08:15:59.681Z Has data issue: false hasContentIssue false

Polygenic scores, and the genome-wide association studies they derive from, will have difficulty identifying genes that predispose one to develop a social behavioral trait

Published online by Cambridge University Press:  11 September 2023

Edward Fox*
Affiliation:
Department of Psychological Science, Purdue University, West Lafayette, IN, USA [email protected] https://www.purdue.edu/hhs/psy/directory/faculty/Fox_Edward.html

Abstract

Polygenic scores (PGSs) have several limitations. They are confounded with environmental effects on behavior and cannot be used to study how mutations affect brain function and behavior. For this, mutations with large effects, which often arise in only one geographical population are needed. Genome-wide association studies (GWASs), commonly used for identifying mutations, have difficulty detecting these mutations. A strategy that overcomes this challenge is discussed.

Type
Open Peer Commentary
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press

Proponents of sociogenomics argue polygenic scores (PGSs) should be incorporated into social science research. PGSs are derived from many variants (up to thousands) of very small effect size (common variants) that are associated with measures of a social behavior trait as determined by genome-wide association studies (GWASs; Pain et al., Reference Pain, Glanville, Hagenaars, Selzam, Fürtjes, Gaspar and Lewis2021). As Burt described, although PGSs are suggested to be measures of genetic influence or propensity for complex traits, several factors make it difficult or impossible to distinguish genetic and environmental effects on such traits. Thus, PGSs are unlikely to be strictly genetic predictors of the propensity to exhibit a trait. Moreover, PGSs do not typically identify alleles or variants responsible for a phenotype (Astle et al., Reference Astle, Elding, Jiang, Allen, Ruklisa, Mann and Soranzo2016). This is unfortunate because identifying a deleterious mutation would permit its biological activity to be studied. The information obtained could provide the knowledge needed to repair or counteract the deleterious effects of the mutation. This limitation of PGSs is exemplary of a broader issue as GWASs have identified thousands of strong associations with complex diseases and traits, but in very few instances has the actual risk variant been identified (Chorley et al., Reference Chorley, Wang, Campbell, Pittman, Noureddine and Bell2008), or have they been successfully translated into clinical use (Bomba, Walter, & Soranzo, Reference Bomba, Walter and Soranzo2017). Identifying causal common variants in GWASs has been difficult because they usually map to regulatory regions (Astle et al., Reference Astle, Elding, Jiang, Allen, Ruklisa, Mann and Soranzo2016), where they influence gene expression, including processes involved in execution of gene expression such as splicing (Lalonde et al., Reference Lalonde, Ha, Wang, Bemmo, Kleinman, Kwan and Majewski2011).

In contrast to common variants of small effect size, rare variants that have large effects on phenotypes have been identified. These variants are often associated with the protein-coding portion of a gene. As proteins are important for structural and physiological functions of cells, mutations that affect them can produce these large effects. An explanation for the rarity of these variants based on evolutionary theory proposes that the detrimental effect of disease on fitness results in selection against variants that promote disease (Gibson, Reference Gibson2012).

Rare variants are often identified by quantitative trait locus (QTL) analysis, which looks for correlations between variants and measures of continuous phenotypic traits (Bloom et al., Reference Bloom, Boocock, Treusch, Sadhu, Day, Oates-Barker and Kruglyak2019). The goal is to uncover the locations in the genome important for these traits. A variation of this analysis that has identified rare variants of large effect size used individuals that displayed the trait of interest and individuals that did not display it from multiple generations of families or isolated populations. Rare variants might be found at higher frequencies in isolated populations because of previous bottleneck events, genetic drift or adaptation, and selection (Moltke et al., Reference Moltke, Grarup, Jørgensen, Bjerregaard, Treebak, Fumagalli and Hansen2014). This increases the power to detect associations between rare variants and phenotypes (Colonna et al., Reference Colonna, Pistis, Bomba, Mona, Matullo, Boano and Toniolo2013). In these studies that sample from families or isolated populations, variants that are closely linked to the mutation or causative allele are present in individuals that exhibit the trait at higher frequency than in individuals that do not display the trait. The locations of these variants indicate the chromosome region likely to contain the mutation. Positional cloning within this region can be used to identify the mutated gene and then comparison of this gene's DNA sequence in subjects with and without the trait can identify the causative mutation. Even though this mutation might only be present in a family or isolated population, the ability to study how any mutation alters the brain to influence a complex behavioral trait would be a breakthrough. An example of a study with success using this strategy focused on Canadian families of Celtic descent with multiple relatives in up to three generations diagnosed for schizophrenia (Brzustowicz, Hodgkinson, Chow, Honer, & Bassett, Reference Brzustowicz, Hodgkinson, Chow, Honer and Bassett2000). A highly significant association between schizophrenia and a locus on chromosome 1q21–q22 was found. Then additional variants within this region were used to pinpoint the nitric oxide synthase 1 adaptor gene (Brzustowicz et al., Reference Brzustowicz, Simone, Mohseni, Hayter, Hodgkinson, Chow and Bassett2004). This gene is overexpressed in the frontal cortex of people with schizophrenia, and it is involved in synaptic function and cortical neuron development, effects that could contribute to schizophrenia (Carrel et al., Reference Carrel, Hernandez, Kwon, Mau, Trivedi, Brzustowicz and Firestein2015; Hernandez et al., Reference Hernandez, Swiatkowski, Patel, Liang, Dudzinski, Brzustowicz and Firestein2016).

In contrast, GWASs, and thus PGSs, do not typically detect QTLs or rare variants of large effect size because these variants are rare in the total population sampled by GWASs. The power to detect a variant of any effect size decreases with the frequency of the variant because fewer individuals in the sample carry a less-frequent variant (Zuk et al., Reference Zuk, Schaffner, Samocha, Do, Hechter, Kathiresan and Lander2014). Put another way, because GWASs calculate the average effects of alleles across thousands of individuals, they cannot capture heterogeneity of effect sizes at the family level (Gibson, Reference Gibson2012).

Can approaches that detect rare variants be useful for sociogenomics? It could be argued that some measures of interest in sociogenomics, for example, level of educational attainment, could not be accounted for by one or a few rare variants. However, the contrast between what GWASs and PGSs identify best (common variants of small effect size) versus what QTL and related approaches identify best (rare variants of large effect size) suggests QTL and related approaches could have significant relevance for sociogenomics. As discussed above, by studying the right population it may be possible to identify associations of a complex behavioral trait with rare variants of large effect and ultimately identify one or more causative alleles. Social behaviors are complex and depend on multiple interacting neural systems as illustrated in a recent review on neural encoding of social valence (Padilla-Coreano, Tye, & Zelikowsky, Reference Padilla-Coreano, Tye and Zelikowsky2022). Social attributes, social memory, social rank, and social isolation were proposed to influence valence assignment to social stimuli, which in turn influences social interactions. Also, the separate neural circuits that control each of these influences were described, noting some overlap of these circuits. Interestingly, they suggest that across psychiatric disorders, brain regions that contribute to encoding of valence and social functions exhibit abnormal activity during emotional processing (e.g., Laviolette, Reference Laviolette2007). Thus, if a mutation disrupts one or more of the neural systems that influences valence assignment, this might lead to abnormal social interactions and a search might identify causal variants, including rare ones of large effect size.

Financial support

This research received no special grant from any funding agency, commercial, or not-for-profit sectors.

Competing interest

None.

References

Astle, W. J., Elding, H., Jiang, T., Allen, D., Ruklisa, D., Mann, A. L., … Soranzo, N. (2016). The allelic landscape of human blood cell trait variation and links to common complex disease. Cell, 167(5), 14151429, e1419. doi: 10.1016/j.cell.2016.10.042CrossRefGoogle ScholarPubMed
Bloom, J. S., Boocock, J., Treusch, S., Sadhu, M. J., Day, L., Oates-Barker, H., & Kruglyak, L. (2019). Rare variants contribute disproportionately to quantitative trait variation in yeast. eLife, 8. https://doi.org/10.7554/eLife.49212CrossRefGoogle ScholarPubMed
Bomba, L., Walter, K., & Soranzo, N. (2017). The impact of rare and low-frequency genetic variants in common disease. Genome Biology, 18(1), 77. doi: 10.1186/s13059-017-1212-4CrossRefGoogle ScholarPubMed
Brzustowicz, L. M., Hodgkinson, K. A., Chow, E. W., Honer, W. G., & Bassett, A. S. (2000). Location of a major susceptibility locus for familial schizophrenia on chromosome 1q21–q22. Science (New York, N.Y.), 288(5466), 678682. doi: 10.1126/science.288.5466.678CrossRefGoogle Scholar
Brzustowicz, L. M., Simone, J., Mohseni, P., Hayter, J. E., Hodgkinson, K. A., Chow, E. W., & Bassett, A. S. (2004). Linkage disequilibrium mapping of schizophrenia susceptibility to the CAPON region of chromosome 1q22. American Journal of Human Genetics, 74(5), 10571063. doi: 10.1086/420774CrossRefGoogle Scholar
Carrel, D., Hernandez, K., Kwon, M., Mau, C., Trivedi, M. P., Brzustowicz, L. M., & Firestein, B. L. (2015). Nitric oxide synthase 1 adaptor protein, a protein implicated in schizophrenia, controls radial migration of cortical neurons. Biological Psychiatry, 77(11), 969978. doi: 10.1016/j.biopsych.2014.10.016CrossRefGoogle ScholarPubMed
Chorley, B. N., Wang, X., Campbell, M. R., Pittman, G. S., Noureddine, M. A., & Bell, D. A. (2008). Discovery and verification of functional single nucleotide polymorphisms in regulatory genomic regions: Current and developing technologies. Mutation Research, 659(1–2), 147157. doi: 10.1016/j.mrrev.2008.05.001CrossRefGoogle ScholarPubMed
Colonna, V., Pistis, G., Bomba, L., Mona, S., Matullo, G., Boano, R., … Toniolo, D. (2013). Small effective population size and genetic homogeneity in the Val Borbera isolate. European Journal of Human Genetics, 21(1), 8994. doi: 10.1038/ejhg.2012.113CrossRefGoogle ScholarPubMed
Gibson, G. (2012). Rare and common variants: Twenty arguments. Nature Review Genetics, 13(2), 135145. doi: 10.1038/nrg3118CrossRefGoogle ScholarPubMed
Hernandez, K., Swiatkowski, P., Patel, M. V., Liang, C., Dudzinski, N. R., Brzustowicz, L. M., & Firestein, B. L. (2016). Overexpression of isoforms of nitric oxide synthase 1 adaptor protein, encoded by a risk gene for schizophrenia, alters actin dynamics and synaptic function. Frontiers in Cellular Neuroscience, 10, 6. doi: 10.3389/fncel.2016.00006CrossRefGoogle ScholarPubMed
Lalonde, E., Ha, K. C., Wang, Z., Bemmo, A., Kleinman, C. L., Kwan, T., … Majewski, J. (2011). RNA sequencing reveals the role of splicing polymorphisms in regulating human gene expression. Genome Research, 21(4), 545554. doi: 10.1101/gr.111211.110CrossRefGoogle ScholarPubMed
Laviolette, S. R. (2007). Dopamine modulation of emotional processing in cortical and subcortical neural circuits: Evidence for a final common pathway in schizophrenia? Schizophrenia Bulletin, 33(4), 971981. doi: 10.1093/schbul/sbm048CrossRefGoogle ScholarPubMed
Moltke, I., Grarup, N., Jørgensen, M. E., Bjerregaard, P., Treebak, J. T., Fumagalli, M., … Hansen, T. (2014). A common Greenlandic TBC1D4 variant confers muscle insulin resistance and type 2 diabetes. Nature, 512(7513), 190193. doi: 10.1038/nature13425CrossRefGoogle ScholarPubMed
Padilla-Coreano, N., Tye, K. M., & Zelikowsky, M. (2022). Dynamic influences on the neural encoding of social valence. Nature Reviews Neuroscience, 23(9), 535550. doi: 10.1038/s41583-022-00609-1CrossRefGoogle ScholarPubMed
Pain, O., Glanville, K. P., Hagenaars, S. P., Selzam, S., Fürtjes, A. E., Gaspar, H. A., … Lewis, C. M. (2021). Evaluation of polygenic prediction methodology within a reference-standardized framework. PLoS Genetics, 17(5), e1009021. doi: 10.1371/journal.pgen.1009021CrossRefGoogle ScholarPubMed
Zuk, O., Schaffner, S. F., Samocha, K., Do, R., Hechter, E., Kathiresan, S., … Lander, E. S. (2014). Searching for missing heritability: Designing rare variant association studies. Proceedings of the National Academy of Sciences USA, 111(4), E455E464. doi: 10.1073/pnas.1322563111CrossRefGoogle ScholarPubMed