Multiomic prioritisation of risk genes for anorexia nervosa

Danielle M. Adams; William R. Reay; Murray J. Cairns

doi:10.1017/S0033291723000235

Multiomic prioritisation of risk genes for anorexia nervosa

Published online by Cambridge University Press: 20 February 2023

Danielle M. Adams ,

William R. Reay and

Murray J. Cairns

Show author details

Danielle M. Adams: Affiliation:
School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
William R. Reay: Affiliation:
School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
Murray J. Cairns*: Affiliation:
School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
*: Author for correspondence: Murray J. Cairns, E-mail: Murray.Cairns@newcastle.edu.au

Article contents

Abstract
Background
Methods
Results
Conclusions
Introduction
Materials and methods
Results
Discussion
Author contributions
Conflict of interest
Data availability
References

Rights & Permissions

Abstract

Background

Anorexia nervosa (AN) is a psychiatric disorder associated with marked morbidity. Whilst AN genetic studies could identify novel treatment targets, integration of functional genomics data, including transcriptomics and proteomics, would assist to disentangle correlated signals and reveal causally associated genes.

Methods

We used models of genetically imputed expression and splicing from 14 tissues, leveraging mRNA, protein, and mRNA alternative splicing weights to identify genes, proteins, and transcripts, respectively, associated with AN risk. This was accomplished through transcriptome, proteome, and spliceosome-wide association studies, followed by conditional analysis and finemapping to prioritise candidate causal genes.

Results

We uncovered 134 genes for which genetically predicted mRNA expression was associated with AN after multiple-testing correction, as well as four proteins and 16 alternatively spliced transcripts. Conditional analysis of these significantly associated genes on other proximal association signals resulted in 97 genes independently associated with AN. Moreover, probabilistic finemapping further refined these associations and prioritised putative causal genes. The gene WDR6, for which increased genetically predicted mRNA expression was correlated with AN, was strongly supported by both conditional analyses and finemapping. Pathway analysis of genes revealed by finemapping identified the pathway regulation of immune system process (overlapping genes = MST1, TREX1, PRKAR2A, PROS1) as statistically overrepresented.

Conclusions

We leveraged multiomic datasets to genetically prioritise novel risk genes for AN. Multiple-lines of evidence support that WDR6 is associated with AN, whilst other prioritised genes were enriched within immune related pathways, further supporting the role of the immune system in AN.

Keywords

Alternative splicing anorexia nervosa mRNA protein expression PWAS TWAS

Type: Original Article
Information: Psychological Medicine , Volume 53 , Issue 14 , October 2023 , pp. 6754 - 6762

DOI: https://doi.org/10.1017/S0033291723000235 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

Introduction

Anorexia nervosa (AN) is a complex psychiatric disorder associated with alterations to satiety, activity, and self-perception that results in severe mental distress and malnourishment (Sibeoni et al., Reference Sibeoni, Orri, Valentin, Podlipski, Colin, Pradere and Revah-Levy2017). Cognitive behavioural therapy and weight rehabilitation are the first-line treatments for AN, as there are still no approved pharmacotherapies for the disorder (Wonderlich, Bulik, Schmidt, Steiger, & Hoek, Reference Wonderlich, Bulik, Schmidt, Steiger and Hoek2020). Unravelling the biological complexity of AN onset and its clinical course will be key to developing more effective interventions and improving clinical management.

AN is influenced by genetic and environmental factors, with twin studies estimating heritability at 56% (Bulik et al., Reference Bulik, Sullivan, Tozzi, Furberg, Lichtenstein and Pedersen2006), and common variants are now shown to account for around 10–20% of liability scale heritability through genome-wide association studies (GWAS) (Hirtz & Hinney, Reference Hirtz and Hinney2020). The most recent AN GWAS uncovered eight independent genome-wide associated loci (Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman, Gaspar and Bulik2019). GWAS present an opportunity to better understand the biology of AN, as well as potentially identify novel treatment targets and opportunities for drug repurposing (Reay & Cairns, Reference Reay and Cairns2021). For example, the latest AN GWAS consolidated the strong genetic overlap between the disorder and systemic metabolic factors like cholesterol and insulin, leading to AN being conceptualised a ‘metabo-psychiatric’ disorder (Adams, Reay, Geaghan, & Cairns, Reference Adams, Reay, Geaghan and Cairns2021; Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman, Gaspar and Bulik2019). An ongoing challenge in the field of AN genetics is to identify key genes and biological systems that are informative to the pathogenesis of the disorder and may be relevant for treatment.

One way to approach this challenge is through gene-based aggregation methods that can increase power to detect associations beyond genome-wide significant loci and yield more biologically relevant information. In other words, rather than studying individual risk variants, these data can be collapsed at the level of genes to reduce multiple-testing burden and resolve key disorder-associated biological processes. For example, transcriptome wide association studies (TWAS) achieve this by integrating mRNA expression data with GWAS association data to detect genes for which genetically predicted expression is associated with the trait (Wainberg et al., Reference Wainberg, Sinnott-Armstrong, Mancuso, Barbeira, Knowles, Golan and Kundaje2019). TWAS can be conceptualised as a genetic approach to more traditional differential expression analyses. Specifically, rather than directly measuring mRNA expression in cases and controls, estimates of genetic effects on mRNA expression are integrated with the effect of those same genetic variants on a phenotype or disorder of interest. The expression component of this approach constructs a model that predicts mRNA expression for each gene using genetic variants. As a result, TWAS is capable of prioritising genes that may be involved in the disorder and assign them a direction of genetically predicted expression that is odds increasing (Wainberg et al., Reference Wainberg, Sinnott-Armstrong, Mancuso, Barbeira, Knowles, Golan and Kundaje2019). The direction of effect associated with the disorder derived from TWAS, that is, upregulation or downregulation, is particularly useful in the context of identifying compounds that could reverse the risk-increasing direction of expression. This method can also be extended to other quantitative functional data such as protein and alternatively spliced mRNA isoform abundance. Despite these useful features, TWAS alone is not a test of causality as genes identified may arise due to the confounding factors like co-regulation between genes or linkage disequilibrium between the variants associated with expression (Reay & Cairns, Reference Reay and Cairns2021). However, TWAS can be subjected to different statistical approaches to attempt to increase the fidelity to identify true risk genes for a trait (Hall et al., Reference Hall, Medway, Pain, Pardiñas, Rees, Escott-Price and O'Donovan2019; Mancuso et al., Reference Mancuso, Freund, Johnson, Shi, Kichaev, Gusev and Pasaniuc2019).

AN TWAS have previously been performed and revealed some insights into the pathogenesis of the disorder (Chatzinakos et al., Reference Chatzinakos, Georgiadis, Lee, Cai, Vladimirov, Docherty and Bacanu2020; Cheng et al., Reference Cheng, Qi, Liang, Zhang, Ma, Li and Zhang2020; Johnson et al., Reference Johnson, Cote, Dobbyn, Sloofman, Xu, Cotter and Huckins2022). For example, TWAS using the S-PrediXcan approach (Barbeira et al., Reference Barbeira, Dickinson, Bonazzola, Zheng, Wheeler, Torres and Im2018), was previously performed in the study outlining the largest AN GWAS, uncovering 36 genes with predicted expression associated with the disorder using data from Genotype-Tissue Expression (GTEx) project brain and blood data (Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman, Gaspar and Bulik2019). However, these studies in AN have only considered mRNA expression, which may miss effects mediated from alternative splicing or protein expression. Moreover, large sample size post-mortem brain datasets like the PsychENCODE consortium (N _Samples > 1000) with expression and genetic data available present an opportunity to boost power to discover AN associated genes through TWAS. In this study, we utilise models of genetically predicted mRNA expression, protein expression, and alternative mRNA isoform abundance to prioritise genes involved in AN. These association signals were further refined through conditional analysis and finemapping to reveal several genes including WDR6 that may play a role in AN biology. Pathways analysis also implicated regulation of immune system process with gene set functional analysis including signal from MST1, TREX1, PRKAR2A and PROS1.

Materials and methods

Overview of study

We aimed to prioritise genes associated with AN and implicate potential mechanisms related to expression and/or splicing. Firstly, we conducted a comprehensive brain and blood-based transcriptome-wide association study (TWAS) (Gamazon et al., Reference Gamazon, Wheeler, Shah, Mozaffari, Aquino-Michaels, Carroll and Im2015; Gusev et al., Reference Gusev, Ko, Shi, Bhatia, Chung, Penninx and Pasaniuc2016; Reay & Cairns, Reference Reay and Cairns2021). TWAS requires expression data which can be imputed from independent SNP-mRNA expression weights from multivariate models of cis-acting genetically regulated expression (GReX). TWAS then compares imputed expression with SNP-AN effect sizes to test the association between predicted expression and the odds of AN. Genes uncovered from this approach that survived multiple-testing correction were then further probed to refine candidate causal genes through conditional analysis and probabilistic finemapping (Gusev et al., Reference Gusev, Ko, Shi, Bhatia, Chung, Penninx and Pasaniuc2016; Mancuso et al., Reference Mancuso, Freund, Johnson, Shi, Kichaev, Gusev and Pasaniuc2019). Whilst mRNA expression is arguably the most well studied cellular readout, genes may operate more specifically in the pathogenesis of AN through dysregulation of other factors like protein expression and alternative splicing. As a result, we leveraged SNP weights, where available, for protein expression and splicing to also perform a proteome-wide association study (PWAS) and an alternative splicing based test (spliceWAS). Notably, PWAS and spliceWAS analyses have not previously be published for AN. Genes were prioritised based on evidence from mRNA, protein, and alternative isoform expression and then subjected to further in silico analyses related to overrepresentation in biological pathways. The AN GWAS utilised in this study was a meta-analysis encompassing European ancestry cohorts that totalled 16 992 cases and 55 525 controls. In the AN GWAS, case status was mostly ascertained from online questionnaires or structured interviews based on standardised clinical criteria, for example, DSM-IV, whilst the UK Biobank derived cases were self-reported. Further details related to collection of samples, phenotype acquisition, and GWAS approach are described in the original publication (Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman, Gaspar and Bulik2019).

Weights for genetically predicted mRNA, protein, and splicing

Brain related SNP weights (multivariate GReX) for TWAS were derived from GTEx v7 and PsychENCODE, whilst whole blood weights were also obtained from GTEx v7 (Gandal et al., Reference Gandal, Zhang, Hadjimichael, Walker, Chen, Liu and Abyzov2018; Gusev et al., Reference Gusev, Ko, Shi, Bhatia, Chung, Penninx and Pasaniuc2016). The GTEx v7 SNP weights comprise data from twelve different brain regions – with the sample sizes of the cohorts utilised for GReX estimation as follows: amygdala (N = 88), anterior cingulate cortex (N = 109), caudate (N = 144), cerebellar hemisphere (N = 125), cerebellum (N = 154), cortex (N = 136), frontal cortex (N = 136), hippocampus (N = 111), hypothalamus (N = 108), nucleus accumbens (N = 130), putamen (N = 80) and substantia nigra (N = 80). HapMap3 SNPs from the 1000 genomes phase 3 European reference panel were used as a linkage disequilibrium (LD) estimate, to correspond with how the weights were calculated with those same HapMap3 SNPs. The TWAS using whole blood weights utilised the same LD reference strategy, with 369 GTEx v7 participants in these models. Frontal or cerebral cortex tissue from the larger PsychENCODE cohort was also utilised in terms of SNP weights, with a sample size of 1695. As described in the original PsychENCODE publication, gene-wise GReX were estimated for all imputed SNPs, not just the HapMap3 panel, and thus, we utilised the full suite of the phase 3 1000 genomes European subset as the LD reference. There were two tissues for which SNP weights related to protein expression were available – the dorsolateral prefrontal cortex (DLPFC, N = 376) and plasma (N = 7213) (Wingo et al., Reference Wingo, Liu, Gerasimov, Gockley, Logsdon, Duong and Wingo2021; Zhang et al., Reference Zhang, Dutta, Köttgen, Tin, Schlosser, Grams and Chatterjee2021). It should be noted that the plasma weights were derived from the European subset of the study cohort and the authors only used the elastic net method to derive GReX. Analogous to the difference between the GTEx v7 and PsychENCODE studies above, the DLPFC SNP weights were estimated using the HapMap3 panel, and thus, we only used those SNPs as an LD reference. The plasma PWAS utilised the full reference panel. Finally, the splicing related weights were derived from the DLPFC samples from the Common Mind Consortium (N = 452) which utilised the HapMap3 restricted 1000 genomes panel (Gusev et al., Reference Gusev, Mancuso, Won, Kousi, Finucane, Reshef and Price2018).

Implementation of the FUSION pipeline

We conducted TWAS/PWAS/spliceWAS using the FUSION package (Gusev et al., Reference Gusev, Ko, Shi, Bhatia, Chung, Penninx and Pasaniuc2016). Specifically, the FUSION.assoc_test.R script was utilised, with the best performing GReX model selected from five-fold cross-validation (R ²) and the SNP weight set selected by FUSION for calculating the TWAS Z score. In accordance with usual practice for the FUSION approach, only genes/transcripts with significantly non-zero cis-acting heritability (cis-h ²) are included. Prior to analysis, summary statistics were also munged whereby SNPs were retained with an imputation INFO > 0.9, as well as removing indels, strand ambiguous SNPs and SNPs with MAF < 0.01. We corrected for the number of non-missing models tested for the TWAS, PWAS, and spliceWAS independently. Our primary method for correcting for multiple testing was the Bonferroni approach considering all models tested for each modality (TWAS, PWAS, and spliceWAS independently), however, this is inherently conservative due to genes having a GReX model in multiple tissues and correlations between genes. As a result, we also used a more exploratory Benjamini-Hochberg false discovery rate (FDR) approach for the TWAS, PWAS, and spliceWAS separately.

Conditional analyses and probabilistic finemapping

We applied conditional analysis via the FUSION framework (FUSION.post_process.R) to the regions of significant genes after correction to investigate the proportion of implicated genes in any given locus that are independently associated. Specifically, jointly significant genes retain their significance after jointly estimating association for all models within a 500 000 base pair region of a significant gene. Marginally associated genes, which are not jointly significant, likely arise due to factors such as genes for which predicted expression was correlated.

Moreover, we applied probabilistic finemapping to prioritise candidate causal genes from any locus in the AN GWAS summary statistics with at least a suggestively significant SNP (p < 1 × 10⁻⁵) using the FOCUS method (Mancuso et al., Reference Mancuso, Freund, Johnson, Shi, Kichaev, Gusev and Pasaniuc2019). The default prior (p = 1 × 10⁻³), and prior variance (nσ ² = 40), were utilised to approximate Bayes' factors such that the posterior inclusion probability (PIP) of each gene being a member of a credible set with 90% probability of containing the causal gene could be derived. Finemapping was performed with default tissue prioritisation, as well as the prioritisation of brain tissue. In the TWAS, there were two reference panels utilised for finemapping – for genes uncovered from a GTEx v7 tissue, we utilised the default combined FOCUS SNP weight set which collated GTEx v7 tissues, DLPFC (CommonMind), blood (YFS, NTR), and adipose (METSIM) SNP weight sets (https://www.dropbox.com/s/ep3dzlqnp7p8e5j/focus.db?dl=0), with genes discovered using the PsychENCODE weights finemapped specifically using that panel given the different LD parameters and its more complete set of genes with cis-heritable models in that one tissue. The multi-tissue finemapping panel contains several other non-brain tissues, and thus, some GReX models that would not have been available in brain and blood. We sought to balance maximising the number of models available for finemapping, whilst acknowledging that some of the tissues in this panel are less likely to be disease relevant. As FOCUS allows the null mode that the causal feature is not typed to be predicted as a possible member of the credible set, we excluded any genes for which that occurred. The credible set was defined by summing normalised PIP such that ρ was exceeded, sorting the genes, and then including those genes until at least ρ of the normalised-posterior mass is explained, as described in more detail elsewhere (Mancuso et al., Reference Mancuso, Freund, Johnson, Shi, Kichaev, Gusev and Pasaniuc2019; Reay et al., Reference Reay, El Shair, Geaghan, Riveros, Holliday, McEvoy and Cairns2021). As an exploratory analysis, we also applied finemapping to the PWAS and spliceWAS results, although the limited number of SNP weights available for protein and alternative splicing means that the probability of a causal gene for any region not being present is higher.

Investigation of prioritised AN associated genes

We investigated two sets of prioritised genes in silico: (1) conditionally independent associations (TWAS/PWAS/spliceWAS) from marginally significant signals (FDR < 0.05) that are at least nominally jointly significant (p < 0.05), and (2) genes in the finemapped 90% credible set with PIP > 0.4 and the absence of the null model in the credible set. Firstly, we considered biological pathways and other ontological sets for which these two sets of genes could separately be overrepresented via the g:Profiler framework and the Benjamini-Hochberg method for multiple-testing correction (Reimand, Kull, Peterson, Hansen, & Vilo, Reference Reimand, Kull, Peterson, Hansen and Vilo2007). We used the default background for gene-set enrichment (statistical domain size) in g:Profiler of only genes annotated to at one least domain out of the thousands of gene-sets considered.

Results

Novel AN risk genes uncovered through genetically predicted expression or splicing

Firstly, a TWAS was performed using models of genetically predicted mRNA expression across twelve brain regions from GTEx, cortical samples from the PsychENCODE consortium, and whole blood GTEx samples. This was the most well-powered approach as there were 57 596 mRNA models, that is genetically predicted models of expression, available to test, with the number of unique genes totalling 11242, 12183, and 5915, for GTEx brain, PsychENCODE brain, and GTEx whole blood samples, respectively (online Supplementary Table S1). TWAS tested the association between genetically predicted expression of these mRNA and the odds of AN. Correcting for all 57 596 genetic models of mRNA expression (p < 8.68 × 10⁻⁷) revealed 40 association signals (14 unique genes), with several genes revealed in multiple brain regions (online Supplementary Table S1). We note here that ‘signals’ refers to any detected association in a tissue, which is different from unique genes as genes often have GReX models available to test in multiple tissues. The most significant association signals were found in a gene dense region on chromosome 3 (Fig. 1), in line with expectation given the significant GWAS signal for AN (chromosome 3: 47 588 253–51 368 253) overlaps this cluster of significant genes. Upregulation of the gene encoding WD Repeat Domain 6 (WDR6) was the top hit in this region associated with increased odds of AN – Z _TWAS = 7.44, p = 9.96 × 10⁻¹⁴ (GTEx cortex). This gene also surpassed Bonferroni correction in several other tissues, including the larger sample size PsychENCODE cortical samples, whole blood, nucleus accumbens, and caudate basal ganglia. The WD repeat protein family effects signal transduction (Li & Roberts, Reference Li and Roberts2001), whilst WDR6 is thought to influence cell cycle arrest (Xie, Wang, & Chen, Reference Xie, Wang and Chen2007). Several other proximal genes to WDR6 also survived Bonferroni correction in multiple tissues, such as, CCDC71, NCKIPSD, and MST1R. O-6-Methylguanine-DNA Methyltransferase (MGMT) was the most significant gene outside of chromosome 3 (Z _TWAS = −5.33, p = 9.88 × 10⁻⁸ – caudate basal ganglia), followed by CDK11B on chromosome 1, and the long non-coding RNA (lncRNA) LINC00324 on chromosome 17. Previous analysis, differing in methodology and samples, found an association between SUOX expression and AN which was not replicated in the present study (Baird et al., Reference Baird, Liu, Zheng, Sieberts, Perumal, Elsworth and Gaunt2021; Chatzinakos et al., Reference Chatzinakos, Georgiadis, Lee, Cai, Vladimirov, Docherty and Bacanu2020). Using a more lenient FDR approach for multiple-testing correction revealed more association signals (273 signals, 104 unique genes, online Supplementary Table S1). Additionally, we replicated 18 of the 36 significant associations from a previously published AN PrediXcan analysis, including WDR6 and their most significant association DALRD3. We posit that any signals not replicated would be a function of FUSION only using cis-heritable genes, differences in GReX construction between the methods, and our study only including brain and blood tissues.

Fig. 1. TWAS associations and region plot of the densely associated AN signal on chromosome 3. a: Heatmap of genes with at least one Bonferroni significant eQTL tissue associated with AN. Red indicates positive z scores; blue indicates negative z scores (legend). Columns indicates genes, rows indicate tissue models. * Indicates nominally significant genes, ** indicates Benjamini-Hochberg significant associations, *** indicates Bonferroni significant associations. Grey squares indicate that a significantly cis-heritable model of imputed expression data was unavailable in that tissue. b: Relative AN gene and SNP locations and significance. Points in the top panel indicate SNPs, legend indicates r ², left side y-axis indicates the negative log transformed p value of SNPs, right side y-axis indicates the recombination rate (cM/Mb). The bottom panel indicates the location of genes relative to the top panel SNPs. Plot generated using ZoomLocus (Pruim et al., Reference Pruim, Welch, Sanna, Teslovich, Chines, Gliedt and Willer2010) with 200 kb flanking size. The SNP with the most significant p value in this region: rs73082362 is highlighted.

We then sought to extend the power for gene discovery, as well as supporting signals uncovered by the TWAS, through integration of models of genetically regulated protein expression (PWAS) and alternative splicing (spliceWAS, online Supplementary Tables S2, S3). In other words, these analyses considered the association of genetically predicted protein expression and alternative splicing with AN rather than mRNA expression. There were 2300 protein expression SNP weight sets in total available to test across the DLPFC and blood cohorts in which the imputed expression models were trained. Two proteins survived Bonferroni correction in the PWAS, and we uncovered four signals that were significant after applying a less stringent FDR based cut-off (FDR < 0.05). The strongest protein association, MHC class I chain-related gene B (MICB) (Z _PWAS = 4.53, p = 5.75 × 10⁻⁶, plasma) was correlated with increased odds risk of AN, however, due to the extensive LD of the MHC region it is difficult to apply approaches such as PWAS, and thus, this signal should be treated cautiously. The densely associated region on chromosome 3 harboured the next most significant protein expression signal, with decreased predicted expression of Glutathione peroxidase 1 (GPX1) associated with AN in the DLPFC (Z _PWAS = −4.41), mirroring its negative TWAS test statistic in the hypothalamus for mRNA expression (Z _TWAS = −4.51). Thereafter, the remaining two proteins surviving correction were each on different chromosomes (FGF23, chromosome 12 and CTNND1, chromosome 11). The spliceWAS yielded fourteen transcripts that survived Bonferroni correction from the 7708 transcripts tested relating to 3291 total genes. Once more the AN association dense region on chromosome 3 implicated already by GWAS, TWAS, and PWAS, yielded the most significant signal, with seven transcripts of Ariadne RBR E3 Ubiquitin Protein Ligase 2 (ARIH2) significantly associated. Interestingly, there were mixed directions of effect amongst the different isoforms as increased predicted abundance of three splice variants were associated with AN, whilst the converse was true for the remaining four. None of these isoforms correspond to the canonical transcript; chr3:48 918 821:48 967 151 (Howe et al., Reference Howe, Achuthan, Allen, Allen, Alvarez-Jarreta, Amode and Flicek2020). In general, the ARIH2 gene is postulated to regulate ubiquitination (Kelsall et al., Reference Kelsall, Duda, Olszewski, Hofmann, Knebel, Langevin and Alpi2013; Marteijn et al., Reference Marteijn, van Emst, Erpelinck-Verschueren, Nikoloski, Menke, de Witte and van der Reijden2005) and post-transcriptionally modifies NLRP3 to reduce inflammatory activity (Kawashima et al., Reference Kawashima, Karasawa, Tago, Kimura, Kamata, Usui-Kawanishi and Takahashi2017), however, the functional specificity of particular splicing isoforms is less well characterised. This gene was also not significant in the TWAS, despite having a well-powered model of imputed expression in the PsychENCODE cortical samples. The only remaining transcripts surviving Bonferroni correction not proximally located to the chromosome 3 region were two isoforms of GPR75-ASB3 Readthrough (GPR75-ASB3). Decreased predicted abundance of these two isoforms was associated with AN.

Conditional analyses and probabilistic finemapping further refine AN association signals

Co-regulation and LD between genes can confound TWAS signals and lead to spurious associations for non-causal genes. Conditional analysis and finemapping was performed to distinguish genes with increased evidence of exerting an independent causal effect on AN. In other words, we tested whether there was statistical evidence to support each of our identified association signals as directly relevant for the disorder. Conditional analysis (Gusev et al., Reference Gusev, Ko, Shi, Bhatia, Chung, Penninx and Pasaniuc2016) estimates the residual independent association of TWAS signals after controlling for the predicted expression of nearby significant genes. Benjamini-Hochberg significant TWAS genes were subjected to conditional analysis to predict which genes accounted for the localised signal. From 313 significant TWAS signals, 97 genes had a conditionally independent association (p _Joint < 0.05) with AN as indicated by their nominally significant joint p value (online Supplementary Table S4). The gene most significantly associated with AN, WDR6 (Z _{conditional−TWAS} = 7.4, p = 1 × 10⁻¹³, Cortex) maintained an independent association after conditioning on the 77 other TWAS significant gene models (21 unique genes) from chromosome 3, three of which are also conditionally independent (CTNNB1, GOLIM4, STX19).

Finemapping through the FOCUS method (Mancuso et al., Reference Mancuso, Freund, Johnson, Shi, Kichaev, Gusev and Pasaniuc2019) is a Bayesian statistical method designed to isolate subsets of genes more likely to contain causal genes based on a prior expectation of the number of causal genes we expect. We applied this approach to all suggestively significant regions in the AN GWAS (p _GWAS < 1 × 10⁻⁵) to derive 90% credible sets. Credible sets were removed if they contained the null model that is included by FOCUS to account for missing causal mechanisms like genes without a suitable GReX model. Across either mRNA, protein or mRNA alternative splicing weights, there were 116 genes that were prioritised in a credible set (online Supplementary Table S5). Of these, eight genes (Table 1) had moderate (PIP > 0.4) evidence of a causal effect on AN, while five genes demonstrated strong evidence (PIP > 0.8). The strongest evidence of a causal relationship was observed for Neurexophilin And PC-Esterase Domain Family Member 1 (NXPE1) (PIP = 1, testis). Surprisingly, NXPE1 was indicated for the testis (GTEx), whilst it was also highly expressed in the colon (Aguet et al., Reference Aguet, Brown, Castel, Davis, He, Jo and Zappala2017). The other four genes with strong evidence of a potential causal effect on AN were as follows: WDR6 (PIP = 0.997, DLPFC), PRKAR2A (PIP = 0.814, GTEx artery tibial), PROS1 (PIP = 0.895, GTEx cortex) and the non-coding RNA RP13-238F13.5 (PIP = 0.971, GTEx spinal cord cervical c-1). WDR6 and MST1 were the only genes found to be both conditionally independent with at least moderate finemap evidence of a causal relationship (Table 1). Macrophage Stimulating 1 (MST1) mediates cell division and apoptosis (Wang et al., Reference Wang, Yang, Shen, Zhang, Xiang and Weng2020; Zhang et al., Reference Zhang, Sun, Li, Wang, Li and Liu2019) and is predicted to increase risk of AN (Z _TWAS = 4.85, p = 1.2 × 10⁻⁶, GTEx Hypothalamus). However, finemapping of the MST1 PWAS indicates evidence of a protective relationship with AN (Z _finemap = −4.63, PIP = 0.457, blood plasma protein). This discordant direction between mRNA and protein requires further investigation to refine its biological salience. The power of finemapping to identify causal genes increases when more GReX expression models are included, and thus, tissues were used for finemapping that were not subjected to the marginal TWAS, since removing these would decrease finemapping power. There are three genes (RP13- 28F13.5, NXPE1 and PRKAR2A) without TWAS blood and brain associations. One of these; PRKAR2A, exists in a credible set with four other genes (the same set as the finemapped TREX1), three of which are associated within brain regions.

Table 1. Finemapped associations

Finemapped genes with moderate or strong evidence of a causal effect on AN (PIP > 0.4). Tissue indicates which tissue the expression weights were derived from, Ref name indicates the group/consortium responsible for eQTL generation, Z _finemap indicates the estimated Z score association, PIP indicates the posterior inclusion probability and region indicates the GRCh37 genomic region the credible set was derived from. Credible set size indicates the number of genes in the credible set when prioritising brain or any tissue. All genes are in the credible set. Z _TWAS (P) -tissue and Z _PWAS (P) -tissue indicates the TWAS and PWAS Z score and p value associated with the most significant tissue for each association. No spliceWAS associations are available for these eight genes.

Functional interrogation of prioritised AN risk genes suggests a role for immune function

Pathway analysis was performed on the conditionally independent (N = 97) and finemapped credible set of genes (N=8) using g:Profiler (Ensembl 103, Ensembl Genomes 50). We focused on pathways that were enriched for either of these input sets of genes that survived FDR correction (FDR < 0.05) and had at least three intersecting genes with the input list. Firstly, the conditionally independent genes were overrepresented amongst pathways related to the presynaptic active zone (overlapping genes = STX19, CTNND1, CTBP2, CTNNB1) (online Supplementary Table S6). While the genes prioritised by finemapping were overrepresented in the regulation of immune system process (overlapping genes = MST1, TREX1, PRKAR2A, PROS1) (online Supplementary Table S7). We note that as there were only eight genes that survived our finemapping pipeline that pathway analyses using this set of genes is somewhat underpowered.

Discussion

We integrated genome wide association signals for AN with genetically regulated gene expression to identify novel risk genes and biological insights into the disorder. Critically, through the application of methods such as conditional analysis and finemapping, we refined a smaller set of genes with greater confidence of true association that can be subjected to future follow-up study. Leveraging proteomic and alternative splicing data also revealed association signals that were not seen using mRNA data alone. While previous studies have uncovered potential AN risk genes, our use of statistical finemapping and conditional analysis allowed us to prioritise confident associations such as WDR6 that more likely exert a causal effect. Upregulation of WDR6 exhibited a plausible causal relationship with AN across all methods with available data (TWAS, conditional analysis, finemapping). While the surrounding region is rich with genes predicted to have an association, a conditionally significant TWAS signal along with the finemapping evidence suggests that WDR6 is a causal gene at this locus, however, further analyses such as SNP based finemapping and in silico prediction of variants in this locus are warranted to confirm this finding given genetic influence on disease may not directly be mediated by cis-acting expression. WDR6 is a conserved repeat region expressed ubiquitously during human development that may function as a restriction factor to inhibit virus replication and a protein complex assembly platform (Li et al., Reference Li, Burch, Gonzalez, Kashork, Shaffer, Bachinski and Roberts2000; Sivan, Ormanoglu, Buehler, Martin, & Moss, Reference Sivan, Ormanoglu, Buehler, Martin and Moss2015; Smith, Reference Smith, Clemen, Eichinger and Rybakin2008). Previous work in a rodent model suggests WDR6 may modulate insulin signalling in the hypothalamus (Chiba et al., Reference Chiba, Inoue, Mizuno, Komatsu, Fujita, Kubota and Shimokawa2009), which is interesting given that we observed evidence to suggest fasting insulin level exerts a protective effect against AN risk (Adams et al., Reference Adams, Reay, Geaghan and Cairns2021).

Mounting evidence suggests a relationship may exist between immune function and AN (Dalton et al., Reference Dalton, Bartholdy, Robinson, Solmi, Ibrahim, Breen and Himmerich2018a; Reay et al., Reference Reay, Kiltschewskij, Geaghan, Atkins, Carr, Green and Cairns2022). Starvation can lead to inflammation; however, there remains differences between the presentation of immune dysregulation during malnutrition and AN, suggesting an underlying relationship may exist beyond the pathology associated with malnutrition (Gibson & Mehler, Reference Gibson and Mehler2019). Interestingly, four of the eight finemapped genes were members in the immune system processes gene-set, which was a statistically significant overrepresentation. This is supportive of previous observations suggesting that immune system dysregulation may contribute to AN onset and maintenance (Gibson & Mehler, Reference Gibson and Mehler2019). Furthermore, the strongest protein PWAS signal MICB, which implicates the MHC region, and the lead mRNA isoform gene ARIH2 were also present within the immune system processes pathway. However, the overrepresentation in this instance was not significant like the finemapped genes. The identification of MICB in the PWAS necessitates further analysis of the MHC region in AN, such as the role of specific human leucocyte antigen types. Finemapped genes; MST1 (Chanda et al., Reference Chanda, Li, Oligschlaeger, Jeurissen, Houben, Walenbergh and Neumann2016; Lu, Zhao, & Liu, Reference Lu, Zhao and Liu2020), PRKAR2A (Kong et al., Reference Kong, Shen, Liu, Zuo, Ji, Lu and Yu2016) and WDR6 (Lv, Reference Lv2022) have previously been linked to inflammatory processes. Additionally, AN is also observationally associated with a pro-inflammatory state which includes increased levels of cytokines such as tumour necrosis factor and interleukins 1 and 6 (Caso et al., Reference Caso, Graell, Navalón, MacDowell, Gutiérrez, Soto and Marsá2020; Dalton et al., Reference Dalton, Campbell, Chung, Breen, Schmidt and Himmerich2018b; Gibson & Mehler, Reference Gibson and Mehler2019). These studies are likely confounded by factors like reverse causality that complicates their interpretation, whilst there is evidence suggesting that some protein expression differences in severe AN may disappear after rehabilitation (Nilsson et al., Reference Nilsson, Millischer, Göteson, Hübel, Thornton, Bulik and Landén2020). Recent genetic studies suggest a protective effect of C-reactive protein (CRP) on AN, which may relate to infection susceptibility given that CRP is not simply a marker of inflammation (Reay et al., Reference Reay, Kiltschewskij, Geaghan, Atkins, Carr, Green and Cairns2022; Tylee et al., Reference Tylee, Sun, Hess, Tahir, Sharma, Malik and Glatt2018), as is often characterised, and directly participates in processes like phagocytosis (Dalton et al., Reference Dalton, Bartholdy, Robinson, Solmi, Ibrahim, Breen and Himmerich2018a; Del Giudice & Gangestad, Reference Del Giudice and Gangestad2018). Our data suggests that genes with immune related functions are involved in the pathogenesis of AN. Further work is now needed to refine whether the immune system is a plausible target for AN that could lead to new treatments. Pathway analysis performed on prioritised genes support these data that genetic risk for AN may exert a functional role in the immune system. The mechanistic action of the immune system in AN pathophysiology remains still largely uncharacterised, however, further experimental exploration of the specific genes prioritised in this study may reveal clinically relevant insights.

Whilst TWAS, SpliceWAS and PWAS provide a mechanistic framework for the associative evidence between genes and disease, there is also significant confounding by co-regulation derived correlation. TWAS associations can also be confounded by linkage disequilibrium which can bias SNP effect estimates for both expression weights and disease associations (Wainberg et al., Reference Wainberg, Sinnott-Armstrong, Mancuso, Barbeira, Knowles, Golan and Kundaje2019). Performing TWAS is also limited to some extent by the sample size of GReX data from different tissues and that some genes are not expressed and are therefore ‘missing’ from a relevant gene set. Incompleteness of gene expression data diminishes the effectiveness of finemapping, null models within the credible set could be better linked to potentially causal genes which would allow for the identification of other causal genes. Our understanding of the genetic architecture of AN is also far from complete, both in terms of common variants and effects mediated through rare or structural variants. Future larger-scale AN GWAS planned will help to consolidate the strength and replicability of these findings. Previous work in power analysis for TWAS approaches have suggested that both expression and trait heritability/sample size influence discovery power. The liability scale SNP heritability of AN even at current sample sizes is larger than several other psychiatric disorders, supporting that post-GWAS analyses can be deployed despite the above limitations. Moreover, expression-based approaches would be greatly improved by access to cell-type specific genetic models of expression. In terms of bulk-tissue panels, the PsychENCODE cortical dataset is well-powered and captures genetic effects on numerous genes, however, this is not the case for other brain regions and tissues in GTEx where sample size is much smaller. Another limitation of the eQTL association data used in this analysis is that it captures steady-state expression levels which cannot directly distinguish between decay rates of mRNA and transcriptional variance (Pai et al., Reference Pai, Cain, Mizrahi-Man, De Leon, Lewellen, Veyrieras and Gilad2012). Furthermore, these analyses were performed in exclusively European ancestries, and as more diverse non-European samples and trans-ancestry GWAS become available, this is likely to improve TWAS and finemapping studies (Aguet et al., Reference Aguet, Brown, Castel, Davis, He, Jo and Zappala2017; Pai et al., Reference Pai, Cain, Mizrahi-Man, De Leon, Lewellen, Veyrieras and Gilad2012; Veturi & Ritchie, Reference Veturi and Ritchie2018; Watson et al., Reference Watson, Yilmaz, Thornton, Hübel, Coleman, Gaspar and Bulik2019). Finally, the existing AN GWAS has predominately female composition which somewhat restricts the identification of sex specific causal risk/protective factors.

In conclusion, our study highlighted novel genes, proteins and mRNA isoforms predicted to affect AN risk. We provide strong evidence suggesting that WDR6 and several genes related to immune system function contribute to the pathogenesis of the disorder. Further research is warranted to establish these mechanisms and determine their potential as targets for treatment.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291723000235

Acknowledgements

This work was supported by a National Health and Medical Research Council (NHMRC) grant (1188493). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the paper. The authors wish to acknowledge the Eating Disorders working group of the Psychiatric Genomics Consortium for making their summary statistics publicly available.

Author contributions

W.R.R. conceived and designed the study with input from D.M.A and M.J.C. D.M.A performed the analyses. D.M.A, W.R.R. and M.J.C wrote the manuscript. W.R.R. and M.J.C contributed to the interpretation and visualisation of results.

Conflict of interest

The authors have no relevant conflicts of interest to disclose related to this study.

Data availability

All data in this study are publicly available. GWAS summary statistics for AN are available to download from the Psychiatric Genomics Consortium (https://www.med.unc.edu/pgc/download-results/).

The TWAS and spliceWAS SNP weights are available from (http://gusevlab.org/projects/fusion/), whilst the PWAS SNP weights can be obtained from (http://nilanjanchatterjeelab.org/pwas/) and (https://www.synapse.org/#!Synapse:syn23627957) for plasma and brain, respectively.

Code related to this study can be found at the following link: https://github.com/DanielleMAdams/Anorexia_nervosa_associated_genes_manuscript

References

Adams, D. M., Reay, W. R., Geaghan, M. P., & Cairns, M. J. (2021). Investigation of glycaemic traits in psychiatric disorders using Mendelian randomisation revealed a causal relationship with anorexia nervosa. Neuropsychopharmacology, 46(6), 1093–1102. doi: 10.1038/s41386-020-00847-wCrossRef Google Scholar PubMed

Aguet, F., Brown, A. A., Castel, S. E., Davis, J. R., He, Y., Jo, B., … Zappala, Z. (2017). Genetic effects on gene expression across human tissues. Nature, 550(7675), 204–213. doi: 10.1038/nature24277Google Scholar

Baird, D. A., Liu, J. Z., Zheng, J., Sieberts, S. K., Perumal, T., Elsworth, B., … Gaunt, T. R. (2021). Identifying drug targets for neurological and psychiatric disease via genetics and the brain transcriptome. PLOS Genetics, 17(1), e1009224. doi: 10.1371/journal.pgen.1009224CrossRef Google Scholar PubMed

Barbeira, A. N., Dickinson, S. P., Bonazzola, R., Zheng, J., Wheeler, H. E., Torres, J. M., … Im, H. K. (2018). Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nature Communications, 9(1), 1825. doi: 10.1038/s41467-018-03621-1CrossRef Google Scholar PubMed

Bulik, C. M., Sullivan, P. F., Tozzi, F., Furberg, H., Lichtenstein, P., & Pedersen, N. L. (2006). Prevalence, heritability, and prospective risk factors for anorexia nervosa. Archives of General Psychiatry, 63(3), 305–312. doi: 10.1001/archpsyc.63.3.305CrossRef Google Scholar PubMed

Caso, J. R., Graell, M., Navalón, A., MacDowell, K. S., Gutiérrez, S., Soto, M., … Marsá, M. D. (2020). Dysfunction of inflammatory pathways in adolescent female patients with anorexia nervosa. Progress in Neuro-Psychopharmacology and Biological Psychiatry, 96, 109727. doi: 10.1016/j.pnpbp.2019.109727CrossRef Google Scholar PubMed

Chanda, D., Li, J., Oligschlaeger, Y., Jeurissen, M. L., Houben, T., Walenbergh, S. M., … Neumann, D. (2016). MSP is a negative regulator of inflammation and lipogenesis in ex vivo models of non-alcoholic steatohepatitis. Experimental and Molecular Medicine, 48(9), e258. doi: 10.1038/emm.2016.79CrossRef Google Scholar

Chatzinakos, C., Georgiadis, F., Lee, D., Cai, N., Vladimirov, V. I., Docherty, A., … Bacanu, S. A. (2020). TWAS pathway method greatly enhances the number of leads for uncovering the molecular underpinnings of psychiatric disorders. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 183(8), 454–463. doi: 10.1002/ajmg.b.32823CrossRef Google Scholar PubMed

Cheng, B., Qi, X., Liang, C., Zhang, L., Ma, M., Li, P., … Zhang, F. (2020). Integrative genomic enrichment analysis identified the brain regions and development stages related to anorexia nervosa and obsessive-compulsive disorder. Cerebral Cortex, 30(12), 6481–6489. doi: 10.1093/cercor/bhaa214CrossRef Google Scholar PubMed

Chiba, T., Inoue, D., Mizuno, A., Komatsu, T., Fujita, S., Kubota, H., … Shimokawa, I. (2009). Identification and characterization of an insulin receptor substrate 4-interacting protein in rat brain: Implications for longevity. Neurobiology of Aging, 30(3), 474–482. doi: 10.1016/j.neurobiolaging.2007.07.008CrossRef Google Scholar PubMed

Dalton, B., Bartholdy, S., Robinson, L., Solmi, M., Ibrahim, M. A. A., Breen, G., … Himmerich, H. (2018a). A meta-analysis of cytokine concentrations in eating disorders. Journal of Psychiatric Research, 103, 252–264. doi: 10.1016/j.jpsychires.2018.06.002CrossRef Google Scholar PubMed

Dalton, B., Campbell, I. C., Chung, R., Breen, G., Schmidt, U., & Himmerich, H. (2018b). Inflammatory markers in anorexia nervosa: An exploratory study. Nutrients, 10(11), 1573. doi: 10.3390/nu10111573CrossRef Google Scholar PubMed

Del Giudice, M., & Gangestad, S. W. (2018). Rethinking IL-6 and CRP: Why they are more than inflammatory biomarkers, and why it matters. Brain, Behavior, and Immunity, 70, 61–75. doi: 10.1016/j.bbi.2018.02.013CrossRef Google Scholar PubMed

Gamazon, E. R., Wheeler, H. E., Shah, K. P., Mozaffari, S. V., Aquino-Michaels, K., Carroll, R. J., … Im, H. K. (2015). A gene-based association method for mapping traits using reference transcriptome data. Nature Genetics, 47(9), 1091–1098. doi: 10.1038/ng.3367CrossRef Google Scholar PubMed

Gandal, M. J., Zhang, P., Hadjimichael, E., Walker, R. L., Chen, C., Liu, S., … Abyzov, A. (2018). Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science (New York, N.Y.), 362(6420), eaat8127. doi: 10.1126/science.aat8127CrossRef Google Scholar PubMed

Gibson, D., & Mehler, P. S. (2019). Anorexia nervosa and the immune system-A narrative review. Journal of Clinical Medicine, 8(11), 1915. doi: 10.3390/jcm8111915CrossRef Google Scholar PubMed

Gusev, A., Ko, A., Shi, H., Bhatia, G., Chung, W., Penninx, B. W., … Pasaniuc, B. (2016). Integrative approaches for large-scale transcriptome-wide association studies. Nature Genetics, 48(3), 245–252. doi: 10.1038/ng.3506CrossRef Google Scholar PubMed

Gusev, A., Mancuso, N., Won, H., Kousi, M., Finucane, H. K., Reshef, Y., … Price, A. L. (2018). Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nature Genetics, 50(4), 538–548. doi: 10.1038/s41588-018-0092-1CrossRef Google Scholar PubMed

Hall, L. S., Medway, C. W., Pain, O., Pardiñas, A. F., Rees, E. G., Escott-Price, V., … O'Donovan, M. C. (2019). A transcriptome-wide association study implicates specific pre- and post-synaptic abnormalities in schizophrenia. Human Molecular Genetics, 29(1), 159–167. doi: 10.1093/hmg/ddz253CrossRef Google Scholar

Hirtz, R., & Hinney, A. (2020). Genetic and epigenetic findings in anorexia nervosa. Medizinische Genetik, 32(1), 25–29. doi: 10.1515/medgen-2020-2005CrossRef Google Scholar

Howe, K. L., Achuthan, P., Allen, J., Allen, J., Alvarez-Jarreta, J., Amode, M. R., … Flicek, P. (2020). Ensembl 2021. Nucleic Acids Research, 49(D1), D884–D891. doi: 10.1093/nar/gkaa942CrossRef Google Scholar

Johnson, J. S., Cote, A. C., Dobbyn, A., Sloofman, L. G., Xu, J., Cotter, L., … Huckins, L. M. (2022). Mapping anorexia nervosa genes to clinical phenotypes. Psychological Medicine, 1–15. doi: 10.1017/s0033291721004554Google Scholar PubMed

Kawashima, A., Karasawa, T., Tago, K., Kimura, H., Kamata, R., Usui-Kawanishi, F., … Takahashi, M. (2017). ARIH2 Ubiquitinates NLRP3 and negatively regulates NLRP3 inflammasome activation in macrophages. Journal of Immunology, 199(10), 3614–3622. doi: 10.4049/jimmunol.1700184CrossRef Google Scholar PubMed

Kelsall, I. R., Duda, D. M., Olszewski, J. L., Hofmann, K., Knebel, A., Langevin, F., … Alpi, A. F. (2013). TRIAD1 And HHARI bind to and are activated by distinct neddylated cullin-RING ligase complexes. The EMBO Journal, 32(21), 2848–2860. doi: 10.1038/emboj.2013.209CrossRef Google Scholar PubMed

Kong, D., Shen, Y., Liu, G., Zuo, S., Ji, Y., Lu, A., … Yu, Y. (2016). PKA regulatory IIα subunit is essential for PGD2-mediated resolution of inflammation. Journal of Experimental Medicine, 213(10), 2209–2226. doi: 10.1084/jem.20160459CrossRef Google Scholar PubMed

Li, D., Burch, P., Gonzalez, O., Kashork, C. D., Shaffer, L. G., Bachinski, L. L., & Roberts, R. (2000). Molecular cloning, expression analysis, and chromosome mapping of WDR6, a novel human WD-repeat gene. Biochemical and Biophysical Research Communications, 274(1), 117–123. doi: 10.1006/bbrc.2000.3012CrossRef Google Scholar PubMed

Li, D., & Roberts, R. (2001). Human genome and diseases:WD-repeat proteins: Structure characteristics, biological function, and their involvement in human diseases. Cellular and Molecular Life Sciences CMLS, 58(14), 2085–2097. doi: 10.1007/PL00000838CrossRef Google Scholar

Lu, K., Zhao, J., & Liu, W. (2020). Macrophage stimulating 1-induced inflammation response promotes aortic aneurysm formation through triggering endothelial cells death and activating the NF-κB signaling pathway. Journal of Receptors and Signal Transduction, 40(4), 374–382. doi: 10.1080/10799893.2020.1738484CrossRef Google Scholar PubMed

Lv, M. (2022). WD repeat domain 6 as a novelty prognostic biomarker correlates with immune infiltration in lung cancer: A preliminary study. Immunity, Inflammation and Disease, 10(9), e681. doi: 10.1002/iid3.681CrossRef Google Scholar PubMed

Mancuso, N., Freund, M. K., Johnson, R., Shi, H., Kichaev, G., Gusev, A., & Pasaniuc, B. (2019). Probabilistic fine-mapping of transcriptome-wide association studies. Nature genetics, 51(4), 675–682. doi: 10.1038/s41588-019-0367-1CrossRef Google Scholar PubMed

Marteijn, J. A., van Emst, L., Erpelinck-Verschueren, C. A., Nikoloski, G., Menke, A., de Witte, T., … van der Reijden, B. A. (2005). The E3 ubiquitin-protein ligase Triad1 inhibits clonogenic growth of primary myeloid progenitor cells. Blood, 106(13), 4114–4123. doi: 10.1182/blood-2005-04-1450CrossRef Google Scholar PubMed

Nilsson, I. A. K., Millischer, V., Göteson, A., Hübel, C., Thornton, L. M., Bulik, C. M., … Landén, M. (2020). Aberrant inflammatory profile in acute but not recovered anorexia nervosa. Brain, Behavior, and Immunity, 88, 718–724. doi: 10.1016/j.bbi.2020.05.024CrossRef Google Scholar

Pai, A. A., Cain, C. E., Mizrahi-Man, O., De Leon, S., Lewellen, N., Veyrieras, J.-B., … Gilad, Y. (2012). The contribution of RNA decay quantitative trait loci to inter-individual variation in steady-state gene expression levels. PLOS Genetics, 8(10), e1003000. doi: 10.1371/journal.pgen.1003000CrossRef Google Scholar PubMed

Pruim, R. J., Welch, R. P., Sanna, S., Teslovich, T. M., Chines, P. S., Gliedt, T. P., … Willer, C. J. (2010). LocusZoom: Regional visualization of genome-wide association scan results. Bioinformatics (Oxford, England), 26(18), 2336–2337. doi: 10.1093/bioinformatics/btq419Google Scholar PubMed

Reay, W. R., & Cairns, M. J. (2021). Advancing the use of genome-wide association studies for drug repurposing. Nature Reviews Genetics, 22(10), 658–671. doi: 10.1038/s41576-021-00387-zCrossRef Google Scholar PubMed

Reay, W. R., El Shair, S. I., Geaghan, M. P., Riveros, C., Holliday, E. G., McEvoy, M. A., … Cairns, M. J. (2021). Genetic association and causal inference converge on hyperglycaemia as a modifiable factor to improve lung function. Elife, 10, e63115. doi: 10.7554/eLife.63115CrossRef Google Scholar PubMed

Reay, W. R., Kiltschewskij, D. J., Geaghan, M. P., Atkins, J. R., Carr, V. J., Green, M. J., & Cairns, M. J. (2022). Genetic estimates of correlation and causality between blood-based biomarkers and psychiatric disorders. Science Advances, 8(14), eabj8969. doi: 10.1126/sciadv.abj8969CrossRef Google Scholar PubMed

Reimand, J., Kull, M., Peterson, H., Hansen, J., & Vilo, J. (2007). g:Profiler – a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Research, 35(Web Server issue), W193–W200. doi: 10.1093/nar/gkm226CrossRef Google Scholar

Sibeoni, J., Orri, M., Valentin, M., Podlipski, M.-A., Colin, S., Pradere, J., & Revah-Levy, A. (2017). Metasynthesis of the views about treatment of anorexia nervosa in adolescents: Perspectives of adolescents, parents, and professionals. PLOS ONE, 12(1), e0169493. doi: 10.1371/journal.pone.0169493CrossRef Google Scholar PubMed

Sivan, G., Ormanoglu, P., Buehler, E. C., Martin, S. E., & Moss, B. (2015). Identification of restriction factors by human genome-wide RNA interference screening of viral host range mutants exemplified by discovery of SAMD9 and WDR6 as inhibitors of the vaccinia virus K1L-C7L- mutant. mBio, 6(4), e01122. doi: 10.1128/mBio.01122-15CrossRef Google Scholar PubMed

Smith, T. F. (2008). Diversity of WD-repeat proteins. In Clemen, C. S., Eichinger, L. & Rybakin, V. (Eds.), The coronin family of proteins: Subcellular biochemistry (pp. 20–30). New York, NY: Springer New York.CrossRef Google Scholar

Tylee, D. S., Sun, J., Hess, J. L., Tahir, M. A., Sharma, E., Malik, R., … Glatt, S. J. (2018). Genetic correlations among psychiatric and immune-related phenotypes based on genome-wide association data. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 177(7), 641–657. doi: 10.1002/ajmg.b.32652CrossRef Google Scholar PubMed

Veturi, Y., & Ritchie, M. D. (2018). How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures? Pacific Symposium on Biocomputing, 23, 228–239. doi: 10.1142/9789813235533_0021Google Scholar PubMed

Wainberg, M., Sinnott-Armstrong, N., Mancuso, N., Barbeira, A. N., Knowles, D. A., Golan, D., … Kundaje, A. (2019). Opportunities and challenges for transcriptome-wide association studies. Nature Genetics, 51(4), 592–599. doi: 10.1038/s41588-019-0385-zCrossRef Google Scholar PubMed

Wang, Y., Yang, Q., Shen, S., Zhang, L., Xiang, Y., & Weng, X. (2020). Mst1 promotes mitochondrial dysfunction and apoptosis in oxidative stress-induced rheumatoid arthritis synoviocytes. Aging (Albany NY), 12(16), 16211–16223. doi: 10.18632/aging.103643CrossRef Google Scholar PubMed

Watson, H. J., Yilmaz, Z., Thornton, L. M., Hübel, C., Coleman, J. R. I., Gaspar, H. A., … Bulik, C. M. (2019). Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nature Genetics, 51(8), 1207–1214. doi: 10.1038/s41588-019-0439-2CrossRef Google Scholar PubMed

Wingo, A. P., Liu, Y., Gerasimov, E. S., Gockley, J., Logsdon, B. A., Duong, D. M., … Wingo, T. S. (2021). Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer's disease pathogenesis. Nature Genetics, 53(2), 143–146. doi: 10.1038/s41588-020-00773-zCrossRef Google Scholar PubMed

Wonderlich, S. A., Bulik, C. M., Schmidt, U., Steiger, H., & Hoek, H. W. (2020). Severe and enduring anorexia nervosa: Update and observations about the current clinical reality. International Journal of Eating Disorders, 53(8), 1303–1312. doi: 10.1002/eat.23283CrossRef Google Scholar PubMed

Xie, X., Wang, Z., & Chen, Y. (2007). Association of LKB1 with a WD-repeat protein WDR6 is implicated in cell growth arrest and p27(Kip1) induction. Molecular and Cellular Biochemistry, 301(1-2), 115–122. doi: 10.1007/s11010-006-9402-5CrossRef Google Scholar PubMed

Zhang, J., Dutta, D., Köttgen, A., Tin, A., Schlosser, P., Grams, M. E., … Chatterjee, N. (2021). Large Bi-Ethnic Study of Plasma Proteome Leads to Comprehensive Mapping of cis-pQTL and Models for Proteome-wide Association Studies. bioRxiv, 2021.2003.2015.435533. doi: 10.1101/2021.03.15.435533CrossRef Google Scholar

Zhang, J., Sun, L., Li, W., Wang, Y., Li, X., & Liu, Y. (2019). Overexpression of macrophage stimulating 1 enhances the anti-tumor effects of IL-24 in esophageal cancer via inhibiting ERK-Mfn2 signaling-dependent mitophagy. Biomedicine & Pharmacotherapy, 114, 108844. doi: 10.1016/j.biopha.2019.108844CrossRef Google Scholar PubMed

Fig. 1. TWAS associations and region plot of the densely associated AN signal on chromosome 3. a: Heatmap of genes with at least one Bonferroni significant eQTL tissue associated with AN. Red indicates positive z scores; blue indicates negative z scores (legend). Columns indicates genes, rows indicate tissue models. * Indicates nominally significant genes, ** indicates Benjamini-Hochberg significant associations, *** indicates Bonferroni significant associations. Grey squares indicate that a significantly cis-heritable model of imputed expression data was unavailable in that tissue. b: Relative AN gene and SNP locations and significance. Points in the top panel indicate SNPs, legend indicates r2, left side y-axis indicates the negative log transformed p value of SNPs, right side y-axis indicates the recombination rate (cM/Mb). The bottom panel indicates the location of genes relative to the top panel SNPs. Plot generated using ZoomLocus (Pruim et al., 2010) with 200 kb flanking size. The SNP with the most significant p value in this region: rs73082362 is highlighted.

Table 1. Finemapped associations

Adams et al. supplementary material

File 13.7 MB

Article contents

Multiomic prioritisation of risk genes for anorexia nervosa

Abstract

Keywords

Introduction

Materials and methods

Overview of study

Weights for genetically predicted mRNA, protein, and splicing

Implementation of the FUSION pipeline

Conditional analyses and probabilistic finemapping

Investigation of prioritised AN associated genes

Results

Novel AN risk genes uncovered through genetically predicted expression or splicing

Conditional analyses and probabilistic finemapping further refine AN association signals

Functional interrogation of prioritised AN risk genes suggests a role for immune function

Discussion

Supplementary material

Acknowledgements

Author contributions

Conflict of interest

Data availability

References

Adams et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests