Book contents
- Frontmatter
- Dedication
- Contents
- List of contributors
- Forewords
- Preface
- List of abbreviations
- Part I Genome-wide association studies
- Part II Genome-wide studies in disease biology
- Part III Single nucleotide polymorphisms, copy number variants, haplotypes and eQTLs
- 10 Finding SNPs that affect microRNA regulation in disease-associated genomic regions
- 11 From linkage to complex associations: the role of GABRA2 as a risk factor for alcohol use
- 12 Copy number variation in monozygous twins
- 13 Haplotypes of CpG-related SNPs and associations with DNA methylation patterns
- 14 eQTL mapping
- Part IV Next-generation sequencing technology and pharmaco-genomics
- Part V Population genetics and personalized medicine
- Index
- Plate section
- References
14 - eQTL mapping
from Part III - Single nucleotide polymorphisms, copy number variants, haplotypes and eQTLs
Published online by Cambridge University Press: 18 December 2015
- Frontmatter
- Dedication
- Contents
- List of contributors
- Forewords
- Preface
- List of abbreviations
- Part I Genome-wide association studies
- Part II Genome-wide studies in disease biology
- Part III Single nucleotide polymorphisms, copy number variants, haplotypes and eQTLs
- 10 Finding SNPs that affect microRNA regulation in disease-associated genomic regions
- 11 From linkage to complex associations: the role of GABRA2 as a risk factor for alcohol use
- 12 Copy number variation in monozygous twins
- 13 Haplotypes of CpG-related SNPs and associations with DNA methylation patterns
- 14 eQTL mapping
- Part IV Next-generation sequencing technology and pharmaco-genomics
- Part V Population genetics and personalized medicine
- Index
- Plate section
- References
Summary
Introduction
With an influx of successful genome-wide association studies to identify genetic variations associated with complex diseases, an unprecedented wealth of knowledge has been accumulated for SNP–phenotype associations (McCarthy et al., 2008; Witte 2010; Manolio 2013). However, many SNP–disease associations do not lend themselves to molecular interpretations, because many of the identified loci are located outside of the coding regions. Even when a gene can be inferred to be causal, there is often a significant gap towards the understanding of the underlying molecular mechanisms (Schadt et al., 2005; McCarthy et al., 2008). Genome-wide eQTL mapping has been one effective approach to bridge this gap (Mackay et al., 2009). In eQTL studies, gene expression levels measured by high-throughput technologies, such as microarrays and RNA-Seq, are treated as quantitative traits. Marker genotypes are also collected from the same set of individuals, and statistical analyses are performed to detect associations between markers and expression traits. By simultaneously capturing many regulatory interactions, eQTLs offer valuable insights on the genetic architecture of expression regulation (Rockman and Kruglyak 2006). The ultimate goal of eQTL studies is to elucidate how genetic variations affect phenotypes by using gene expression levels as intermediate molecular phenotypes (Nica and Dermitzakis 2008). In this chapter, we provide an overview of the eQTL analysis workflow (Figure 14.1), introduce publicly available tools for analysis, and further discuss challenges and issues.
Data pre-processing
Genome-wide eQTL mapping considers high-density SNP genotype data and gene expression data from the same individuals in a segregating population. Both require appropriate pre-processing as described below for subsequent analysis.
Genotype data
Three quality control (QC) criteria are often used in the pre-processing of the genotype data. (1) Missing rate: individuals with a large proportion of missing SNP genotypes (e.g., 10%) should be excluded because the DNA samples of those individuals may be of poor quality. SNPs with a large missing rate (e.g., 5%) should also be filtered out. (2) Hardy–Weinberg Equilibrium (HWE): statistically significant deviations from HWE often result from genotyping errors. Therefore, SNPs that fail an exact HWE test (e.g., a P-value less than 0.001) should be filtered out. The criterion does not apply to haploid organisms, such as yeast. (3) Minor allele frequency (MAF): SNPs with low MAF (e.g., 0.05) are sometimes filtered out because of the insufficient statistical power for studies with a relatively small sample size and potentially higher genotype calling error.
- Type
- Chapter
- Information
- Genome-Wide Association StudiesFrom Polymorphism to Personalized Medicine, pp. 208 - 228Publisher: Cambridge University PressPrint publication year: 2016