Skip to main content Accessibility help
×
Hostname: page-component-78c5997874-mlc7c Total loading time: 0 Render date: 2024-11-09T22:09:08.223Z Has data issue: false hasContentIssue false

5 - Integrative Analysis of Multiple ChIP-X Data Sets Using Correlation Motifs

from Part A - Horizontal Meta-Analysis

Published online by Cambridge University Press:  05 September 2015

George Tseng
Affiliation:
University of Pittsburgh
Debashis Ghosh
Affiliation:
Pennsylvania State University
Xianghong Jasmine Zhou
Affiliation:
University of Southern California
Hongkai Ji
Affiliation:
Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
Yingying Wei
Affiliation:
The Chinese University of Hong Kong, Hong Kong
Get access

Summary

Abstract

Genome-wide chromatin immunoprecipitation experiments including ChIP-seq and ChIP-chip, jointly referred to as ChIP-X, are high-throughput technologies to map protein-DNA interactions in the genome. When multiple related ChIP-X data sets are available, separately analyzing each data set is not optimal because it may lack power to detect consistent but relatively weak signals in multiple studies. Jointly analyzing all data sets may allow one to borrow information across studies to improve signal detection. However, a common problem in data integration is the difficulty in handling data set–specific signals that cannot be dealt with by simply assuming that the signal status for each genomic locus is the same across all studies. An integration model that naively enumerates all possible study specificity patterns, conversely, has exponential complexity because there are 2D possible combinatorial signal presence and absence patterns for D studies. Correlation motifs provide a useful solution to this problem. By introducing a small number of latent probability vectors called correlation motifs, this approach can describe the major correlation structure among multiple data sets, which can then be used to guide information sharing across data sets. The correlation motif approach is capable of improving signal detection. At the same time, it does not have the problem of exponential model complexity and is flexible enough to handle all possible data set–specific signal configurations.

Introduction

Chromatin immunoprecipitation (ChIP) coupled with high-throughput sequencing (ChIP-seq) (Johnson et al., 2007; Barski et al., 2007) and ChIP coupled with genome tiling array hybridization (ChIP-chip) (Ren et al., 2000; Cawley et al., 2004) are two high-throughput technologies capable of mapping protein-DNA interactions (PDIs) genome-wide. These two technologies, collectively referred to as “ChIP-X,” have been widely used to produce information on transcription factor binding sites (TFBSs) (Johnson et al., 2007; Robertson et al., 2007), histone modifications (HMs) (Barski et al., 2007; Mikkelsen et al., 2007), nucleosome positioning (Schmid and Bucher, 2007), allele-specific PDIs (Mikkelsen et al., 2007; Rozowsky et al., 2011), variations of PDIs among the population (McDaniell et al., 2010; Kasowski et al., 2010), and the evolution of gene regulation (Schmidt et al., 2010). Today, ChIP-X has become an indispensable tool used by both individual investigators and large consortium projects such as the Encyclopedia of DNA Elements (ENCODE), modENCODE, and Roadmap Epigenomics consortia to annotate the human genome and epigenome (Consortium, 2012; Celniker et al., 2009; Bernstein et al., 2010).

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2015

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×