Book contents
- Frontmatter
- Contents
- List of Contributors
- Preface
- 1 An Introduction to High-Throughput Bioinformatics Data
- 2 Hierarchical Mixture Models for Expression Profiles
- 3 Bayesian Hierarchical Models for Inference in Microarray Data
- 4 Bayesian Process-Based Modeling of Two-Channel Microarray Experiments: Estimating Absolute mRNA Concentrations
- 5 Identification of Biomarkers in Classification and Clustering of High-Throughput Data
- 6 Modeling Nonlinear Gene Interactions Using Bayesian MARS
- 7 Models for Probability of Under- and Overexpression: The POE Scale
- 8 Sparse Statistical Modelling in Gene Expression Genomics
- 9 Bayesian Analysis of Cell Cycle Gene Expression Data
- 10 Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model
- 11 Interval Mapping for Expression Quantitative Trait Loci
- 12 Bayesian Mixture Models for Gene Expression and Protein Profiles
- 13 Shrinkage Estimation for SAGE Data Using a Mixture Dirichlet Prior
- 14 Analysis of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models
- 15 Nonparametric Models for Proteomic Peak Identification and Quantification
- 16 Bayesian Modeling and Inference for Sequence Motif Discovery
- 17 Identification of DNA Regulatory Motifs and Regulators by Integrating Gene Expression and Sequence Data
- 18 A Misclassification Model for Inferring Transcriptional Regulatory Networks
- 19 Estimating Cellular Signaling from Transcription Data
- 20 Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data
- 21 Bayesian Networks and Informative Priors: Transcriptional Regulatory Network Models
- 22 Sample Size Choice for Microarray Experiments
- Plate section
12 - Bayesian Mixture Models for Gene Expression and Protein Profiles
Published online by Cambridge University Press: 23 November 2009
- Frontmatter
- Contents
- List of Contributors
- Preface
- 1 An Introduction to High-Throughput Bioinformatics Data
- 2 Hierarchical Mixture Models for Expression Profiles
- 3 Bayesian Hierarchical Models for Inference in Microarray Data
- 4 Bayesian Process-Based Modeling of Two-Channel Microarray Experiments: Estimating Absolute mRNA Concentrations
- 5 Identification of Biomarkers in Classification and Clustering of High-Throughput Data
- 6 Modeling Nonlinear Gene Interactions Using Bayesian MARS
- 7 Models for Probability of Under- and Overexpression: The POE Scale
- 8 Sparse Statistical Modelling in Gene Expression Genomics
- 9 Bayesian Analysis of Cell Cycle Gene Expression Data
- 10 Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model
- 11 Interval Mapping for Expression Quantitative Trait Loci
- 12 Bayesian Mixture Models for Gene Expression and Protein Profiles
- 13 Shrinkage Estimation for SAGE Data Using a Mixture Dirichlet Prior
- 14 Analysis of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models
- 15 Nonparametric Models for Proteomic Peak Identification and Quantification
- 16 Bayesian Modeling and Inference for Sequence Motif Discovery
- 17 Identification of DNA Regulatory Motifs and Regulators by Integrating Gene Expression and Sequence Data
- 18 A Misclassification Model for Inferring Transcriptional Regulatory Networks
- 19 Estimating Cellular Signaling from Transcription Data
- 20 Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data
- 21 Bayesian Networks and Informative Priors: Transcriptional Regulatory Network Models
- 22 Sample Size Choice for Microarray Experiments
- Plate section
Summary
Abstract
We review the use of semiparametric mixture models for Bayesian inference in high-throughput genomic data. We discuss three specific approaches for microarray data, for protein mass spectrometry experiments, and for serial analysis of gene expression (SAGE) data. For the microarray data and the protein mass spectrometry we assume group comparison experiments, that is, experiments that seek to identify genes and proteins that are differentially expressed across two biologic conditions of interest. For the SAGE data example we consider inference for a single biologic sample. For all three applications we use flexible mixture models to implement inference. For the microarray data we define a Dirichlet process mixture of normal model. For the mass spectrometry data we introduce a mixture of Beta model. The proposed inference for SAGE data is based on a semiparametric mixture of Poisson distributions.
Introduction
We discuss semiparametric Bayesian data analysis for high-throughput genomic data. We introduce suitable semiparametric mixture models to implement inference for microarray data, mass spectrometry data, and SAGE data. The proposed models include a Dirichlet process mixture of normals for microarray data, a mixture of Beta distributions with a random number of terms for mass spectrometry data, and a Dirichlet process mixture of Poisson model for SAGE data. For the microarray data and the protein mass spectrometry data we consider experiments that compare two biologic conditions of interest. We assume that the aim of the experiment is to find genes and proteins, respectively, that are differentially expressed under the two conditions. For the SAGE example, we propose data analysis for a single biologic sample.
- Type
- Chapter
- Information
- Bayesian Inference for Gene Expression and Proteomics , pp. 238 - 253Publisher: Cambridge University PressPrint publication year: 2006
- 4
- Cited by