Book contents
- Frontmatter
- Contents
- List of Contributors
- Preface
- 1 An Introduction to High-Throughput Bioinformatics Data
- 2 Hierarchical Mixture Models for Expression Profiles
- 3 Bayesian Hierarchical Models for Inference in Microarray Data
- 4 Bayesian Process-Based Modeling of Two-Channel Microarray Experiments: Estimating Absolute mRNA Concentrations
- 5 Identification of Biomarkers in Classification and Clustering of High-Throughput Data
- 6 Modeling Nonlinear Gene Interactions Using Bayesian MARS
- 7 Models for Probability of Under- and Overexpression: The POE Scale
- 8 Sparse Statistical Modelling in Gene Expression Genomics
- 9 Bayesian Analysis of Cell Cycle Gene Expression Data
- 10 Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model
- 11 Interval Mapping for Expression Quantitative Trait Loci
- 12 Bayesian Mixture Models for Gene Expression and Protein Profiles
- 13 Shrinkage Estimation for SAGE Data Using a Mixture Dirichlet Prior
- 14 Analysis of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models
- 15 Nonparametric Models for Proteomic Peak Identification and Quantification
- 16 Bayesian Modeling and Inference for Sequence Motif Discovery
- 17 Identification of DNA Regulatory Motifs and Regulators by Integrating Gene Expression and Sequence Data
- 18 A Misclassification Model for Inferring Transcriptional Regulatory Networks
- 19 Estimating Cellular Signaling from Transcription Data
- 20 Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data
- 21 Bayesian Networks and Informative Priors: Transcriptional Regulatory Network Models
- 22 Sample Size Choice for Microarray Experiments
- Plate section
22 - Sample Size Choice for Microarray Experiments
Published online by Cambridge University Press: 23 November 2009
- Frontmatter
- Contents
- List of Contributors
- Preface
- 1 An Introduction to High-Throughput Bioinformatics Data
- 2 Hierarchical Mixture Models for Expression Profiles
- 3 Bayesian Hierarchical Models for Inference in Microarray Data
- 4 Bayesian Process-Based Modeling of Two-Channel Microarray Experiments: Estimating Absolute mRNA Concentrations
- 5 Identification of Biomarkers in Classification and Clustering of High-Throughput Data
- 6 Modeling Nonlinear Gene Interactions Using Bayesian MARS
- 7 Models for Probability of Under- and Overexpression: The POE Scale
- 8 Sparse Statistical Modelling in Gene Expression Genomics
- 9 Bayesian Analysis of Cell Cycle Gene Expression Data
- 10 Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model
- 11 Interval Mapping for Expression Quantitative Trait Loci
- 12 Bayesian Mixture Models for Gene Expression and Protein Profiles
- 13 Shrinkage Estimation for SAGE Data Using a Mixture Dirichlet Prior
- 14 Analysis of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models
- 15 Nonparametric Models for Proteomic Peak Identification and Quantification
- 16 Bayesian Modeling and Inference for Sequence Motif Discovery
- 17 Identification of DNA Regulatory Motifs and Regulators by Integrating Gene Expression and Sequence Data
- 18 A Misclassification Model for Inferring Transcriptional Regulatory Networks
- 19 Estimating Cellular Signaling from Transcription Data
- 20 Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data
- 21 Bayesian Networks and Informative Priors: Transcriptional Regulatory Network Models
- 22 Sample Size Choice for Microarray Experiments
- Plate section
Summary
Abstract
We review Bayesian sample size arguments for microarray experiments, focusing on a decision theoretic approach. We start by introducing a choice based on minimizing expected loss as theoretical ideal. Practical limitations of this approach quickly lead us to consider a compromise solution that combines this idealized solution with a sensitivity argument. The finally proposed approach relies on conditional expected loss, conditional on an assumed true level of differential expression to be discovered. The expression for expected loss can be interpreted as a version of power, thus providing for ease of interpretation and communication
Introduction
We discuss approaches for a Bayesian sample size argument in microarray experiments. As is the case for most sample size calculations in clinical trials and other biomedical applications the nature of the sample size calculation is to provide the investigator with decision support, and allow an informed sample size choice, rather than providing a black-box method to deliver an optimal sample size.
Several classical approaches for microarray sample size choices have been proposed in the recent literature. Pan et al. (2002) develop a traditional power argument, using a finite mixture of normal sampling model for difference scores in a group comparison microarray experiment. Zien et al. (2002) propose to plot ROC-type curves to show achievable combinations of false-negative and falsepositive rates. Mukherjee et al. (2003) use a machine learning perspective. They consider a parametric learning curve for the empirical error rate as a function of the sample size, and proceed to estimate the unknown parameters in the learning curve.
- Type
- Chapter
- Information
- Bayesian Inference for Gene Expression and Proteomics , pp. 425 - 437Publisher: Cambridge University PressPrint publication year: 2006