Book contents
- Frontmatter
- Contents
- Preface
- 1 Introduction
- 2 Theory of Tests, p-Values, and Confidence Intervals
- 3 From Scientific Theory to Statistical Hypothesis Test
- 4 One-Sample Studies with Binary Responses
- 5 One-Sample Studies with Ordinal or Numeric Responses
- 6 Paired Data
- 7 Two-Sample Studies with Binary Responses
- 8 Assumptions and Hypothesis Tests
- 9 Two-Sample Studies with Ordinal or Numeric Responses
- 10 General Methods for Frequentist Inferences
- 11 k-Sample Studies and Trend Tests
- 12 Clustering and Stratification
- 13 Multiplicity in Testing
- 14 Testing from Models
- 15 Causality
- 16 Censoring
- 17 Missing Data
- 18 Group Sequential and Related Adaptive Methods
- 19 Testing Fit, Equivalence, and Noninferiority
- 20 Power and Sample Size
- 21 Bayesian Hypothesis Testing
- References
- Notation Index
- Concept Index
2 - Theory of Tests, p-Values, and Confidence Intervals
Published online by Cambridge University Press: 17 April 2022
- Frontmatter
- Contents
- Preface
- 1 Introduction
- 2 Theory of Tests, p-Values, and Confidence Intervals
- 3 From Scientific Theory to Statistical Hypothesis Test
- 4 One-Sample Studies with Binary Responses
- 5 One-Sample Studies with Ordinal or Numeric Responses
- 6 Paired Data
- 7 Two-Sample Studies with Binary Responses
- 8 Assumptions and Hypothesis Tests
- 9 Two-Sample Studies with Ordinal or Numeric Responses
- 10 General Methods for Frequentist Inferences
- 11 k-Sample Studies and Trend Tests
- 12 Clustering and Stratification
- 13 Multiplicity in Testing
- 14 Testing from Models
- 15 Causality
- 16 Censoring
- 17 Missing Data
- 18 Group Sequential and Related Adaptive Methods
- 19 Testing Fit, Equivalence, and Noninferiority
- 20 Power and Sample Size
- 21 Bayesian Hypothesis Testing
- References
- Notation Index
- Concept Index
Summary
This chapter defines statistical hypothesis tests mathematically. Those tests assume two sets of probability models, called the null and alternative hypotheses. A decision rule is a function that depends on the data and a specified ?-level and determines whether or not to reject the null hypothesis. We define concepts related to properties of hypothesis tests such as Type I and II error rates, validity, size, power, invariance, and robustness. The definitions are general but are explained with examples such as testing a binomial parameter, or Wilcoxon–Mann–Whitney tests. P-values are defined as the smallest ?-level for observed data for which we would reject the null at that level and all larger levels. Confidence sets and confidence intervals are defined in relation to a series of hypothesis tests with changing null hypotheses. Compatibility between p-value functions and confidence intervals is defined, and an example with Fisher’s exact test shows that compatibility is not always present for some common tests.
- Type
- Chapter
- Information
- Statistical Hypothesis Testing in ContextReproducibility, Inference, and Science, pp. 8 - 22Publisher: Cambridge University PressPrint publication year: 2022
- 1
- Cited by