Large-sample theory establishes the asymptotic normality of the maximum likelihood estimator of the person parameter in the two-parameter logistic (2PL) model. In short tests, however, the assumption of normality can be grossly wrong. As a consequence, intended coverage rates may be exceeded and confidence intervals turn out to be overly conservative. Methods from higher-order asymptotic theory, more specifically saddlepoint approximations, are a convenient way to deal with such small-sample problems. Confidence bounds obtained by these means hold the approximate confidence level for a broad range of the person parameter. Moreover, an approximation to the exact distribution makes it possible to compute median unbiased estimates (MUE) that are as likely to overestimate as to underestimate the true person parameter. Additionally, in small samples, these MUE are less mean-biased than the often-used maximum likelihood estimator.
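As a rough illustration of the large-sample interval that the saddlepoint-based bounds above are meant to improve upon, the sketch below (hypothetical item parameters and response pattern) computes the ML estimate of the person parameter for a short 2PL test together with its normal-approximation (Wald) confidence interval; it is this interval whose small-sample coverage can be off.

```python
# Minimal sketch, assuming hypothetical item parameters and responses: ML estimate
# of theta in the 2PL model and the standard normal-approximation (Wald) CI based
# on the Fisher information. Not the saddlepoint method from the abstract.
import numpy as np
from scipy.optimize import minimize_scalar

a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])    # hypothetical discriminations
b = np.array([-1.0, -0.3, 0.2, 0.8, 1.5])  # hypothetical difficulties
x = np.array([1, 1, 0, 1, 0])              # hypothetical responses on a short test

def neg_loglik(theta):
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))  # 2PL response probabilities
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

theta_hat = minimize_scalar(neg_loglik, bounds=(-6, 6), method="bounded").x
p_hat = 1.0 / (1.0 + np.exp(-a * (theta_hat - b)))
info = np.sum(a**2 * p_hat * (1 - p_hat))       # Fisher information at theta_hat
se = 1.0 / np.sqrt(info)
wald_ci = (theta_hat - 1.96 * se, theta_hat + 1.96 * se)
print(theta_hat, wald_ci)  # with only 5 items this normal approximation can be poor
```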
In the context of conditional maximum likelihood (CML) estimation, confidence intervals can be interpreted in three different ways, depending on the sampling distribution under which these confidence intervals contain the true parameter value with a certain probability. These sampling distributions are (a) the distribution of the data given the incidental parameters, (b) the marginal distribution of the data (i.e., with the incidental parameters integrated out), and (c) the conditional distribution of the data given the sufficient statistics for the incidental parameters. Results on the asymptotic distribution of CML estimates under sampling scheme (c) can be used to construct asymptotic confidence intervals using only the CML estimates. This is not possible for the results on the asymptotic distribution under sampling schemes (a) and (b). However, it is shown that the conditional asymptotic confidence intervals are also valid under the other two sampling schemes.
The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter θ is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the nominal confidence level in many cases, and the corresponding intervals are therefore no longer confidence intervals in the strict sense of the definition. In the present work, confidence intervals are defined more precisely by exploiting the relationship between confidence intervals and hypothesis tests. Two approaches to confidence interval construction are explored that are optimal with respect to criteria of smallness and consistency with the standard approach.
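The duality this construction rests on can be stated in generic notation (not the paper's): a 1−α confidence set collects exactly those parameter values that a level-α test would not reject,

\[
C_{1-\alpha}(x) \;=\; \{\theta_0 : \text{the level-}\alpha \text{ test of } H_0\!:\theta=\theta_0 \text{ does not reject the observed data } x\},
\]

and whenever every such test has size at most α, the set \(C_{1-\alpha}(X)\) covers the true θ with probability at least 1−α.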
Yuan and Chan (Psychometrika, 76, 670–690, 2011) recently showed how to compute the covariance matrix of standardized regression coefficients from covariances. In this paper, we describe a method for computing this covariance matrix from correlations. Next, we describe an asymptotic distribution-free (ADF; Browne in British Journal of Mathematical and Statistical Psychology, 37, 62–83, 1984) method for computing the covariance matrix of standardized regression coefficients. We show that the ADF method works well with nonnormal data in moderate-to-large samples using both simulated and real-data examples. R code (R Development Core Team, 2012) is available from the authors or through the Psychometrika online repository for supplementary materials.
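For orientation, the quantity whose covariance matrix the paper studies can be computed directly from a correlation matrix, as in the minimal sketch below (hypothetical correlations); the ADF covariance matrix itself (Browne, 1984) is not reproduced here.

```python
# Standardized regression coefficients from a correlation matrix:
# beta_std = Rxx^{-1} rxy. Hypothetical correlations for illustration only.
import numpy as np

# rows/cols are (x1, x2, y)
R = np.array([[1.00, 0.30, 0.50],
              [0.30, 1.00, 0.40],
              [0.50, 0.40, 1.00]])

Rxx = R[:2, :2]          # correlations among predictors
rxy = R[:2, 2]           # predictor-outcome correlations
beta_std = np.linalg.solve(Rxx, rxy)
print(beta_std)          # standardized regression coefficients
```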
The use of U-statistics based on rank correlation coefficients to estimate the strength of concordance among a group of rankers is examined for cases where the null hypothesis of random rankings is not tenable. The studentized U-statistic is asymptotically distribution-free, and the Student-t approximation is used for small and moderately sized samples. An approximate confidence interval is constructed for the strength of concordance. Monte Carlo results indicate that the Student-t approximation can be improved by estimating the degrees of freedom.
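A rough illustration of the same idea, with hypothetical rankings: concordance is estimated as the average pairwise Spearman correlation among rankers, and a Student-t interval is formed from a leave-one-ranker-out jackknife standard error. This is a stand-in sketch, not the studentized U-statistic or the estimated degrees of freedom from the paper.

```python
# Mean pairwise Spearman correlation among rankers, jackknife SE, and a
# Student-t interval with m-1 degrees of freedom as a simple reference.
import numpy as np
from itertools import combinations
from scipy.stats import spearmanr, t

rankings = np.array([[1, 2, 3, 4, 5, 6],   # hypothetical rankings of 6 objects
                     [2, 1, 3, 5, 4, 6],   # by 4 rankers
                     [1, 3, 2, 4, 6, 5],
                     [2, 1, 4, 3, 5, 6]])

def mean_pairwise_rho(R):
    return np.mean([spearmanr(R[i], R[j])[0]
                    for i, j in combinations(range(len(R)), 2)])

est = mean_pairwise_rho(rankings)
m = len(rankings)
loo = np.array([mean_pairwise_rho(np.delete(rankings, i, axis=0)) for i in range(m)])
se = np.sqrt((m - 1) / m * np.sum((loo - loo.mean())**2))   # jackknife SE
ci = (est - t.ppf(0.975, m - 1) * se, est + t.ppf(0.975, m - 1) * se)
print(est, ci)
```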
When the raters participating in a reliability study are a random sample from a larger population of raters, inferences about the intraclass correlation coefficient must be based on the three mean squares from the analysis of variance table summarizing the results: between subjects, between raters, and error. An approximate confidence interval for the parameter is presented as a function of these three mean squares.
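For reference, the three mean squares combine into the usual point estimate of the intraclass correlation under the two-way random-effects model (the Shrout–Fleiss ICC(2,1) form), as sketched below with hypothetical values; the paper's approximate confidence bounds are functions of these same three quantities but are not reproduced here.

```python
# ICC point estimate from the three ANOVA mean squares (two-way random-effects,
# single-rater agreement). Inputs are hypothetical.
def icc_2_1(ms_subjects, ms_raters, ms_error, n_subjects, n_raters):
    num = ms_subjects - ms_error
    den = (ms_subjects + (n_raters - 1) * ms_error
           + n_raters * (ms_raters - ms_error) / n_subjects)
    return num / den

print(icc_2_1(ms_subjects=11.24, ms_raters=4.67, ms_error=1.02,
              n_subjects=20, n_raters=4))
```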
Structural equation models (SEM) are widely used for modeling complex multivariate relationships among measured and latent variables. Although several analytical approaches to interval estimation in SEM have been developed, a comprehensive review of these methods has been lacking. We review the popular Wald-type and lesser-known likelihood-based methods in linear SEM, emphasizing profile likelihood-based confidence intervals (CIs). Existing algorithms for computing profile likelihood-based CIs are described, including two newer algorithms which are extended to construct profile likelihood-based confidence regions (CRs). Finally, we illustrate the use of these CIs and CRs with two empirical examples, and provide practical recommendations on when to use Wald-type CIs and CRs versus profile likelihood-based CIs and CRs. OpenMx example code is provided in an Online Appendix for constructing profile likelihood-based CIs and CRs for SEM.
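The profile-likelihood principle behind these CIs can be illustrated outside SEM. The toy sketch below (simulated data, not the OpenMx code from the appendix) profiles out the variance of a normal sample and collects all means whose likelihood-ratio statistic stays below the chi-square(1) critical value.

```python
# Toy profile-likelihood CI for the mean of a normal sample with the variance
# profiled out; endpoints found where the LR statistic hits the chi2(1) cutoff.
import numpy as np
from scipy.optimize import brentq
from scipy.stats import chi2

rng = np.random.default_rng(1)
x = rng.normal(loc=2.0, scale=1.5, size=30)
n = len(x)

def profile_loglik(mu):
    s2 = np.mean((x - mu) ** 2)          # sigma^2 maximised for this fixed mu
    return -0.5 * n * np.log(s2)         # up to an additive constant

mu_hat = x.mean()
crit = chi2.ppf(0.95, df=1)

def lr_minus_crit(mu):
    return 2.0 * (profile_loglik(mu_hat) - profile_loglik(mu)) - crit

lower = brentq(lr_minus_crit, mu_hat - 10 * x.std(), mu_hat)
upper = brentq(lr_minus_crit, mu_hat, mu_hat + 10 * x.std())
print(mu_hat, (lower, upper))
```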
If the results of a study reveal an interesting association between an exposure and a health outcome, there is a natural tendency to assume that it is real. (Note: we are considering whether two things are associated. This does not imply that one causes the other to occur.) However, before we can even contemplate this possibility we have to try to rule out other possible explanations for the results. There are three main ‘alternative explanations’ that we have to consider whenever we analyse epidemiological data or read the reports of others, whatever the study design; namely, could the results be due to chance, bias or error, or confounding? We discuss the first of these, chance, in this chapter and cover bias and confounding in Chapters 7 and 8, respectively.
Depression is highly prevalent in haemodialysis patients, and diet might play an important role. We therefore conducted this cross-sectional study to determine the association between dietary fatty acid (FA) consumption and the prevalence of depression in maintenance haemodialysis (MHD) patients. Dietary intake was assessed using a validated FFQ between December 2021 and January 2022. Daily intake of dietary FA was categorised into tertiles, with the lowest tertile used as the reference category. Depression was assessed using the Patient Health Questionnaire-9. Logistic regression and restricted cubic spline (RCS) models were applied to assess the relationship between dietary FA intake and the prevalence of depression. After adjustment for potential confounders, a higher intake of total FA (odds ratio (OR) for T3 vs. T1 = 1·59; 95 % confidence interval (CI) 1·04, 2·46) and of saturated fatty acids (SFA) (OR for T3 vs. T1 = 1·83; 95 % CI 1·19, 2·84) was associated with a higher prevalence of depressive symptoms. Significant positive linear trends were also observed (P < 0·05), except for SFA intake. Similarly, the prevalence of depression in MHD patients increased by 20 % (OR = 1·20; 95 % CI 1·01, 1·43) for each standard deviation increment in SFA intake. RCS analysis indicated an inverse U-shaped association between SFA intake and depression (P for nonlinearity > 0·05). Additionally, the sensitivity analyses produced similar results. Furthermore, no statistically significant association was observed in the subgroup analyses with a significant interaction. In conclusion, higher total dietary FA and SFA intakes were positively associated with depressive symptoms among MHD patients. These findings inform future research exploring the potential mechanisms underlying the association between dietary FA and depressive symptoms in MHD patients.
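As a reminder of where such figures come from (generic notation, not the study's actual computations), an adjusted odds ratio and its 95 % confidence limits follow from the fitted logistic-regression coefficient \(\hat\beta\) for the exposure contrast (e.g. highest vs. lowest tertile, or one standard deviation of SFA intake):

\[
\widehat{\mathrm{OR}} = e^{\hat\beta}, \qquad
95\,\%\ \mathrm{CI} = \left(e^{\hat\beta - 1.96\,\widehat{\mathrm{SE}}(\hat\beta)},\; e^{\hat\beta + 1.96\,\widehat{\mathrm{SE}}(\hat\beta)}\right).
\]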
This rather long chapter constitutes part of the hike in our walk/hike/stroll set-up. We introduce the reader to the basics of stochastics (representing both probability and statistics) necessary for the more technical discussions on risk later. The path followed starts from the probability space (a theoretical concept we quickly leave aside); we then move to the notion of a random variable and its distribution function, including the most important discrete as well as continuous examples. Historical as well as pedagogical examples are included throughout to support the understanding of the new concepts introduced. These examples often show that there is more to randomness than meets the eye. For the applications discussed later, we will measure statistical uncertainty through the concept of confidence intervals. These can be based either on some asymptotic theory involving the famous bell curve, the normal distribution, or on some form of resampling known under the name of bootstrapping. Further, we add some tools that are very important for measuring and communicating risk; these include the concepts of return periods and quantile functions.
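The two routes to a confidence interval mentioned here can be contrasted in a few lines; the sketch below (simulated, hypothetical data) computes both the asymptotic normal-theory interval for a mean and a bootstrap percentile interval from resampling.

```python
# Normal-theory ("bell curve") CI versus bootstrap percentile CI for a mean,
# on a deliberately skewed simulated sample.
import numpy as np

rng = np.random.default_rng(42)
x = rng.exponential(scale=2.0, size=50)    # hypothetical, skewed sample
n = len(x)

# asymptotic normal-theory interval
se = x.std(ddof=1) / np.sqrt(n)
normal_ci = (x.mean() - 1.96 * se, x.mean() + 1.96 * se)

# bootstrap percentile interval
boot_means = np.array([rng.choice(x, size=n, replace=True).mean()
                       for _ in range(5000)])
boot_ci = tuple(np.percentile(boot_means, [2.5, 97.5]))

print(normal_ci, boot_ci)
```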
Chapter 10 covers inferences involving the mean of a single population when σ is known and includes the following specific topics, among others: estimating the population mean, interval estimation, confidence intervals, hypothesis testing and interval estimation, effect size, type II error, and power.
Chapter 15 covers correlation and simple regression as inferential techniques and includes the following specific topics, among others: bivariate normal distribution, statistical significance test of correlation, confidence intervals, statistical significance of the b weight, fit of the overall regression equation, R and R-squared, adjusted R-squared, regression diagnostics, residual plots, influential observations, discrepancy, leverage, influence, and power analysis.
Chapter 16 covers an introduction to multiple regression and includes the following specific topics, among others: confidence intervals, statistical significance of the b weight, fit of the overall regression equation, R and R-squared, adjusted R-squared, semipartial correlation, partial slope, confounding, and statistical control.
Three experiments (N = 550) examined the effect of an interval construction elicitation method used in several expert elicitation studies on judgment accuracy. Participants made judgments about topics that were either searchable or unsearchable online using one of two order variations of the interval construction procedure. One group of participants provided their best judgment (one step) prior to constructing an interval (i.e., lower bound, upper bound, and a confidence rating that the correct value fell in the range provided), whereas another group of participants provided their best judgment last, after the three-step confidence interval was constructed. The overall effect of this elicitation method was not significant in 8 out of 9 univariate tests. Moreover, the calibration of confidence intervals was not affected by elicitation order. The findings warrant skepticism regarding the benefit of prior confidence interval construction for improving judgment accuracy.
We show how to elicit the beliefs of an expert in the form of a “most likely interval”, a set of future outcomes that are deemed more likely than any other outcome. Our method, called the Most Likely Interval elicitation rule (MLI), asks the expert for an interval and pays according to how well the answer compares to the actual outcome. We show that the MLI performs well in economic experiments, and satisfies a number of desirable theoretical properties such as robustness to the risk preferences of the expert.
We present a novel method of judgment analysis called Error Parsing, based upon an alternative method of implementing Social Judgment Theory (SJT). SJT and Error Parsing both posit the same three components of error in human judgment: error due to noise, error due to cue weighting, and error due to inconsistency. In that sense, the broad theory and framework are the same. However, SJT and Error Parsing were developed to answer different questions, and thus use different methodological approaches in the analysis of error. While SJT makes use of correlational methods, Error Parsing uses absolute differences. We discuss the similarities and differences between the methodologies and provide empirical evidence for the utility of the Error Parsing technique.
Keywords: Social Judgment Theory, judgment, error.
The bootComb R package allows researchers to derive confidence intervals with correct target coverage for arbitrary combinations of arbitrary numbers of independently estimated parameters. Previous versions (<1.1.0) of bootComb used independent bootstrap sampling and required the parameters themselves to be independent, an unrealistic assumption in some real-world applications.
Findings
Using Gaussian copulas to define the dependence between parameters, the bootComb package has been extended to allow for dependent parameters.
Implications
The updated bootComb package can now handle cases of dependent parameters, with users specifying a correlation matrix defining the dependence structure. While in practice it may be difficult to know the exact dependence structure between parameters, bootComb allows running sensitivity analyses to assess the impact of parameter dependence on the resulting confidence interval for the combined parameter.
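The Gaussian-copula idea can be sketched without relying on the bootComb API itself: draw correlated standard normals, map them through the normal CDF to correlated uniforms, then through each parameter's marginal quantile function, and read off percentile limits for the combined parameter. Everything in the sketch below (the Beta marginals, the correlation, the product combination) is a hypothetical example, not bootComb code.

```python
# Gaussian-copula combination of two dependent parameter estimates:
# correlated normals -> correlated uniforms -> each parameter's marginal,
# then a percentile interval for the combined (here multiplied) parameter.
import numpy as np
from scipy.stats import norm, beta

rng = np.random.default_rng(0)
rho = 0.5                                    # assumed dependence between parameters
corr = np.array([[1.0, rho], [rho, 1.0]])

z = rng.multivariate_normal(mean=[0.0, 0.0], cov=corr, size=100_000)
u = norm.cdf(z)                              # correlated uniforms via the Gaussian copula
p1 = beta.ppf(u[:, 0], a=40, b=60)           # hypothetical marginal for parameter 1
p2 = beta.ppf(u[:, 1], a=15, b=85)           # hypothetical marginal for parameter 2

combined = p1 * p2                           # hypothetical combination of the parameters
ci = np.percentile(combined, [2.5, 97.5])
print(ci)
```

Rerunning the sketch with different values of rho is the kind of sensitivity analysis the text above recommends when the true dependence structure is unknown.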