Hostname: page-component-586b7cd67f-tf8b9 Total loading time: 0 Render date: 2024-11-22T11:46:50.887Z Has data issue: false hasContentIssue false

BLC mini-series: New statistical approaches and research practices for bilingualism research

Published online by Cambridge University Press:  18 August 2021

João Veríssimo*
Affiliation:
Center of Linguistics, School of Arts and Humanities, University of Lisbon, Lisbon, Portugal
*
Address for correspondence: João Veríssimo, Faculdade de Letras da Universidade de Lisboa, Alameda da Universidade, 1600-214Lisboa, Portugal. E-mail: [email protected]
Rights & Permissions [Opens in a new window]

Extract

The past decade has witnessed some dramatic methodological changes in the wider disciplines of psycholinguistics, psychology, and experimental linguistics. One such set of changes comprises the development of open and transparent research practices, which have increasingly been adopted in response to concerns that empirical results often fail to replicate and may not generalise across samples and experimental conditions (Gibson & Fedorenko, 2013; Maxwell, Lau, & Howard, 2015; McElreath & Smaldino, 2015; Yarkoni, 2020). Another important set of changes concerns the use of sophisticated statistical techniques, such as mixed-effects models (Baayen, Davidson, & Bates, 2008) and Bayesian analyses (Vasishth, Nicenboim, Beckman, Li & Kong, 2018), which can provide much more information about magnitudes of effects and sources of variation than the more traditional statistical approaches.

Type
Editorial
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press

The past decade has witnessed some dramatic methodological changes in the wider disciplines of psycholinguistics, psychology, and experimental linguistics. One such set of changes comprises the development of open and transparent research practices, which have increasingly been adopted in response to concerns that empirical results often fail to replicate and may not generalise across samples and experimental conditions (Gibson & Fedorenko, Reference Gibson and Fedorenko2013; Maxwell, Lau, & Howard, Reference Maxwell, Lau and Howard2015; McElreath & Smaldino, Reference McElreath and Smaldino2015; Yarkoni, Reference Yarkoni2020). Another important set of changes concerns the use of sophisticated statistical techniques, such as mixed-effects models (Baayen, Davidson, & Bates, Reference Baayen, Davidson and Bates2008) and Bayesian analyses (Vasishth, Nicenboim, Beckman, Li & Kong, Reference Vasishth, Nicenboim, Beckman, Li and Kong2018), which can provide much more information about magnitudes of effects and sources of variation than the more traditional statistical approaches.

Some of these changes have already trickled down to the field of bilingualism, while others are under way. The time is ripe to document these changes and take a broad view of the state-of-the-art, while at the same time examining the added value that such methodological advances can provide to bilingualism and second language research. These are the main goals of this mini-serie of seven peer-reviewed articles; altogether they review a wide range of novel and emerging statistical approaches and research practices in our field.

The first two contributions cover open science research practices. In the first of these, Bolibaugh, Vanek, and Marsden (Reference Bolibaugh, Vanek and Marsden2021) call for a “credibility revolution” in bilingualism research. They argue that complete disclosure of data, analysis code, and materials, is a minimum standard for assessing (and improving) the reproducibility and replicability of empirical results. The article also provides examples and recommendations for good practices in bilingualism research.

The second article by Mertzen, Lago, and Vasishth (Reference Mertzen, Lago and Vasishth2021), explains the benefits of preregistration, another research practice that can greatly improve the openness and transparency of bilingualism research. The authors provide practical examples of how preregistration can firmly distinguish hypothesis-testing from data exploration, and, more generally, how it can counter questionable research practices and unconscious biases on the part of researchers.

The two contributions that follow are at the border between statistical approaches and research practices. They present statistical techniques that are used in the service of study design and for drawing inferences from the published empirical literature. Brysbaert's (Reference Brysbaert2021) article discusses the potentially serious consequences of low statistical power in bilingualism research. The author presents a series of compelling simulations that demonstrate how small sample sizes can turn our statistical inferences into very blurry reflections of reality – especially when estimating between-group interactions, a common situation in bilingualism research.

In the next article Plonsky, Sudina, and Hu (Reference Plonsky, Sudina and Hu2021) review the application of meta-analysis (perhaps the oldest of the ‘new’ approaches presented here) to research on bilingualism. They discuss the strengths and advantages of meta-analysis over traditional literature reviews and provide a comprehensive overview of the research syntheses that have already been conducted in the field. Moreover, the authors provide a practical introduction to the major stages involved in conducting a meta-analysis, including data collection, analysis, and interpretation.

The last three contributions are short tutorials on specific statistical techniques and their application to various measures. Miwa and Baayen (Reference Miwa and Baayen2021) present an introduction to generalized additive mixed models a powerful and flexible regression-based approach that allows modelling nonlinear relationships between predictors and outcomes. The authors reanalyse two datasets of bilingual lexical decision and show that the technique is particularly useful for examining nonlinear interactions between continuous predictors, as well as for modelling the timecourse of lexical effects.

In the next contribution, Stone, Lago, and Schad (Reference Stone, Lago and Schad2021) present a novel method for estimating the onset of divergences in the visual world eye-tracking paradigm. The article discusses and compares various approaches to timecourse analysis and proposes a new bootstrapping procedure that overcomes the limitations of previous methods. The authors also exemplify how their method can be used to estimate timecourse differences between conditions or speaker groups, which may be crucial for distinguishing between theoretical accounts of L2 processing.

Finally, in my own contribution (Veríssimo, Reference Veríssimo2021), I apply Bayesian ordinal models to the analysis of rating scales (e.g., of grammatical acceptability or language proficiency). I first show how the statistical methods that are commonly used suffer from important flaws, and then illustrate how ordinal models can provide more valid, accurate, and informative inferences about graded constructs such as language proficiency.

The articles that feature in this mini-series all share three important attributes. First, they acknowledge the particular characteristics of bilingualism research: for example, a reliance on between-group comparisons, the measurement of latent constructs (e.g., language proficiency), the possibility of nonlinear effects, and the importance of timecourse analysis for testing theories of bilingual processing. Second, they collectively illustrate the large variety of data types, tasks, and measures that bilingualism researchers routinely deal with and care about (e.g., response times, target fixations, rating scales). Third the articles in this mini-series all speak from a practical standpoint. That is, they focus on the steps and challenges that are involved in the actual implementation of these technique.

I hope that this mini-series can serve not only as a systematic collection of resources, but also as a ‘call-to-action’: an invitation for students and researchers to integrate these techniques into their own methodological toolboxes. Taken together, the statistical approaches and research practices presented here hold the potential to improve our inferences, increase the openness and transparency of our field, and help establish bilingualism research as a cumulative scientific enterprise.

References

Baayen, RH, Davidson, DJ and Bates, DM (2008) Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language 59, 390412.CrossRefGoogle Scholar
Bolibaugh, C, Vanek, V and Marsden, E (2021) Towards a credibility revolution in bilingualism research: Open data and materials as stepping stones to more reproducible and replicable research. Bilingualism: Language and Cognition.CrossRefGoogle Scholar
Brysbaert, M (2021) Power considerations in bilingualism research: Time to step up our game. Bilingualism: Language and Cognition. https://doi.org/ghn3f4Google Scholar
Gibson, E and Fedorenko, E (2013) The need for quantitative methods in syntax and semantics research. Language and Cognitive Processes 28, 88124.CrossRefGoogle Scholar
Maxwell, SE, Lau, MY and Howard, GS (2015) Is psychology suffering from a replication crisis? What does “failure to replicate” really mean? American Psychologist 70, 487498.CrossRefGoogle ScholarPubMed
McElreath, R and Smaldino, PE (2015) Replication, communication, and the population dynamics of scientific discovery. PloS One 10, e0136088.CrossRefGoogle ScholarPubMed
Mertzen, D, Lago, S and Vasishth, S (2021) The benefits of preregistration for hypothesis-driven bilingualism research. Bilingualism: Language and Cognition. https://doi.org/gkq622CrossRefGoogle Scholar
Miwa, K and Baayen, H (2021) Nonlinearities in bilingual visual word recognition: An introduction to generalized additive modeling. Bilingualism: Language and Cognition. https://doi.org/gjhqddCrossRefGoogle Scholar
Plonsky, L, Sudina, E and Hu, Y (2021) Applying meta-analysis to research on bilingualism: An introduction. Bilingualism: Language and Cognition 16. https://doi.org/ghwkx2CrossRefGoogle Scholar
Stone, K, Lago, S and Schad, DJ (2021) Divergence point analyses of visual world data: Applications to bilingual research. Bilingualism: Language and Cognition. https://doi.org/ghpwsbGoogle Scholar
Vasishth, S, Nicenboim, B, Beckman, ME, Li, F and Kong, EJ (2018) Bayesian data analysis in the phonetic sciences: A tutorial introduction. Journal of Phonetics 71, 147161. https://doi.org/gfzq3cCrossRefGoogle ScholarPubMed
Veríssimo, J (2021) Analysis of rating scales: A pervasive problem in bilingualism research and a solution with Bayesian ordinal models. Bilingualism: Language and Cognition.CrossRefGoogle Scholar
Yarkoni, T (2020) The generalizability crisis. Behavioral and Brain Sciences 137, doi:10.1017/S0140525X20001685CrossRefGoogle ScholarPubMed