A key consideration in bilingual language acquisition is the extent to which individual difference factors impact linguistic development. These factors may be categorised as internal or external to the individual (Paradis, Reference Paradis2011; Unsworth et al., Reference Unsworth, Hulk and Marinis2011): internal factors include biological and cognitive elements like chronological age and age of second language (L2) onset (Paradis, Reference Paradis2007, Reference Paradis2011; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011), while external factors involve the environmental context of language input. This experience may be further categorised based on quantity and quality of language exposure from both proximal (direct) and distal (indirect) sources across various settings such as at home, school, and participation in social activities (Paradis, Reference Paradis2007, Reference Paradis2011).
Investigating the impact of different factors on bilingual language acquisition is a challenging task as the factors can often co-vary with each other, or they can interact in different ways with different language features, the first language (L1), or even the language assessment itself. Variation in these elements has produced mixed results for a range of factors. For example, age is purported to be a key predictor of language proficiency in bilingualism. This has been shown in relation to a younger age of onset advantage where those who began exposure to the L2 earlier were more proficient in comparison to those who began exposure at later ages (e.g., Johnson & Newport, Reference Johnson and Newport1989). Conversely, age effects have also been found in relation to older chronological age whereby older participants have been shown to have greater linguistic proficiency (e.g., Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978). However, results are more mixed for more fine-grained measures of L2 input (e.g., de Cat, Reference de Cat2020; Jia & Fuse, Reference Jia and Fuse2007; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016; Unsworth, Reference Unsworth2016), and for distal input factors like socio-economic status (e.g., Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; but see Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2020).
In this paper we aim to tease apart the relationship between different factors and L2 outcomes across a number of specific linguistic domains, via regression analyses. We will do this by using both standardised expressive tests of language proficiency and a novel experimental task to evaluate language comprehension by sequential bilingual children from Arabic speaking backgrounds. The key finding is that sequential bilingual children’s L2 vocabulary and morphological proficiency is predicted by (overall L2) input, while L2 syntax is predicted by the children’s age. This result reflects the fact that age is a cumulative measure of experience in both the L1 and the L2 and the development of general cognitive processes; we argue that this result confirms the deeper properties of syntactic knowledge.
Individual Difference Factors
A number of individual difference factors have been examined in previous research (e.g., Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; de Cat, Reference de Cat2020; Jia & Fuse, Reference Jia and Fuse2007; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016; Unsworth, Reference Unsworth2016). Below, we begin by reviewing key biological, cognitive and environmental effects discussed in the literature: in particular we focus on age, language exposure, socioeconomic status and maternal L2 proficiency.
Internal Factors
One of the most commonly measured but controversial internal factors in bilingual language acquisition research is age. In research on bilingualism, the influence of age is typically measured in one of two ways: (1) from the time at which data collection occurs, often referred to as age at time of testing or chronological age; and (2) from the point of first meaningful exposure to the target language, commonly referred to as age of L2 onset. Both types of measurement can provide unique and inter-related influences on how language acquisition manifests (Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011).
Chronological Age
Chronological age can impart insights on the cumulative impact of external experience and internal development: for bilingual acquisition, this includes language experience from both the L1 and the L2, as well as the biological development of cognitive capacities. Among others, these capacities include memory, attention, analytic reasoning, executive control and metalinguistic awareness, all of which play a role in language acquisition and processing (e.g., De Villiers, Reference De Villiers2007; Harley & Hart, Reference Harley and Hart1997; Long & Rothman, Reference Long and Rothman2014; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Szmalec et al., Reference Szmalec, Brysbaert and Duyck2012; Tsimpli, Reference Tsimpli2014). As a result, older age predicts greater proficiency in the L1 (S. Gathercole & Baddeley, Reference Gathercole and Baddeley1993; Stevens, Reference Stevens2006).
With the later introduction of the L2, this development occurs on a different timescale (Blom & Paradis, Reference Blom and Paradis2015; S. Gathercole, Reference Gathercole1999, Reference Gathercole2006; Paradis, Reference Paradis2011). Nonetheless, the greater cognitive development of older L2 children licenses higher ability of the cognitive properties which constrain perception and memory, consequently facilitating an advancement in processing language (Cummins, Reference Cummins1981). The more advanced cognitive development of older L2 learners is associated with the faster development of linguistic proficiency in the initial stages of L2 acquisition, also known as the rate advantage. Children at older ages also have more experience with language in general through familiarity and involvement with their native language which may facilitate particular L2 conceptual-lexical mappings if these have already been established in the L1 (Paradis, Reference Paradis2007).
In previous studies, age effects have varied across language domains. For example, vocabulary is consistently predicted by age, with more advanced lexical development in older bilingual children (e.g., Blom & Bosma, Reference Blom and Bosma2016; Paradis, Reference Paradis2011; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978; Unsworth, Reference Unsworth2016). Similarly, chronological age has been shown to predict morphological features (e.g., Paradis, Reference Paradis2011; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978). However, age effects are less robust in the morphosyntactic domain (e.g., Bedore et al., Reference Bedore, Peña, Griffin and Hixon2016; Blom & Bosma, Reference Blom and Bosma2016; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Nishikawa, Reference Nishikawa2014; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016). In addition, mixed results are observed for sentence structures: while some studies have found an age effect (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017), others have not (e.g., Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011).
Age of L2 Onset
Age of L2 onset is both one of the most researched factors and one of the most controversial factors in bilingualism (for reviews, see Herschensohn, Reference Herschensohn2007; Muñoz & Singleton, Reference Muñoz and Singleton2011). Age of onset effects are observed in both L1 and L2 acquisition, such that those who begin exposure earlier ultimately attain greater competence than those who begin later, thus supporting an ‘earlier is better’ perspective for language acquisition (e.g., Abrahamsson & Hyltenstam, Reference Abrahamsson and Hyltenstam2009; Coppieters, Reference Coppieters1987; Curtiss, Reference Curtiss1977; DeKeyser, Reference DeKeyser2000; Gleitman & Newport, Reference Gleitman and Newport1995; Johnson & Newport, Reference Johnson and Newport1989, Reference Johnson and Newport1991; Oyama, Reference Oyama1976; Patkowski, Reference Patkowski1980, Reference Patkowski1994; Schachter, Reference Schachter1990; Schwartz, Reference Schwartz2004). The early age of onset advantage for language learning has been attributed to a critical period, whereby linguistic proficiency is only possible if acquisition begins before a particular age (Johnson & Newport, Reference Johnson and Newport1989; Lenneberg, Reference Lenneberg1967; Meisel, Reference Meisel2008, Reference Meisel2009). Under this Critical Period Hypothesis, an innate optimal period occurs in the early years of childhood (often proposed to range from approximately two to five years of age) which is followed by an offset phase (commonly proposed to occur at around adolescence) (Lenneberg, Reference Lenneberg1967).
The early advantage is not necessarily absolute – exceptions are observed for both early and later ages, as well as interactions between the age of L2 onset and other internal and external factors (e.g., Kinsella & Singleton, Reference Kinsella and Singleton2014; Muñoz & Singleton, Reference Muñoz and Singleton2007). However, later ages of onset in L2 acquisition are generally associated with a qualitatively different developmental trajectory (e.g., error type) (e.g., Johnson & Newport, Reference Johnson and Newport1989; Kroffke & Rothweiler, Reference Kroffke and Rothweiler2006; Meisel, Reference Meisel2008, Reference Meisel2009, Reference Meisel2014; Rothweiler & Lleó, Reference Rothweiler and Lleó2006; Sopata, Reference Sopata2010). Meanwhile, findings are less conclusive for sequential bilinguals: in the morphosyntactic domain, advantages have been observed for an earlier age of L2 onset (e.g., Bedore et al., Reference Bedore, Peña, Griffin and Hixon2016; Nishikawa, Reference Nishikawa2014; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016) or not at all (e.g., Blom & Bosma, Reference Blom and Bosma2016; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Unsworth, Reference Unsworth2016; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017), while an older age of onset has been advantageous for L2 syntax (e.g., Rothman et al., Reference Rothman, Long, Iverson, Judy, Lingwall and Chakravarty2016).
In sum, age as an internal factor has predicted L2 acquisition in previous studies; however the results are selective – if not mixed – with respect to language domain. These factors may also interact with external factors, discussed in the following section.
External Factors
External factors in L2 acquisition affect the environmental contexts of language exposure. These factors may be categorised under the terms proximal and distal. Proximal factors refer to those which relate directly to the individual’s language input and how it can vary, either quantitatively or qualitatively. In previous research on L2 acquisition, proximal factors have included quantitative measures of language input and language use (i.e., output), as well as the qualitative richness of the language environment (Paradis, Reference Paradis2023). Distal factors are those which impact on the language environment, resulting in variation in this environment; these have included socioeconomic status and parental L2 proficiency (Unsworth et al., Reference Unsworth, Hulk and Marinis2011; Paradis, Reference Paradis2011; De Houwer, Reference De Houwer2018; Paradis, Reference Paradis2023).
(Proximal) Quantitative Input
Quantitative language input factors are based on the amount of exposure to the target language. These factors are generally determined by measuring the overall length of exposure or the amount of target language contact at home, school, and in the community (Paradis, Reference Paradis2011; Unsworth et al., Reference Unsworth, Hulk and Marinis2011). Variability in L2 proficiency is often attributed to the individual differences in linguistic experience, with longer L2 exposure in sequential bilingual acquisition associated with greater proficiency. This correspondence is observed across language domains, including lexical range (e.g., Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011), morphology (e.g., Paradis, Reference Paradis2011) and sentence structures (e.g., Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). However, the overall length of exposure to the target language is linearly related to age of L2 onset: if chronological age is held constant, then a later age of L2 onset results in less overall L2 input. This relation complicates the interpretation of each factor individually – i.e., the impact of age of onset as an internal factor and the impact of length of exposure as an input factor (Paradis, Reference Paradis2011; Stevens, Reference Stevens2006).
To complement the overall L2 input measure, more fine-grained measures of the input aim to capture a more comprehensive representation of L2 language experience. This can involve measuring the amount of L2 input/output across the different L2 settings in which the individual participates. This more targeted measure can include specific environments such as day-care, nursery, or school, and has a positive impact on language acquisition (e.g., Blom, Reference Blom2010; V. Gathercole, Reference Gathercole2002, Reference Gathercole2007). Meanwhile, as with an overall length of exposure measure, the language environment represents a broad overview of L2 input; as a result, there may be variation in the amount and characteristics of regular L2 exposure within these environments (Unsworth, Reference Unsworth2013).
Other examples of fine-grained input measures include quantitative calculations of how the language is used at the child’s home (Paradis, Reference Paradis2011). This language use at home can refer to how much of the target language is heard by the individual (i.e., how much the other household members use the target language to the child) and how much is produced by the individual (i.e., how much the child uses the target language at home to other household members, e.g., parents/caregivers, siblings, and other adults). However, in previous studies on children’s morphosyntactic production, L2 language use at home has not predicted children’s L2 proficiency (e.g., Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). This suggests that non-native speakers who have a restricted fluency in the L2 in the home may have a negligible impact on the child’s L2 ability, a finding which has been replicated in other research (Hoff, Welsh, Place, Ribot, et al., Reference Hoff, Welsh, Place, Ribot, Grüter and Paradis2014).
(Proximal) Qualitative Input
Qualitative input measures are typically based on the richness of the target language; this depends on the activities the individual engages in with the target language as the primary language in these settings. These activities can include e.g., reading, watching TV, and play or social activities (Paradis, Reference Paradis2011).
Acquiring language through play based activities may be beneficial in comparison to language learning through formal settings as it often involves symbolic thinking where children engage in pretend play behaviour (e.g., roleplaying, and object substitution – the non-literal use of objects, actions or persons; Kane et al., Reference Kane, Kennedy, Sevdali, Folli and Rhys2019). This is associated with greater visual object recognition and the development of noun vocabulary skills (Pereira & Smith, Reference Pereira and Smith2009; Smith, Reference Smith2003; Smith & Jones, Reference Smith and Jones2011). The social-interaction element of play often involves collaboration between children where they assume different roles and negotiate the direction of play; this may comprise the use of more advanced language skills including complex grammatical and pragmatic forms compared to what is used in other situations (Bergen & Mauer, Reference Bergen and Mauer2000; Weisberg et al., Reference Weisberg, Zosh, Hirsh-Pasek and Golinkoff2013). In addition, a play activity offers children the opportunity to engage with a considerable amount of language input and use.
In relation to L2 language development, interaction through play has been shown to motivate preschool children to engage with the L2 and practice their L2 skills in supportive and encouraging environments (Fassler, Reference Fassler1998; Piker, Reference Piker2013). Playing with others provides children with the opportunity to learn language from peers and to practice what they may have previously acquired in other situations (Ervin-Tripp, Reference Ervin-Tripp, Scales, Almy, Nicolopoulou and Ervin-Tripp1991). When richness of the language environment has been investigated as a separate L2 input measure, it has predicted children’s proficiency across language domains, including vocabulary and morphology (e.g., Paradis, Reference Paradis2011) and syntax (e.g., Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017).
Distal Input Factors
Distal factors include parental language proficiency, family size, and socioeconomic status (measured by parental educational level or occupation); which in turn influence the individual’s language exposure (Hoff, Reference Hoff2006; Paradis & Grüter, Reference Paradis, Grüter, Grüter and Paradis2014; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2018). Previous studies on distal input factors like socioeconomic status and parental L2 proficiency yield mixed results. Some studies show socioeconomic status to have no impact on linguistic scores (e.g., Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017), whereas others do find an effect (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; de Cat, Reference de Cat2020; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2018). However, the impact of socioeconomic status can vary depending on various factors, e.g., whether the mother’s education was completed in the L1 or L2 (Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2018), and can also interact with other factors, e.g., the child’s length of L2 exposure (de Cat, Reference de Cat2020).
These findings also vary by linguistic domain. For example, lexical development has been predicted by both maternal L2 proficiency and maternal education as measures of socioeconomic status (Hammer et al., Reference Hammer, Komaroff, Rodriguez, Lopez, Scarpino and Goldstein2012), by maternal L2 proficiency but not maternal education (Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011), or by neither measure (Paradis, Reference Paradis2011). Effects are similarly mixed for morphological and syntactic development – with a positive association with maternal education (Paradis, Reference Paradis2011), or no association with maternal L2 proficiency (Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2020), or no association with either measure (Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017).
Finally, in addition to the individual measures reviewed above, recent studies on the impact of external factors have also included composite measures of input which combine quantitative, qualitative, and distal input features (e.g., de Cat, Reference de Cat2020; Jia & Fuse, Reference Jia and Fuse2007; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016; Unsworth, Reference Unsworth2016). This approach aims to capture any changes in the child’s language experiences over time; nevertheless, findings from previous studies using this type of combined measure of external factors are also mixed across language domains (Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Unsworth, Reference Unsworth2016; de Cat, Reference de Cat2020; Jia & Fuse, Reference Jia and Fuse2007; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016).
Summary
This review of the literature shows that there are different results in relation to the impact of different factors on language proficiency of sequential bilingual children, and a table summarising this discussion is provided in the supplementary materials. We argue that the reasons for the differences found between previous studies may be due to how the factors were measured (e.g., different task types used), differences among the participants (e.g., in terms of age or L2 exposure) and the target language domain or features under investigation. The current study therefore aims to further explore the impact of individual difference factors across a number of specific language domains and features, with a focus on L1 Arabic speaking children in Northern Ireland.
Newcomer Children in Northern Ireland
Children who fall within the category of sequential bilinguals are also often referred to as ‘Newcomer’ children in Northern Ireland (NI), defined by The Department of Education in NI (DENI) as a child or young person who has “enrolled in a school but who does not have satisfactory language skills to participate fully in the school curriculum and does not have a language in common with the teacher” DENI (2009, p. iii).
The growing numbers of Newcomer children in NI is a relatively recent phenomenon, with the numbers of Newcomer children having increased 12-fold over the last 20 years to represent nearly 7% of primary school enrolments, or almost 16,000 Newcomer pupils (DENI, 2019). In recent years Arabic speakers have migrated to NI with a high number being refugees resettled from the Syrian conflict. Syrian nationals now account for 0.08% of Northern Ireland’s population and Arabic speaking Newcomer pupils constitute the fourth largest linguistic group in NI schools after Polish, Lithuanian and Romanian speaking pupils (DENI, 2019). Therefore, Arabic speaking children are forming an ever-increasing part of the Newcomer pupil population in NI schools and therefore constitute an important linguistic group to investigate in terms of the impact of individual difference factors.
The Current Study
This research is a cross-sectional study of sequential bilingual children in NI which aims to tease apart the impact of different internal and external factors to determine their effect across a range of linguistic domains (vocabulary, morphology, and syntax) and features (third person singular and past tense morphology, active and passive voice, and subject and object relative clauses).
Our research questions are:
-
(1) How do internal and external factors affect proficiency in sequential bilinguals?
-
(2) Do the effects vary across language domains (vocabulary, morphology, syntax) and features (third person singular and past tense morphology, active and passive voice, and subject and object relative clauses)?
The complex and somewhat opposing findings from the literature reviewed above lead to various possible hypotheses. One hypothesis is that “earlier is better” for sequential bilinguals, i.e., an earlier age of L2 onset supports L2 proficiency across language domains (Johnson & Newport, Reference Johnson and Newport1991); if so, then we would predict increased scores for children who began acquiring English at younger ages (e.g., Bedore et al., Reference Bedore, Peña, Griffin and Hixon2016; Nishikawa, Reference Nishikawa2014; Roesch & Chondrogianni, Reference Roesch and Chondrogianni2016). However, another hypothesis for sequential bilinguals is that L2 proficiency is higher for older children, due to more advanced cognitive development and experience with age; if so, we would predict increased scores on vocabulary, morphology, and syntax with older ages (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978).
A further hypothesis, however, is that the L2 proficiency is driven by overall input, which would predict increased scores across domains with greater exposure to the L2. This hypothesis includes the overall quantity of L2 input, but also more fine-grained external factors such as richness of the L2 environment or L2 use at home, in which case we would predict increased scores on vocabulary, morphology, and syntax with richer L2 language environments (e.g., Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017) or with more L2 use at home (c.f. Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017).
Finally, two further hypotheses concern the distal input factors socioeconomic status and maternal L2 proficiency. If socioeconomic status influences L2 proficiency, then we would predict higher scores on vocabulary, morphology, and sentence structures for children from higher socioeconomic backgrounds (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; de Cat, Reference de Cat2020; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2020). Similarly, if maternal L2 proficiency influences the child’s L2 proficiency, then we would predict higher scores across domains for children with mothers who have greater L2 English abilities (e.g., Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2020).
It may also be the case that these factors interact with one another – that is, the impact of one factor may depend on another factor. Interactions between variables are prevalent in language research (Winter & Grice, Reference Winter and Grice2021; Winter, Reference Winter2022) and therefore it is important that these are included in analyses to have a fuller understanding of the associations between variables and the impact of predictors on language outcomes.
Finally, different internal and external factors may affect different features across language domains. That is, the specific effect of a given factor on a language feature may depend on how this factor interacts with specific grammatical knowledge, specific sentence processing procedures or the combination of this knowledge and how it is deployed in a specific language assessment of a specific domain. For example, the processes involved in lexical retrieval are not identical to those required for identifying context-specific morphological affixes, which in turn differ from resolving long-distance dependencies (e.g., object relative clauses) and parsing non-canonical word order (e.g., passive voice sentences) (Ferreira, Reference Ferreira2003; Lewis & Vasishth, Reference Lewis and Vasishth2005; Lewis et al., Reference Lewis, Vasishth and Van Dyke2006; Marinis, Reference Marinis2007, p. 20007; Marinis & Saddy, Reference Marinis and Saddy2013; Stromswold, Reference Stromswold2004; Van Dyke & Lewis, Reference Van Dyke and Lewis2003). Therefore, these different procedures may be impacted by different internal and external factors. If so, then the above predictions may be borne out selectively across factors or language domains and features.
Methodology
Participants
The participants were 40 typically developing sequential bilingual children (5;7-12;2, mean = 8;4) from L1 Arabic-speaking backgrounds acquiring English as an additional language. Each child completed three language tasks, and parents completed a questionnaire on internal and external factors. Age of L2 onset for the bilingual children ranged from 5 months to 10 years old (mean = 4;3) and length of L2 exposure ranged from 7 months to 10;6 years (mean = 4;1). The bilingual participants spoke Arabic as a main language at home, and they, or their parents, originated from thirteen different Arabic-speaking countries. None of the children who participated in the study had a language impairment or learning disability as confirmed by their schools.
Participants were recruited from nine primary schools in and around Belfast, Northern Ireland via convenience sampling. Each school involved in recruitment used English as the language of instruction and followed the Northern Ireland Curriculum (Foundation Stage to Key Stage 2). Schools varied in terms of socioeconomic area and location with some in inner city areas while others were suburban. Schools also varied with regard to whether they had specific L2 support for Newcomer children, e.g., some had specific classes for English language assessment and development for Newcomer children while others offered more limited (e.g., in-class classroom assistant support only) or no extra support.
Procedures
Each child was tested in a quiet room on school premises and the data were gathered by a native English speaker from Northern Ireland. The tasks were delivered in English – i.e., the participants’ L2 – and L1 proficiency was not assessed.
Vocabulary Task
The children’s lexical development was evaluated with the Word Finding Vocabulary Test from The Renfrew Language Scales (Renfrew, Reference Renfrew1995). This assessment has been used in previous studies to assess children’s vocabulary proficiency in typical development (Buckley et al., Reference Buckley, Underwood and Purdie2013; Hastie, Reference Hastie2015), as well as multilingual and clinical populations (e.g., Antoniou et al., Reference Antoniou, Grohmann, Kambanaros and Katsos2016; Kwok et al., Reference Kwok, Brown, Smyth and Oram Cardy2015).
The Word Finding Vocabulary Test comprises 50 line-drawn pictures of objects of increasing difficulty which are shown to the participant one by one, and the participant is asked to name each target item aloud. In the current study, responses were scored either correct (1) or incorrect (0) for each of the trials and each trial was entered into the dataset individually.
Morphology Task
The children’s use of third person singular and past tense morphology was assessed using the Test of Early Grammatical Impairment Screening Test (TEGI; Rice & Wexler, Reference Rice and Wexler2001), a standardised test of morphosyntactic competency. The TEGI is used frequently in studies on bilingual acquisition (e.g., Blom & Paradis, Reference Blom and Paradis2015; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Paradis, Reference Paradis2011; Paradis & Blom, Reference Paradis and Blom2016), and comprises picture elicitation probes for the target morphological features, as in (1) (for third person singular):
Participants were presented with 10 test trials on the third person singular and 18 trials on the past tense (10 regular, 8 irregular). Participants scored either correct (1) or incorrect (0) for each of the trials and each trial was entered into the dataset individually.
Syntax Task
Syntax was evaluated using The Coloring Book Task, a digital colouring paradigm completed on a touchscreen PC (Pinto & Zuckerman, Reference Pinto and Zuckerman2019; Zuckerman et al., Reference Zuckerman, Pinto, Koutamanis and Spijk2016). In the Coloring Book Task, target structures are assessed by using the test sentence as a prompt to colour in a black and white picture; participants’ interpretations are then inferred based on how they colour in the pictures. (Pinto & Zuckerman, Reference Pinto and Zuckerman2019; Gerard et al., Reference Gerard, Lidz, Zuckerman and Pinto2018; Bosch et al., Reference Bosch, Zuckerman, Pinto, Guijarro-Fuentes and Suárez-Gómez2020).
The target syntactic structures in the current study were active voice, passive voice, subject relative clauses, and object relative clauses, which were used in prompts like (2) and (3) to colour a black and white picture like Figure 1.
The main test items consisted of 32 trials, with 8 items for each sentence structure (active voice, passive voice, subject relative clauses, object relative clauses) which were alternated with fillers as in (4):
Participants were scored either correct (1) or incorrect (0) for each of the trials and each trial was entered into the dataset individually.
Parent questionnaire
A parent questionnaire, adapted from the Alberta Language Environment Questionnaire (Paradis, Reference Paradis2011) was used to obtain a comprehensive representation of participant demographics (chronological age and age of L2 onset) and environmental (external) language factors. The questionnaire was translated to Modern Standard Arabic by a native Arabic speaker and completed by the participants’ mothers. The factors measured in the questionnaire are detailed in Table 1.
Note that the age of L2 onset may be calculated by subtracting the length of L2 exposure from chronological age, meaning that age of L2 onset and length of L2 exposure are not independent. While both factors are reported in the literature, the results in the following section will report the length of L2 exposure rather than the age of L2 onset both to avoid statistical non-convergence and to include both an internal and external (input-based) factor. We consider the implications of both factors in the discussion.
Results
Descriptive Statistics
Mean, standard deviation (SD), range and descriptions of scores for the internal and external predictor variables are reported in Table 2. For the linguistic measures, we report these scores along with Cronbach’s Alpha for internal reliability (Table 3). Scores ranged particularly widely between participants for the third person singular and more complex syntactic structures (passive voice and object relative clauses), with generally high scores for the less complex structures (active voice and subject relative clauses). High internal reliability was also observed across tasks.
Statistical Analysis
To analyse the relationships between the internal and external factors across language domains, we performed mixed effects logistic regression modelling using R (R Core Team, Reference Team2017) and lme4 (Bates et al., Reference Bates, Mächler, Bolker and Walker2015). The models were developed to predict correct and incorrect responses in the children’s production or comprehension. Each response was entered as an individual trial in the data set, resulting in a binary outcome variable (correct/incorrect). Fixed effects were the factors in Table 2, aside from the age of L2 onset as described above. Random effects were selected to account for variation across participants; these included ‘participant,’ ‘item’ (i.e., each item from the target linguistic measures), and ‘school’ (the schools the participants attended). Where a model revealed variation by a given random effect, it was retained in the model.
The modelling strategy was to find a parsimonious set of predictors that maximally explained the differences in the participants’ L2 English language proficiency scores while avoiding over-fitting the data (i.e., the highest generalisability from the smallest subset of predictor variables), for each language domain. Therefore, from a model which overfitted the data, backwards stepwise elimination of fixed effects which were not significant (Bates et al., Reference Bates, Mächler, Bolker and Walker2015) resulted in a reduced model, which was as parsimonious as possible (Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). This procedure is described for each model in the supplementary materials, available from https://osf.io/pr26s/?view_only=5c1e2dfa51374540814d7e13ee27db92.
However, before running the logistic regressions, we first conducted a correlation analysis to explore the general linear relationships across all of the factors included as fixed effects and the language measures, in Table 4 (See the supplementary materials for plots).
*** p < 0.001
** p < 0.01
* p < 0.05
Table 4 includes several significant correlations. Notably, only two of these are between a predicting factor and a language measure: length of L2 exposure is significantly correlated with vocabulary (R = 0.65, p = 0.002), and chronological age is significantly correlated with subject relative clauses (R = 0.58, p = 0.005). The remaining correlations are between two language measures, or between two predicting factors.
The significant correlations between language measures include all three language domains, with correlations both across and (in the case of morphology and syntax) within domains. First, vocabulary was significantly correlated with accuracy on both TEGI morphology tasks – the third person singular (R = 0.69, p < 0.001) and the past tense (R = 0.72, p < 0.001), as well as the passive voice (R = 0.65, p < 0.001). In addition, the TEGI third person singular and past tense tasks were correlated with each other (R = 0.73, p < 0.001), and both tasks were correlated with the passive voice (third person singular R = 0.56, p < 0.001; past tense R = 0.68, p < 0.001). For syntax, significant correlations included the active voice and passive voice (R = 0.52, p < 0.001), active voice and object relative clauses (R = 0.51, p < 0.001), passive voice and subject relative clauses (R = 0.49, p = 0.002) and passive voice and object relative clauses (R = 0.55, p < 0.001)
Lastly, there were significant correlations between a number of the predicting factors. As expected, length of L2 exposure was negatively correlated with age of L2 onset (R = -0.67, p < 0.001) – that is, children with a later age of L2 onset had a lower quantity of L2 exposure. Length of L2 exposure was also correlated with L2 language use at home (R = 0.59, p = 0.006), while age of L2 onset was negatively correlated both with socioeconomic status – as measured by maternal education – (R = -0.57, p < 0.001) and with maternal L2 proficiency (R = -0.56, p < 0.001). Finally, socioeconomic status and maternal L2 proficiency were correlated with each other (R = 0.75, p < 0.001).
With the high number of correlations between measures, a stepwise elimination approach was used for the regression analyses to avoid cancelling effects. The analyses for each language domain are presented in the following sections.
Logistic Regression Analyses for Vocabulary
Stepwise elimination of fixed effects which did not reach significance resulted in a model for vocabulary which included the predictors Chronological age, Length of L2 exposure, L2 language use at home, Richness of the L2 environment and Socioeconomic status, as well as random effects for participants, items and schools; this model included 2000 observations (i.e., individual trials). The significant predictors for the vocabulary task were Length of L2 exposure (β = 0.05, Z = 4.68, p < 0.001) and Richness of the L2 environment (β = 2.65, Z = 2.20, p =0.028), with marginal significance observed for the remaining predictors (Table 5).
*** p < 0.001
** p < 0.01
* p < 0.05
˙ p < 0.1
The significant main effect of length of L2 exposure indicates that those who had longer exposure to L2 English were more likely to give a correct response on the vocabulary task compared to those who had less exposure to L2 English (Figure 2). Meanwhile, the main effect of Richness of the L2 environment is due to higher accuracy on the vocabulary task with a higher degree of participation in L2 play and social activities (Figure 3).
Logistic regression analyses for morphology
The morphology model included the third person singular and the past tense as two levels of a single fixed effect morpheme to identify the impact of factors across the two types of morphological features. Stepwise elimination of fixed effects which did not reach significance resulted in a model for morphology which included the predictors chronological age, length of L2 exposure and morpheme, as well as an interaction between morpheme and length of L2 exposure, and random effects for participants and items; this model included 1120 observations. The model revealed a main effect of length of L2 exposure (β = 0.4, Z = 2.74, p = 0.006), with no significant interaction between the predicting factors and morpheme (Table 6).
Note:
*** p < 0.001
** p < 0.01
* p < 0.05
˙ p < 0.1
The main effect of length of L2 exposure reflects that those who had longer exposure to L2 English had greater accuracy compared to those who had less exposure to L2 English, for both third person singular and past tense morphology (Figure 4).
Logistic regression analyses for syntax
For the syntax model, structure and complexity were included as fixed effects to determine whether the impact of the internal and external factors depended on the type of language structure (voice or relative clause) and the language feature’s complexity (low – active voice and subject relative clause; high – passive voice and object relative clause). Stepwise elimination of fixed effects which did not reach significance resulted in a model for syntax which included the predictors chronological age, richness of the L2 environment, complexity and structure, as well as an interaction between complexity, structure and richness of the L2 environment, and random effects for participants; this model included 1280 observations.
The results of this model are presented in Table 7: significant main effects were observed across all syntactic structures for chronological age (β = 0.03, Z = 3.84, p < 0.001) and richness of the L2 environment (β = 2.04, Z = 2.20, p =0.028). In addition, we observed the expected main effect of complexity due to higher accuracy with low complexity (β = 1.64, Z = 7.28, p < 0.001), as in Table 3. Finally, there was a significant three-way interaction between complexity, structure and richness of the L2 environment (β = 2.90, Z = 2.27, p = 0.024).
Note:
*** p < 0.001
** p < 0.01
* p < 0.05
˙ p < 0.1
The main effect of chronological age is due to the overall higher accuracy for older children across structures than for younger children (Figure 5). Although this effect varied numerically across structures, the interaction with age was not included in the optimal model.
In addition, the main effect of L2 Richness is due to the higher accuracy with greater L2 richness, across structures (Figure 6). However, the effect of L2 richness was modulated by both structure and complexity, as indicated by the three-way interaction between these factors. That is, for both the active and passive voice, L2 richness predicted children’s accuracy, with higher accuracy observed with higher L2 richness. However, this effect of L2 richness was not observed for both relative clause structures, due to the overall ceiling accuracy for subject relatives. Instead, the effect of L2 richness was observed only for object relative clauses, resulting an interaction with L2 richness for relative clauses, but not for voice (i.e., an interaction between structure, complexity and L2 richness).
Discussion
The aim of this study was to tease apart the relations between different internal and external factors by measuring their impact across a number of different language domains and features in sequential bilingual acquisition in childhood. The research questions were (a) how internal and external factors affect L2 proficiency in sequential bilinguals, and (b) whether the effects vary across language domains and features. Different tasks were used to measure L2 proficiency, including standardised tests to measure production, and a novel experimental task to measure comprehension.
Consistent with the observation that different language domains involve different linguistic knowledge and different parsing procedures, we found that L2 proficiency for vocabulary, morphology and syntax was affected differently by different factors. In particular:
-
- vocabulary was predicted both by length of L2 exposure and by richness of the L2 environment (measured as participation in L2 play and social activities)
-
- morphology was predicted only by length of L2 exposure, with no interaction by morpheme (past tense/3rd person singular)
-
- syntax was predicted by age, with higher accuracy observed at older ages, particularly for the more complex structures (i.e., passive voice and object relative clauses); in addition, syntax was predicted by richness of the L2 environment, although to different degrees across the different structures.
Meanwhile, we observed no interactions between the internal and external factors: the above main effects were not modulated by any other internal or external predictor variable. The following section will revisit the internal and external factors and their impact across language domains. We then consider implications for theory and practice, and end with some limitations and future directions.
Internal and External Factors Revisited
Mixed results have been observed across previous studies in relation to the impact of different internal and external factors on L2 proficiency. The current study aimed to tease apart potential sources of this variation by investigating the impact of these factors across language domains. In the following sections, we discuss our results in the context of previous studies.
Internal factors: Age
Age in the current study was measured by way of chronological age and age of L2 onset. In this section we consider the impact of chronological age, while age of L2 onset is addressed below in relation to the length of L2 exposure (and is therefore discussed as an external factor).
In previous studies, age effects have been generally consistent within the lexical and morphological domains, but less so for morphosyntactic knowledge (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978). Consistent with these studies, chronological age did predict children’s interpretation of the sentence structures (active and passive voice, and subject and object relative clauses). However, the slopes of the age effect revealed greater differences in accuracy between the younger and older children for the more complex structures (passive voice and object relative clauses) compared to the less complex structures (active voice and subject relative clauses). This suggests that the age effect may be selective to specific measures of syntactic knowledge – e.g., more complex structures, if not also specific types of assessment, particularly in contexts with ceiling or floor effects for a given age range. This specificity may account for variation in the literature where we see contrasting results with different structures and assessments.
In addition, chronological age did not predict children’s vocabulary or morphology, which contrasts with previous research (e.g., Blom & Bosma, Reference Blom and Bosma2016; Paradis, Reference Paradis2011; Paradis & Blom, Reference Paradis and Blom2016; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978). This disparity for vocabulary and morphology may be due to actual variation in age effects; however, as we mention above, another source of variation is the type of task used across studies. For example, in previous studies, grammaticality judgements were also included in the lexical and morphological scores alongside the production tasks for these language measures. In contrast, the current study focused solely on production measures for vocabulary and morphology. While the different tasks are designed to draw on the same underlying linguistic knowledge, the specific extragrammatical processes involved in deploying this knowledge in tasks like grammaticality judgments and elicited production are likely to differ across tasks; these differences in task-specific demands may then result in variation in age effects (McDaniel & Cairns, Reference McDaniel and Cairns1990).
In general, the effect of age for syntax is consistent with previous studies (e.g., Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; contra Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011). This result suggests that the characteristics associated with older ages in childhood may be more important factors for the acquisition of syntax in sequential bilingualism as opposed to vocabulary and morphology; these characteristics include (a) more developed cognitive mechanisms (e.g., memory, attention, theory of mind, reasoning, and executive functioning skills including organising, planning and monitoring of information) and (b) more well-established linguistic experience and knowledge in both the L1 and the L2 (Paradis, Reference Paradis2007, Reference Paradis2011). These mechanisms may therefore be advantageous for interpretation of syntactic structures in childhood, in both the L1 and the L2 (e.g., Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Rothman et al., Reference Rothman, Long, Iverson, Judy, Lingwall and Chakravarty2016). This advantage is discussed further below, in the context of the linguistic input. In addition, older children on account of being in later stages of primary school may have more exposure to a number of syntactic structures delivered through the educational curriculum.
External Factors: Length of L2 Exposure
The external factor length of L2 exposure has predicted children’s L2 proficiency in previous studies across language domains (e.g., Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). This is generally consistent with our results, in which length of exposure predicted children’s accuracy for both vocabulary and morphology, including both third person singular and the past tense. The exception was the syntactic domain, where length of L2 exposure did not interact with the type of sentence structure. This pattern of results is particularly notable given that it is complementary to the pattern observed for chronological age: while syntax was predicted by age – i.e., a measure reflecting cumulative experience in both the L1 and the L2 and the development of general cognitive processes – children’s lexical and morphological proficiency was instead predicted by the overall quantitative input in the L2, above and beyond any effect of age.
This split between syntax on the one hand, and vocabulary and morphology on the other, reflects the more embedded properties of the former – all four structures (actives, passives, subject relatives and object relatives) are realized cross-linguistically, such that knowledge in the L1 may be transferred to the L2 (Greenberg, Reference Greenberg1963). This contrasts with vocabulary and morphology, where the possibility of transfer from the L1 and L2 may be more strongly dependent on the availability of shared forms across languages (Oksuz et al., Reference Oksuz, Alexopoulou, Derkach and Maria2022). In the current study with Arabic as the L1 and English as the L2, this overlap may be less prominent compared to languages within the same language family and/or with a higher degree of cognates, resulting in the overall effect of input for domains like vocabulary and morphology, but not for syntax.
The contrast between age and length of L2 exposure raises the question of the role of the Age of L2 Onset, which is derived from L2 exposure subtracted from age. As discussed above, due to this direct relation between the age of L2 onset and the length of L2 exposure, both factors could not be included in the same regression model. In general, age of onset effects are particularly reliable across a wide age range, especially in samples which include both child and adult L2 speakers. However, results have been mixed for sequential bilingual acquisition, where L2 onset is within the critical period (e.g., Blom & Bosma, Reference Blom and Bosma2016; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Unsworth, Reference Unsworth2016; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). Accordingly, correlations with age of onset were not observed between age of L2 onset and the language measures (Table 2). Future qualitative analyses of children’s error types may shed further light on the impact of age of L2 onset within this age range (e.g., Meisel, Reference Meisel2018).
The complementary distribution of age effects for syntax on the one hand, and length of exposure effects for vocabulary and morphology on the other, suggests that syntactic development in bilingualism is impacted more by either biological components related to maturation and cognition or by overall language input in both the L1 and the L2, rather than by the specific input in the L2. At the same time, syntactic development was also influenced by more fine-grained L2 input measures, discussed further in the following section.
External Factors: Richness of the L2 Environment
Like the length of L2 exposure, previous studies have observed an impact across language domains for the richness of the L2 environment, as measured by the degree of participation in L2 play and social activities (e.g., Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). Therefore, the hypothesis for this factor was that it would predict performance across language features and would therefore result in increased scores on vocabulary, morphology, and syntax for children with richer L2 language environments. Indeed, richer L2 environmental experiences did predict both vocabulary and syntax; however, they predicted L2 syntax only selectively and they did not predict L2 morphology. Thus, our prediction was only partially borne out, suggesting that fine-grained measures are selectively impactful across language features and domains. Conceivably, richer L2 experiences are providing the children with valuable opportunities to hear and use vocabulary and specific aspects of the L2 syntax.
These findings support previous research which associates peer play and social interaction with language development in bilingual and child L2 acquisition (e.g., Ervin-Tripp, Reference Ervin-Tripp, Scales, Almy, Nicolopoulou and Ervin-Tripp1991; Fassler, Reference Fassler1998; Piker, Reference Piker2013). It is likely that the play and social settings that the sequential bilingual children are participating in offers engagement with native English speakers who may be providing skilful language in the respective domains. This supports research which indicates that interaction with peers on its own is not enough for some language development. Rather, children require exposure to considerable amounts of ‘expert’ speakers (native or highly proficient speakers) for peer interaction to have an impact on language acquisition (Hoff, Reference Hoff2006).
External Factors: L2 Language Use at Home
In previous studies, the impact of L2 language use at home on children’s L2 proficiency has been mixed at best, and in general has not predicted L2 outcomes (e.g., Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). The use of the L2 at home was therefore not expected to predict children’s L2 proficiency in the current study. As expected, L2 language use at home was not associated with greater proficiency of any language measure and therefore our prediction was borne out. This supports previous research findings (e.g., Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017) and the view that input from non-native speakers who have a restricted fluency in the L2 in the home may have a negligible impact on the child’s L2 ability (Hoff, Welsh, Place, & Ribot, Reference Hoff, Welsh, Place, Ribot, Paradis and Grüter2014).
External Factors: Socioeconomic Status
In the current study, socioeconomic status was measured by way of maternal education. This measure has yielded mixed results in predicting children’s L2 proficiency across domains. Consistent with previous studies which have observed no relation with socioeconomic status, we observed that higher socioeconomic status was not associated with greater ability for any language measure.
One caveat of this conclusion is that the sample included a disproportionate number of mothers with higher level qualifications. Maternal education was calculated in terms of the highest qualification obtained, and the majority of participants’ mothers in the current study had advanced level qualifications which included an undergraduate degree, master’s degree or PhD level education (n=24). While some also had higher educational institution level qualifications (n=10), fewer (n=6) had lower level qualifications (including primary or secondary school level education) as their highest qualification. If maternal education is most impactful within populations which were less represented within the current study, then this impact may not have been possible to detect with the current study.
However, it is important to note that although maternal education is used as the measure for socioeconomic status in the current study and previous research, this measure may not be as valid for economic and social position in the current study: as many of the mothers had conceivably come to Northern Ireland as refugees, it may be the case that on arrival to Northern Ireland, and indeed for some time afterward, many may not have been able to secure employment comparable to their level of education or that which they had in their country of origin. Therefore, maternal education in the case of refugee families may not in fact equate to socioeconomic status, a discrepancy which has also been observed for research in the United States (Passel & Cohn, Reference Passel and Cohn2009).
External factors: Maternal L2 proficiency
In the current study, maternal L2 proficiency was measured by participants’ mothers’ ratings of their own L2 English levels on a scale between 1-5 (Table 1). Like maternal education, the factor of maternal L2 proficiency has yielded mixed results across language domains, producing a range of hypotheses for each language domain (Hammer et al., Reference Hammer, Komaroff, Rodriguez, Lopez, Scarpino and Goldstein2012; Chondrogianni & Marinis, Reference Chondrogianni and Marinis2011; Paradis, Reference Paradis2011; Sorenson Duncan & Paradis, Reference Sorenson Duncan and Paradis2020; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017). However, also like for maternal education, we found that maternal L2 proficiency was not associated with greater ability for any language measure.
The lack of impact found for this factor may again be due to the lack of variation across the mothers’ L2 proficiency scores, as participants’ mothers in the current study mostly had higher L2 English ability. Most mothers could either understand and use English adequately for work and most other situations (n=10) or could understand almost everything/were very comfortable expressing themselves in English (n=15). Therefore, as with maternal education, an impact of maternal L2 proficiency may be observed in a sample with greater variation in this factor.
Implications for theory and practice
Theory
Importantly, our findings suggest that there is a delineation between the impact of predicting factors depending on the language domain and features under investigation. This contrast is most notable for language features predicted by overall quantitative L2 input – i.e., L2 vocabulary and morphology – and those which are predicted by age – i.e., L2 syntax (particularly complex syntax). This complementary pattern across domains contrasts a range of studies which have observed homogenous impacts across domains; the current study contributes to this literature by teasing apart the impacts of different predicting factors across different language domains.
In addition, our analysis with chronological age and length of exposure, to the exclusion of age of onset, demonstrates how these factors are more useful for predicting L2 proficiency for sequential bilinguals in comparison to the measure of age of L2 onset as a predictor for more wide-ranging samples. While these latter studies garner support for a ‘younger is better’ perspective and critical period effects for language acquisition, the findings from our study are not consistent with this. We think the reason behind these differing results is that age of L2 onset is often confounded with input effects. In the present study, age of L2 onset was highly and negatively correlated with length of L2 exposure: with a later age of onset, children had less exposure to the L2. As discussed previously, this ‘younger is better’ perspective is generally consistent for broader age ranges (e.g., Johnson & Newport, Reference Johnson and Newport1991); however, the correlation between age of L2 onset and length of L2 exposure demonstrates how this relation may be misleading for narrower ranges as in the current study. Moreover, this may indicate that in previous studies where an age of onset effect has been found but length of exposure has not been accounted for, then this input variable may in fact be driving these results.
Practice
The results from the current study have important implications for teaching and supporting Newcomer children in educational settings. While the ‘earlier is better’ view for L2 acquisition is pervasive (Abrahamsson & Hyltenstam, Reference Abrahamsson and Hyltenstam2009; Johnson & Newport, Reference Johnson and Newport1989; Patkowski, Reference Patkowski1980), previous and more current research has demonstrated that older sequential bilingual children, at least in the initial stages of acquisition, have an advantage in acquiring language (e.g., Armon-Lotem et al., Reference Armon-Lotem, Walters and Gagarina2011; Blom & Bosma, Reference Blom and Bosma2016; Bohman et al., Reference Bohman, Bedore, Peña, Mendez-Perez and Gillam2010; Paradis, Reference Paradis2011; Paradis et al., Reference Paradis, Rusk, Duncan and Govindarajan2017; Rothman et al., Reference Rothman, Long, Iverson, Judy, Lingwall and Chakravarty2016; Snow & Hoefnagel-Hohle, Reference Snow and Hoefnagel-Hohle1978). This is often referred to as the rate advantage. While it has been proposed that those who begin earlier catch up and may surpass those who started acquiring the language at a later age (e.g., Jia & Fuse, Reference Jia and Fuse2007), this is often confounded with length of exposure effects.
In the current study, the sequential bilingual participants had a mean length of L2 exposure of 4;2 years. This suggests that the older age advantage found for syntax may hold for a considerable number of years and possibly within the children’s time at primary school. This could have implications on how sequential bilingual children can be best supported in educational settings. In this case, educators could harness the advantages associated with older ages by ensuring older sequential bilingual children have a wealth of opportunities for engagement with more language structures and features so that these can be practised to become established language features within their L2 language repertoire. This could have positive effects on sequential bilingual children’s academic skills (e.g., in writing where both complex and less complex language features are often utilised) which may ultimately have far-reaching consequences regarding future educational prospects (McEachron, Reference McEachron1998; Paradis, Reference Paradis2005). In addition, the impact of a richer L2 experience on L2 proficiency suggests that practitioners advocating for English as an additional language (EAL)/Newcomer children and their linguistic and educational outcomes or integration within school and the community may consider encouraging greater participation in L2 play and social activities which involve native speakers as a way to support vocabulary skills and development of sentence structures (Kane et al., Reference Kane, Kennedy, Sevdali, Folli and Rhys2019).
Limitations and Future Directions
The generalisability of the results is limited by the study’s sample size (n=40), as well as the absence of an L1 assessment. The sample size was constrained by the challenges of recruiting sequential bilingual children from one language background and age range in the specific location of the current study, with multiple assessments. To offset these limitations, each separate model in the analysis comprised a large number of observations/data points (Vocabulary: 2000; Morphology: 1200; Syntax: 1200); these numbers were achieved by collecting many observations from each child on each task.Footnote 1 Children’s responses were generally reliable within a given task (within individuals), even for measures with higher variance between individuals. Nevertheless, future research investigating this population and phenomena would benefit from a larger sample size to ensure the higher statistical power of the analysis, as well as a longitudinal analysis to investigate whether the older age advantage observed for L2 syntax holds over an extended course of time.
In addition, bilingual children are a heterogenous population, and their language abilities can be shaped by many different factors which can change over time and can also interact to form complex non-linear relationships (Paradis, Reference Paradis2023). Therefore, the statistical analysis used in the current study, which maximises prediction accuracy, cannot account for all the complex relationships between all the individual difference factors and the participants’ language proficiency. Paradis (Reference Paradis2023) and others (de Cat & Unsworth, Reference de Cat and Unsworth2023; Chondrogianni, Reference Chondrogianni2023) appeal for caution in establishing causal paths between predictor variables and outcomes when incorporating analyses which try to maximise prediction accuracy. Thus, while the analyses in the current study are suggestive of causal relations in theory, further evidence is needed to support these links in practice.
Finally, while the current study investigated several individual difference factors, other factors were not included. For example, the parent questionnaire was shortened from its original form in order to increase return rates; this resulted in the removal of questions about parental professions and household income, and these questions may be included in a future study with a stronger focus on SES. Moreover, a factor which more recent research has considered for refugee children is the relationship between language outcomes and socioemotional wellbeing (e.g., Soto-Corominas et al., Reference Soto-Corominas, Paradis, Al Janaideh, Vitoroulis, Chen, Georgiades, Jenkins and Gottardo2020). This is important for the current study, as a large proportion of the sample were likely refugees. This status may have impacted their attendance at school both in and outside of their home country with consequences in terms of literacy skills. In addition, many refugees may face key risk factors prior to resettlement, which can include exposure to violence, separation from family, and poverty. Challenges can endure long after resettlement as individuals may experience post-traumatic stress disorder, discrimination, and culture shock (Soto-Corominas et al., Reference Soto-Corominas, Paradis, Al Janaideh, Vitoroulis, Chen, Georgiades, Jenkins and Gottardo2020). Consequently, this population can experience many challenges to their wellbeing, and this factor of wellbeing (including confidence, self-esteem and interpersonal and social skills) can impact L2 proficiency (e.g., Han, Reference Han2010; McNally et al., Reference McNally, Darmody and Quigley2019; Soto-Corominas et al., Reference Soto-Corominas, Paradis, Al Janaideh, Vitoroulis, Chen, Georgiades, Jenkins and Gottardo2020; Hadfield et al., Reference Hadfield, Ostrowski and Ungar2017; Zins, Reference Zins2004). Including these factors in future research (i.e., wellbeing and refugee status) would therefore provide further insight on sequential bilingual acquisition.
Conclusion
This study aimed to contribute to the understanding of sequential bilingualism and the effect of individual differences by providing further insights on their impact across specific language domains and features by L1 Arabic speaking children. The study results suggested that for sequential bilingual children, overall exposure to the L2 is important for the development of vocabulary and morphology, while older age is related to better sentence structure proficiency especially in terms of complex syntax. In addition, participation in L2 play and social activities is associated with vocabulary and syntax.
A better understanding of sequential bilingualism and the factors which impact it has not only the potential to enlighten theory, but also practice in terms of helping educators deliver more appropriate activities and materials which ultimately better facilitate the educational progress and outcomes of EAL/Newcomer pupils (Paradis, Reference Paradis2011, Reference Paradis2023). Future research will further explore the relevant interactions between predicting factors, including causal relations and dynamic models which can account for more complex interactions, as well as real-world practical applications for EAL pupils in education settings.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S0305000924000497.
Acknowledgements
The authors would like to thank the children, parents and schools who took part in this research project. Research for this article was made possible by funding from the Department of Education and Learning, Northern Ireland.
Competing interest
The author(s) declare none.