INTRODUCTION
A great deal of research (summarized briefly below) has explored how the amount/nature of child-directed speech (CDS) might influence children's language outcomes. Other work has looked at the abilities that young infants bring to the table, such as the extensively studied ability to bootstrap language characteristics by locating individual words in fluent speech (also known as segmentation). These skills may themselves be impacted by the input to the child. But with few exceptions (e.g. Hurtado, Marchman & Fernald, Reference Hurtado, Marchman and Fernald2008; Weisleder & Fernald, Reference Weisleder and Fernald2013), investigations of infant skills and of the input children receive have been studied separately, without asking how their respective contributions might interact to influence children's vocabulary acquisition.
In this paper, we track the relationships among the lexical properties of CDS to a large sample (N = 121) of prelinguistic (7-month-old) infants, infant segmentation skill (or the ability to separate and identify those lexical items), and the children's vocabulary outcomes at age 2;0. We first discuss what is currently known about the impacts of CDS on children's language development. We then review what is known about infant speech processing skills, particularly segmentation, and its potential role in predicting children's later language abilities. Finally, we discuss a small body of recent research that reports interactions between children's linguistic experience and speech processing abilities before presenting the current investigation and its findings.
CDS and children's language learning
Adults speak differently to language-learning children than to more mature listeners (see Bernstein Ratner, Reference Bernstein Ratner2013; Soderstrom, Reference Soderstrom2007, for summaries); they also ‘fine-tune’ these speech adjustments to the child's perceived comprehension abilities (see Snow, Reference Snow, Fletcher and MacWhinney1995, for discussion). There are documented changes in infants' preferences for input (e.g. Hayashi, Tamekawa & Kiritani, Reference Hayashi, Tamekawa and Kiritani2001; McRoberts, McDonough & Lakusta, Reference McRoberts, McDonough and Lakusta2009), suggesting possible synergies between infant preferences, adult language modification, and benefits to child language learning. However, the long-term effect of variability in CDS on later communication outcomes is less evident.
Studies of the effect of CDS have explored a number of properties, ranging from phonetic properties (such as vowel space; Liu, Kuhl & Tsao, Reference Liu, Kuhl and Tsao2003), through grammatical components (e.g. Ambridge, Rowland, Theakston & Kidd, Reference Ambridge, Rowland, Theakston and Kidd2015), and pragmatic features (e.g. Hirsh-Pasek et al., Reference Hirsh-Pasek, Adamson, Bakeman, Owen, Golinkoff, Pace and Suma2015; Tamis-LeMonda, Bornstein & Baumwell, Reference Tamis-LeMonda, Bornstein and Baumwell2001; Tomasello & Farrar, Reference Tomasello and Farrar1986). In the present paper, we focus particularly on characteristics at the lexical level, as that level is one where children's own processing skills can be easily explored at the same age as parental input. Research suggests that input registers characterized by more verbosity and diversity appear to facilitate vocabulary development in toddlers and preschoolers. For example, Hoff and Naigles (Reference Hoff and Naigles2002) found that quantity of speech and lexical diversity predicted lexical achievement in two-year-old children (N = 63). Diversity of CDS vocabulary has been linked to outcomes at age two and beyond in middle-class families (Bornstein, Haynes & Painter, Reference Bornstein, Haynes and Painter1998; Hart & Risley, Reference Hart and Risley1995; Hoff-Ginsberg, Reference Hoff-Ginsberg1991; Huttenlocher, Haight, Bryk, Seltzer & Lyons, Reference Huttenlocher, Haight, Bryk, Seltzer and Lyons1991) and lower SES families (Pan, Rowe, Singer & Snow, Reference Pan, Rowe, Singer and Snow2005; Shimpi, Fedewa & Hans, Reference Shimpi, Fedewa and Hans2012). Increases in the number of child-directed words (tokens in the input) as well as types (unique words) and rare / less common words correlate with higher levels of child vocabulary (Rowe, Reference Rowe2012; Weizman & Snow, Reference Weizman and Snow2001). However, few studies have investigated CDS to infants under 14 months, or specific aspects of input that may be beneficial at this early age.
Although there is a general sense that more language input and thus more words leads to better language outcomes, different features of CDS may play a larger or smaller role at different points in development. Furrow, Nelson, and Benedict (Reference Furrow, Nelson and Benedict1979) argued that “what leads to a change in the system at one point in development may be completely ineffective at another point”, and Bohannon and Leubecker (Reference Bohannon and Leubecker1988) cautioned that CDS consists of “reciprocal, non-linear relations”. While input quantity is clearly important, other specific features of input are helpful over and above quantity, and the more we understand about these helpful features of input the more we can learn about the mechanisms involved in language learning (e.g. Rowe, Reference Rowe2012, Reference Rowe2015). Conceivably, while ‘more words’ in the input may generally be important, early in the first year of life repetition of words may be particularly useful, whereas lexical diversity is linguistically enriching later on. Indeed, McRoberts et al. (2009) reported that 6-month-old infants show preference for repeated, as opposed to novel, utterances. It also could be important how often parents provide their children with input that does not require segmentation to identify the words (e.g. 1-word utterances); Brent and Siskind (Reference Brent and Siskind2001) reported that exposure to a word in one-word utterances predicted the likelihood that the child would later use that word productively.
The role of processing in language learning
Most CDS studies have not tracked children's concurrent speech processing skills. A notable exception is Hurtado et al. (Reference Hurtado, Marchman and Fernald2008), who tracked twenty-seven children between the ages of 1;6 and 2;0; those whose mothers provided richer input at 1;6 (as estimated by automated counts of mothers' CDS by the LENA software system) had higher vocabulary scores and faster word recognition at 2;0. The authors concluded that enriched input influences lexical processing efficiency and that such improvements can work synergistically to advantage children. The team extended findings to twenty-nine Spanish-learning infants studied between ages 1;7 and 2;0 (Weisleder & Fernald, Reference Weisleder and Fernald2013). Richer input at 1;7 (again, as estimated by CDS words recorded by LENA) predicted children's vocabulary at age 2;0, and the speed with which the toddlers oriented to images when presented with a word in the laboratory. Thus, maternal input spurred vocabulary learning, and also led to improved capacity to attend to and process incoming speech information, which in turn was independently associated with better vocabulary outcomes.
Most CDS consists of multiword utterances (e.g. Aslin, Woodward, LaMendola & Bever, Reference Aslin, Woodward, LaMendola, Bever, Morgan and Demuth1996) without obvious signals demarcating word boundaries. Infants must somehow subdivide utterances into individual units (word segmentation). Early segmentation investigations familiarized infants with isolated monosyllabic target words (Jusczyk & Aslin, Reference Jusczyk and Aslin1995). Infants were then tested on their ability to recognize these same words in fluent speech stimuli. At 0;7·15 (but not 0;6, at least for English), infants demonstrated ability to segment words from fluent speech (see also Jusczyk, Houston & Newsome, Reference Jusczyk, Houston and Newsome1999). Segmentation shows a gradual developmental timecourse, and infants appear to use a variety of cues to segment (Jusczyk, Reference Jusczyk1999, and subsequent work). Thus, there is individual variability; similar-aged children may differ in abilities or strategies. For example, of twenty-four infants in the first experiment of the Jusczyk et al. (Reference Jusczyk, Houston and Newsome1999) study, only eighteen showed segmentation skill – in short, children regularly show non-uniform performance patterns.
Does failure to perform well in a laboratory-based segmentation or similar speech perception task have implications for later language development? Failure should presumably delay vocabulary acquisition, and a recent meta-analysis documents that infant speech perception performance predicts later language skill equally as well as non-linguistic measures, such as habituation/dishabituation and/or rapid auditory processing tasks (Cristia, Seidl, Junge, Soderstrom & Hagoort, Reference Cristia, Seidl, Junge, Soderstrom and Hagoort2014). Segmentation abilities are delayed in children with cognitive/linguistic deficits (Nazzi, Paterson & Karmiloff-Smith, Reference Nazzi, Paterson and Karmiloff-Smith2003), in line with the notion that segmentation could be a necessary precursor for typical language development. A retrospective analysis found that typically developing infants aged 0;7–0;8 who fail laboratory segmentation tasks have significantly poorer age 2;0 language outcomes than do successful infants (Newman, Bernstein Ratner, Jusczyk, Jusczyk & Dow, Reference Newman, Bernstein Ratner, Jusczyk, Jusczyk and Dow2006). Further, at age 4;0–6;0, those children who had failed laboratory segmentation tasks as infants demonstrated lower syntactic and semantic skills, but did not differ in general intellectual abilities, suggesting that infant segmentation may specifically predict language development. Singh, Reznick, and Xuehua (Reference Singh, Reznick and Xuehua2012) recently extended this finding. They prospectively correlated children's laboratory performance (on both simple and complex segmentation tasks) at 0;7·15 and their vocabulary outcomes at 2;0. Similarly, Junge and colleagues (e.g. Junge, Kooijman, Hagoort & Cutler, Reference Junge, Kooijman, Hagoort and Cutler2012) reported that infants with more advanced ERP patterns at 0;7 had larger language quotients at 3;0 and larger receptive vocabularies at 2;0. Thus, infants' failure on laboratory segmentation tasks may signal a language-processing developmental lag, rather than transient inattention during testing (see Cristia et al., Reference Cristia, Seidl, Junge, Soderstrom and Hagoort2014, for more extensive discussion).
The current study
Children's ability to segment CDS may predict later language outcomes. Similarly, the quantity of CDS available to be segmented (Bornstein et al., Reference Bornstein, Haynes and Painter1998; Hart & Risley, Reference Hart and Risley1995; Huttenlocher et al., Reference Huttenlocher, Haight, Bryk, Seltzer and Lyons1991) and lexical properties such as vocabulary diversity (Hoff, Reference Hoff2003; Pan et al., Reference Pan, Rowe, Singer and Snow2005; Rowe, Reference Rowe2012) lead to differential outcomes in language development. Yet no work to date has explored both lexical input and segmentation ability within the same cohort, as Cristia et al. (Reference Cristia, Seidl, Junge, Soderstrom and Hagoort2014) observe. We evaluate how these factors relate to one another and jointly predict later language skills, using a much larger cohort of children than in prior studies.
Specifically, our goals were:
-
1. To replicate findings that segmentation ability at 0;7 positively predicts vocabulary at 2;0;
-
2. To examine whether lexical aspects of maternal input at 0;7 predict child vocabulary at 2;0;
-
3. To determine whether segmentation ability and input simultaneously predict child vocabulary at age 2;0 and whether there is interaction between the two predictors.
METHODS
Participants
One hundred and twenty-one mother–child (58 male) dyads participated in a longitudinal project with multiple visits across a two-year span (initial goal had been 120 completed participants). Our sampling strategy and summary of attrition is described in Table 1. One hundred and seventy-three families were initially tested, but twenty-eight were excluded prior to one year of age because of family move or request for removal (n = 25); genetic disorder or health issues (n = 2); or only the father being free to participate (n = 1). Data from an additional ten participants were removed either because the child was not learning English as the majority language at some point during the study (some children began hearing English the majority of the time but their language exposure changed over the course of the study) or because the mother was not a native speaker of English. Finally, an additional fourteen children were later excluded because they failed to attend the final visit (n = 13) or had been identified with a language disorder during their second year (n = 1), leaving 121 participants.
Mothers averaged 17·1 years of education (range: HS diploma to PhD.); fathers had 17·0 years. Ethnicity was 79% Caucasian, 12% African-American, 4% Hispanic, 2·5% Asian, and 2·5% Native-American/Pacific-Islander. Age at first visit ranged from 0;6·28 to 0;8·8 (average 7·49 months). Age at the two-year visit ranged from 1;11·1 to 2;2·6 (average 24·3 months). Numbers of participants vary slightly across analyses as noted, due to sporadic incomplete data (e.g. audios of maternal input were distorted/uncodable for 2 participants, etc.).
Dyads participated in a range of tasks at multiple ages designed to address a variety of theoretical questions; the current paper explores those measures specifically focused on segmentation, maternal input, and vocabulary outcomes.
Segmentation task, age 0;7.15
The task was modeled after Jusczyk and Aslin (Reference Jusczyk and Aslin1995). Infants sat on their caregiver's lap in a three-sided test booth. An experimenter hidden behind the booth pressed a button on a response box to indicate direction of the child's looking behavior. At trial start, a center light in the front panel flashed. When the child responded, it was extinguished and a side panel light began to flash. Once the child looked towards that light, a sound file played from a hidden loudspeaker, continuing until the child looked away for two consecutive seconds, at which point the next trial began. Total amount of looking time to the flashing light was calculated as a measure of listening time; any time the infant spent looking away (2 s or less) was excluded.
There were familiarization and test phases. During familiarization, children heard repetitions of two words (bike and feet) on different trials, until they accumulated a total of 30 s of listening time to each word. During test phases, children heard sixteen trials (4 blocks of 4 trials). In each trial, children heard a six-sentence passage focused on one of four target words: bike, feet, cup, and goat (e.g. “The goat pranced across the field. He saw a really big goat. He patted his goat on the back.”; we used goat rather than dog, as Jusczyk and Aslin did, because dog engenders extreme preferences in some infants, unlike other words in the set, a concern for an individual-difference study; in addition, our location on a dialectal border could affect family production of that vowel in particular). Each block contained one trial for each passage, in random order. Average listening time to passages containing the two novel words was compared to trials containing familiarized words. Data were analyzed from all participants who completed at least three of the four blocks without fussing out (102 of 121 children). The experimenter and caregiver wore Peltor Aviation headphones playing masking music to block hearing the stimuli and biasing the child's behavior or subsequent coding.
Input assessment, age 0;7.15
After the segmentation task, dyads played with a standard set of toys, including plush animals, baby doll with accessories, food-/eating-/cooking-related items, and board books. Mothers were asked to play with their children as they would normally at home. They were informed that they were being video- and audio-recorded, and asked to wear an Audio-Technica AT 8531 lavalier microphone connected to a Marantz PMD 660 solid-state digital recorder; mothers were told that while our intent was to record the child, they should wear the microphone so that their child would not play with it. Sessions lasted approximately 15 minutes. All recordings were transcribed via sonic CHAT mode using TalkBank utilities (MacWhinney, Reference MacWhinney2000). Transcripts were analyzed using CLAN for total amount of input (tokens), vocabulary diversity (types), and repetitiveness of the input (type–token ratio, or TTR), and for the percentage of utterances that were a single word in length.
Vocabulary assessment, age 2;0
Children returned at approximately two years of age. At this visit, parents completed the MacArthur-Bates Communicative Development Inventory (Fenson et al., Reference Fenson, Dale, Reznick, Thal, Bates, Hartung and Reilly1993). Children's vocabulary scores on the MCDI at age 2;0 varied widely. Three children had scores below 50 words, suggesting possible clinically relevant delay and were excluded from further analyses; MCDI scores were unavailable for one additional child, resulting in a final sample of n = 117. Mean MCDI score was 324·2 (SD = 157·7, range = 62–664), comparable to original norming study scores (Fenson, Dale, Reznick, Bates, Thal & Pethick, Reference Fenson, Dale, Reznick, Bates, Thal and Pethick1994; mean = 312 words, SD = 174). MCDI scores were normally distributed in this sample.
RESULTS
We first sought to replicate our previous retrospective finding (Newman et al., Reference Newman, Bernstein Ratner, Jusczyk, Jusczyk and Dow2006) that infants' ability to segment fluent speech predicted later vocabulary skill. Of the 117 participants with usable MCDI scores, 21 did not complete the laboratory task (fussed out (n = 15), refused to look towards the flashing lights (n = 3), or equipment failure (n = 3)). Thus, 96 participants were available for analysis; looking time was normally distributed. The earlier study split children into two groups based on vocabulary, looking only at the top and bottom 15%; however, we had a smaller sample, and thus included all children, splitting into two equivalent groups. We compared children with higher (MCDI > 300) and lower (MCDI < 300) vocabulary scores. Children in the upper-MCDI group showed a greater novelty preference on the segmentation task than children in the lower-MCDI group (t(95) = 2·24, p < .05), thus replicating our previous finding of a relationship between segmentation and later outcomes. (Correlations across MCDI scores showed a similar relationship: r(95) = 0·25, p < .02) However, the direction of this relationship was different from the previous study: we found that novelty, not familiarity, preferences led to larger vocabulary scores, a point we return to in the ‘Discussion’.
We next asked whether maternal input at child age 0;7·15 predicts MCDI scores at 2;0. Maternal input was unavailable for two dyads due to poor recordings; thus, this analysis included 115/117 families with usable MCDI scores (we included children who did not complete the segmentation task successfully). CLAN was used to compute maternal tokens, types, and TTR in 7-month-directed CDS, and the percentage of utterances that were a single word in length. Maternal tokens ranged from 75 to 1566 (M = 758·8). Maternal types ranged from 39 to 392 (M = 221·8). Maternal TTR ranged from 0·18 to 0·62 (M = 0·321). Lower TTR indicates more repetition. Mothers varied in the percentage of one-word utterances from 0·03 to 0·48 (M = 0.20). All four of the input measures were normally distributed and did not require transformation. Children's two-year MCDI scores were significantly positively correlated with mothers' tokens (r = 0·23) and types (r = 0·20), and significantly negatively correlated with TTR (r = –0·28). That is, children's vocabulary scores were higher if their mother produced more words, and more different words, but also if their mothers used more repetition. Children's two-year MCDI scores did not correlate significantly with the percentage of one-word utterances that mothers used (r = –0·08). TTR had the strongest association with the MCDI scores, thus we opted to use TTR as the input measure for further analysis. Unsurprisingly, maternal tokens, types, and TTR are all significantly inter-correlated. Interestingly, controlling for types, the relation between TTR and MCDI is still significant (r = –0·20, p < .05), but when controlling for tokens, it is not (r = –0·15, p = ·11).
Are infants' abilities to segment speech at 0;7·15 and maternal repetitiveness at that same age related? Segmentation ability might already, even at this early age, be affected by the input children receive. The two measures marginally correlate (r(92) = 0·19, p < .10); thus, in this sample, there is a tendency for infants showing a novelty preference on segmentation tasks to hear more repetition from mothers. We return to this finding when we model the relationship among these variables and child language outcomes.
We last asked whether child segmentation ability and maternal input, when considered simultaneously, explain unique variation in child vocabulary at age 2;0. We conducted multiple regression analyses using infant novelty preference and maternal input (TTR) as predictors of children's MCDI scores (all as continuous measures) at age 2;0 (see Table 2). Although it is unclear whether segmentation results from a study using the head-turn preference procedure task should be measured continuously (looking time) or categorically (looking longer to the appropriate item vs. not doing so), this choice does not change the pattern of results. Data include the ninety-four participants having segmentation, TTR, and MCDI data available. Model 1 (Table 2) shows the independent significant effect of child segmentation ability at 0;7·15 on MCDI scores at age two. In this model, infants' segmentation ability explains 6·1% of variation in toddler vocabulary skills. Model 2 presents the independent significant effect of maternal repetitiveness at 0;7·15 on child MCDI scores at age two. In this model, maternal input at 0;7·15 explains 7·0% of the variation in two-year-old child vocabulary. Model 3 includes both infant segmentation ability and maternal input as simultaneous predictors. In this model, both predictors remain significant and the amount of variation explained increases to nearly 11%. Thus, when considered together, both child segmentation ability and parent repetitiveness at child age 0;7·15 are significantly related to children's vocabulary skill at 2;0, as measured on the MCDI. A change in R-square statistic was calculated to determine whether Model 3 was a significantly better fitting model than Models 1 or 2, despite the inclusion of an additional predictor. The comparisons resulted in Partial F statistics of 4·96 (Model 3 vs. 1) and 4·03 (Model 3 vs. 2), which are both greater than the corresponding alpha = ·05 critical F-value of 3·95. Thus, Model 3, which contains the effects of both infant segmenting and maternal input, is a significantly better fitting model than Model 1 or 2. In Model 4, we added the interaction term between segmentation ability and input to determine whether the effect of input differed based on segmentation ability. The interaction was non-significant. As a final step, not shown in the table, we considered potentially important control variables. Neither parent education (fairly homogenous in this sample), nor child gender related to MCDI scores at age 2;0, with TTR and segmentation ability in the model, and thus were not retained in the analysis. We also examined the joint impact of the percentage of one-word utterances and segmentation ability, and their interaction, on vocabulary outcomes, in case the percentage of one-word utterances was particularly important in children whose segmentation skills were weak, but found no relationships. Thus, our final model is Model 3, suggesting that both infant skills (segmenting) and repetitiveness in the input (TTR) at 0;7·15 contribute to children's later productive vocabulary size. Residuals from this model were examined and did not violate any of the regression assumptions.
notes: * p < .05; ** p < ·01.
DISCUSSION
Most children acquire their native language successfully and rapidly over a very short time frame in the first years of life. But some children do not acquire language as well as others, and we are as yet unable to predict which children will be more or less successful language learners. We examined the potential contributing roles of two factors at age 0;7·15: (i) infants' ability to break up fluent speech into individual words (segmentation); and (ii) the input that children hear from their mothers. Each has been argued to be a critical underpinning for language acquisition; however, most work on the role of input investigates CDS to children older than 0;7·15. Both maternal input (repetitiveness) and speech segmentation skills at 0;7·15 predicted child language outcomes at 2;0; moreover, while these factors were somewhat related to one another, they each had independent effects on toddler vocabulary skill.
These results support prior retrospective and prospective work (e.g. Newman et al., Reference Newman, Bernstein Ratner, Jusczyk, Jusczyk and Dow2006; Junge et al., Reference Junge, Kooijman, Hagoort and Cutler2012; Singh et al., Reference Singh, Reznick and Xuehua2012), suggesting that children's laboratory performance may reveal individual speech processing skills. Moreover, this ability contributed significantly to vocabulary outcome scores even once maternal input was factored in, suggesting that segmentation skills are not isomorphic with input patterns, though they may share a partially mediated relationship. Rather, this segmentation task may measure children's processing abilities, independent of their environmental exposure. Clearly, future work should explore this more fully by examining parental input prior to 0;7.15, our earliest datapoint. Because at this age both segmentation ability and maternal input appeared to contribute independently to the child's language profile at age two, the directionality of this association is unclear. Processing skill and input style may be independent, or interact in ways currently unknown, such that maternal input style may have given our young infant participants benefits in facing segmentation task challenges, or that infants with better segmentation skills could better capitalize on a particular maternal style of input. McRoberts et al. (Reference McRoberts, McDonough and Lakusta2009) suggest that “attention to repeated utterances precedes word segmentation and statistical cues in continuous speech”. This suggests that repetition in infant-directed speech actually primes the child's ability to segment. In McRoberts et al., six-month-old children (younger than our youngest subjects) particularly showed preference for a more repetitive infant-directed signal. Further, our analyses found a marginal correlation between CDS repetitiveness and child segmentation ability. Additional longitudinal research, starting at younger ages, may enlighten the directionality of the relationship between infant segmentation ability and experience with particular profiles of CDS in their environment.
In the current study, infants who showed novelty preferences in the segmentation task showed larger later vocabularies, rather than infants who showed familiarity preferences, as suggested in our prior work. The interpretation of novelty vs. familiarity preferences in infants is a matter of long-standing concern (Burnham & Dodd, Reference Burnham and Dodd1998; Houston-Price & Nakai, Reference Houston-Price and Nakai2004; Slater, Reference Slater2004). Burnham and Dodd note that “From one point of view, it can be argued that the valence of the preference is unimportant, for as long as there is a preference, then discrimination is demonstrated” (p. 174). Cristia et al. (Reference Cristia, Seidl, Junge, Soderstrom and Hagoort2014) briefly discuss the problem of understanding which type of preference is predictive of later outcomes, given various ages and tasks. Here, we were not testing discrimination per se, but evaluating how individual differences in the ability to make a discrimination (between familiarized and unfamiliarized words) predicted later abilities. Many factors influence whether a particular infant shows novelty vs. familiarity preferences (e.g. stimulus complexity, familiarization time). In general, children seem to show novelty preferences when they are more ‘bored’ with the stimulus, because it is inherently less interesting / simpler, because it has been presented more times, or because they have completed processing. This suggests that more cognitively advanced infants should be more likely to show novelty preferences, since they will process more quickly, in line with the current results (where infants showing a stronger novelty preference were more advanced in vocabulary acquisition rates). Nonetheless, the variation across studies is of concern, and may encourage the use of other paradigms in future work, such as ERPs (as in Junge et al., Reference Junge, Kooijman, Hagoort and Cutler2012), where such shifts over time or development are less likely to occur. That said, there were also some important differences between this study and prior work. In particular, all the children in the current experiment participated in the same segmentation task – one that focused on monosyllabic words spoken by the same talker. In contrast, the Newman et al. (Reference Newman, Bernstein Ratner, Jusczyk, Jusczyk and Dow2006) paper retrospectively examined a large set of children who had participated in a variety of segmentation studies, and collapsed data across these different studies. The studies themselves included ones in which the talker varied between training and test, where the words were multisyllabic, or where there was a limited set of acoustic cues available. In other words, the current study used a simpler (likely easier) segmentation task – this may help explain why novelty preference, rather than familiarity preference, was seen here.
Cristia et al. (Reference Cristia, Seidl, Junge, Soderstrom and Hagoort2014) note that the infant's linguistic environment cannot be ignored in trying to link speech perception task performance and children's later linguistic skills. In the current study, both the input and the child's processing proclivities appear to work synergistically to provide benefit to the language-learning process. This may explain, in part, why, for input to aid in child language acquisition, it must be child-addressed, and not merely overheard (Shneidman & Goldin-Meadow, Reference Shneidman and Goldin-Meadow2012; Weisleder & Fernald, Reference Weisleder and Fernald2013). Our findings also confirm that input addressed to quite young infants, who show rudimentary comprehension in naturalistic conversations, and only prelinguistic levels of verbal expression, does have a measurable impact on the rate of language development over the first two years of life. At these early stages, it is access to repeated instances of words that appears to facilitate mapping of lexical targets.
There were no effects of either gender or parent education on children's outcomes once input and segmentation abilities were entered into the equation. Lack of an SES effect may reflect limited diversity in our sample; only eleven mothers had less than a college education, with only one without some education post-high school. The fact that profiles of interaction even within this rather privileged sample appeared to influence children's outcomes at age two is significant for its ramifications on the quality of child care (particularly adult–infant ratio) in less advantaged communities, for children under the age of one year. However, the lack of an effect of gender may be more surprising.
We also found no effect of the percentage of one-word utterances that parents used; mothers' use of single-word utterances might have been expected to interact with children's segmentation ability, in that children with weaker segmentation skills might benefit from parents who provided already segmented input. That did not appear to be the case, however. These findings also seem to contradict those of Brent and Siskind (Reference Brent and Siskind2001), who found a relationship between mothers' use of words in isolation and children's vocabulary acquisition. However, Brent and Siskind examined the relationship between individual words being produced in isolation, and the likelihood of those particular words being learned by children; in contrast, we calculated the proportion of maternal one-word utterances overall, and related that to the size of children's productive vocabulary, without attempting to identify whether there was specific overlap among word sets.
Critically, we only explored vocabulary outcomes within typically developing children; although we identified children whose vocabulary scores satisfy a diagnosis of Specific Emergent Language Delay (a precursor to specific language impairment, or SLI), our sample included only three such children. This was not a sufficient number to explore possible predictors for this group, and thus we excluded them from our analysis. However, one direction for future work would be to explore aspects of both nature and nurture that could predict those children most at risk for later language difficulties.
In the end, our data provide further evidence for the role of the child and the role of environmental (i.e. input) effects in early vocabulary development. Both are critically important to the eventual outcome. Moreover, our finding of specific lexical input profiles linked to higher levels of vocabulary knowledge is of extreme practical importance, since vocabulary both measures and enables scholastic achievement (Snow, Burns & Griffin, Reference Snow, Burns and Griffin1998). It appears that the repetitiveness of the vocabulary input with 7-month-old children is a key factor. Finding an association between input profiles at an early age and later language skill enables us to guide parents, caretakers, and childcare policy to maximize children's language and educational potential.
ACKNOWLEDGEMENTS
This work was supported by NSF grant BCS 0745412 to the University of Maryland. The authors thank Lisa Tuit, Sean Hendricks, and Amelie Bail for conducting much of the testing of the assessments at age 2;0, and Devon Brunson, who performed most of the scheduling. We particularly thank Giovanna Morini for helpful oversight and commentary, and Kelly Hartman, Kerry McColgan, Julia Sampson, Jenna Poland, Christina Royster, and Anna Synnesvedt for coding oversight. The authors also thank the following students for assistance in recruiting and scheduling families or transcribing test sessions: Katrina Ablorh, Candace Ali, Saher Ali, Alison Arnold, Megan Askew, Catherine Bender, Taryn Bipat, Michelle Cass, Danielle Chazen, Alyssa Cook, Jennifer Coon, Sara Davis, Justine Dombroski, Sara Dougherty, Cathy Eaton, Sara Edelberg, Daniel Eisenberg, Meaghan Ervin, Lauren Evans, Andrea Farina, Josefina Fernandez, Lauren Fischer, Andrea Fisher, Arielle Gandee, Richard Garcia, Whitney Goodrich-Smith, Eliana Groskin, Natalie Hein, Laura Horowitz, Tim Howell, Megan Janssen Crenshaw, Mina Javid, Amanda Jensen, Jamie Karen, Caroline Kettl, Michelle Keenan, Esther Kim, Stephanie Lee, Perri Lieberman, Rachel Lieberman, Danielle Lindenger, Rachel Lipinski, Katie Lippitt, Debbie Martinez, Jenn McCabe, Eileen McLaughlin, Kelly McPherson, Debra Mirazhi, Vidda Moussavi, Molly Nasuta, Ashley Nimmo, Courtenay O'Connor, Sabrina Panza, Elise Perkins, Amanda Pasquarella, Lauren Polovoy, Rachel Ports, Rachel Rhodes, Allie Rodriguez, Maria Rodriguez, Judith Segal, Katie Shniderman, Veronica Son, Mara Steinberg, Sarah Steele, Justine Taweel, Allison Temple, Dena Tran, Hillary Tyler, Eugene Vassilas, Susan Veppumthara, Krista Voelmle, Chelsea Vogel, Amanda Wildman, Cavena Williams, Kimmie Wilson, Catherine Wu, Donna Zack-Williams and Michelle Zobel.