Developmental science is predicated on the paradigm that early periods of life are meaningful to later periods. Its core principles concern the development and expression of individual differences and the moderating role of environmental influences across multiple domains that include behavior, physiology, and cognition. Over the past decade or so, the role of the period before birth has received much attention as efforts to identify its influence on postnatal life have intensified. Much of this initial work was generated by epidemiologic studies of health and well-being in adulthood that were retrospectively linked to perinatal circumstances, primarily size at birth (Barker, Reference Barker1998; O'Brien, Wheeler, & Barker, Reference O'Brien, Wheeler and Barker1999). However, interest in the prenatal period among developmentalists is not new. The foundational Fels Longitudinal Study initiated in 1929 included a cohort of pregnant women (Sontag & Wallace, Reference Sontag and Wallace1934) and examined the role of the intrauterine and extrauterine environment on the development of the fetus (Sontag, Reference Sontag1941). The invention of technology to better view and monitor the fetus quickly revealed that toward the end of gestation behaviors and other features of developmental function that are routinely measured in the neonate and infant neither originate at term gestation nor emerge in response to birth (Als, Reference Als1982; Prechtl, Reference Prechtl and Prechtl1984). In the mid-1990s, the National Institute of Child Health and Development convened a series of conferences so that neurologists, developmental psychobiologists, developmental psychologists, and obstetricians could share nascent information with the goal of advancing the field of prenatal development and understanding its implications for postnatal life (Krasnegor et al., Reference Krasnegor, Fifer, Maulik, McNellis, Romero and Smotherman1998).
The construct of “fetal programming” has been generated from the most recent wave of work that examines the influences brought to bear on the fetus and features of subsequent development or health. Although we take exception to the use of the term “programming” in application to complex human development outcomes that are multiply determined, there is no doubt that this perspective invigorated interest in the role of the earliest period of development. Nonetheless, the focus of much recent work is not on the fetus per se, but on the maternal and environmental factors that may affect development of one or more fetal organ systems, including the central nervous system. Detection of association with these exposures and subsequent features of health or development in infancy, childhood, or beyond are thus assumed to be attributed to their effect on the fetus and its gestational environment.
Our research program, which began in 1991, has taken a different approach to the study of early origins by developing and applying methodology to measure fetal neurobehavioral development and contemporaneous environmental exposures. The term “neurobehaviors” applies to those features of basic neural functioning that are phenotypic expressions of the processes that underlie the development and expression of autonomic and behavioral regulation (Brazelton, Reference Brazelton1984). Despite advances in current technologies, the fetus remains relatively inaccessible and fetal neurobehavioral research is limited by what is measurable. The field has generally concentrated on four aspects of fetal function: autonomic (i.e., heart rate and its variation); somatic (i.e., motor activity and patterning); state development and regulation (i.e., coalescence between heart rate and motor activity patterns); and fetal reactivity to stimuli, based on autonomic and somatic responses. All four domains have been shown to develop in predictable ways over gestation (DiPietro et al., Reference DiPietro, Caulfield, Costigan, Merialdi, Nguyen, Zavaleta and Gurewitsch2004; DiPietro, Costigan, & Voegtline, Reference DiPietro, Costigan and Voegtline2015; DiPietro, Hodgson, Costigan, Hilton, & Johnson, Reference DiPietro, Hodgson, Costigan, Hilton and Johnson1996), including a well-established developmental discontinuity between approximately the 28th and the 32nd gestational week in virtually all aspects of fetal neurodevelopment. For continuous measures, this is expressed as steeper gradient of development up to this gestational age range followed by a slowing of developmental rate through the remaining months of gestation (DiPietro et al., Reference DiPietro, Costigan and Voegtline2015). This is also reflected in specific behaviors and capabilities, including fetal breathing movements (Roodenburg, Wladimiroff, van Es, & Prechtl, Reference Roodenburg, Wladimiroff, van Es and Prechtl1991), responsiveness to an external vibrating stimulus (Buss et al., Reference Buss, Davis, Class, Gierczak, Pattillo, Glynn and Sandman2009), and habituation (Groome, Gotlieb, Neely, & Waters, Reference Groome, Gotlieb, Neely and Waters1993). Patterning of fetal motor activity, heart rate variability, and eye movements undergoes progressive consolidation commencing at about this time, resulting in functional expression as fetal behavioral states corresponding to rudimentary sleep-wake cycles closer to term (Nijhuis et al., Reference Nijhuis, ten Hof, Nijhuis, Mulder, Narayan, Taylor and Visser1999; Pillai & James, Reference Pillai and James1990). Fetal development is predicated on hierarchical mastery beginning with autonomic control and culminating in interaction with the environment (Als, Reference Als1982). Autonomic differentiation both expresses and contributes to developing sympathetic and parasympathetic processes, thereby establishing the basis for reactivity and regulation to endogenous and exogenous stimuli. Both terms are foundational constructs underpinning temperament theory (Goldsmith et al., Reference Goldsmith, Buss, Plomin, Rothbart, Thomas, Chess and McCall1987; Rothbart & Ahad, Reference Rothbart and Ahad1994).
Remarkable also when considering the fetus is that it serves in an essentially parasitic relationship within another developing human. Figure 1 illustrates our reinvigoration of Als's earlier work (1982), presenting a similar hierarchical structure of fetal neurodevelopment but within a framework of mutual and spiraling engagement between the pregnant woman and fetus. It is difficult, if not impossible, to stimulate the fetus directly without maternal awareness. Instead investigators explicitly rely on inducing maternal physiological activation through the use of experimental manipulations designed to be psychologically challenging or emotionally evocative to generate reactivity in the fetus (Araki et al., Reference Araki, Nishitani, Ushimaru, Masuzaki, Oishi and Shinohara2010; Copher & Huber, Reference Copher and Huber1967; DiPietro, Ghera, & Costigan, Reference DiPietro, Ghera and Costigan2008; Monk et al., Reference Monk, Sloan, Myers, Ellman, Werner, Jeon and Fifer2004). Conversely, spontaneous fetal motor activity inspires maternal physiological reactivity (DiPietro et al., Reference DiPietro, Caulfield, Irizarry, Chen, Merialdi and Zavaleta2006), and stimulating the fetus directly with sound also elicits a maternal response, presumably through evoked fetal behavior (DiPietro et al., Reference DiPietro, Voegtline, Costigan, Aguirre, Kivlighan and Chen2013).
From Fetus to Child
The expectation that there are aspects of individuals that endure over time is a given in developmental sciences, yet understanding of which characteristics endure, whether they are the same across individuals, and how to best measure or otherwise detect them has been a challenging endeavor. Various terms, including continuity and stability, have been used (sometimes interchangeably) to describe the preservation of individual differences (i.e., relative or rank ordering of an attribute within a group) across periods of development (Bornstein & Suess, Reference Bornstein and Suess2000; Caspi, Reference Caspi, Damon, Lerner and Eisenberg1998). Moreover, because the nature of how an underlying attribute is expressed changes as an individual's developmental repertoire expands during maturation, measurement may be homotypic or heterotypic (Putnam, Reference Putnam2011). Heart rate, for example, is a homotypic attribute as it can be measured in similar units over time, but activity level is a heterotypic attribute as the fetus does not locomote, so its expression is necessarily different in the fetus and 3-year-old child. Moreover, both heart rate and motor activity may be viewed as markers of neurologic maturation, particularly when gestational age at the time of measurement is controlled, and thus may be used for heterotypic prediction.
A second consideration in the identification of individual differences involves whether to measure physiological parameters or behaviors during a baseline window of observation in an effort to determine tonic or normative function or perturbing the system to evoke reactivity and recovery, thereby providing some degree of equivalency in context across individuals (Blair, Peters, & Granger, Reference Blair, Peters and Granger2004; Doom & Gunnar, Reference Doom and Gunnar2013; Planalp, van Hulle, Gagne, & Goldsmith, Reference Planalp, van Hulle, Gagne and Goldsmith2017; Porges, Doussard-Roosevelt, Portales, & Greenspan, Reference Porges, Doussard-Roosevelt, Portales and Greenspan1996). Both approaches have strengths and limitations, but at the core of each is the tenet that individual differences in self-regulation, expressed as both reactivity to stimuli and recovery from that arousal, provide the constitutionally based substrate for broader features of temperament, allowing for refinements over time in conjunction with experience and maturation (Goldsmith et al., Reference Goldsmith, Buss, Plomin, Rothbart, Thomas, Chess and McCall1987; Rothbart & Derryberry, Reference Rothbart, Derryberry, Lamb and Brown1981).
Information on the degree to which characteristics of fetal functioning predict to characteristics of the infant or child is scant. Baseline fetal heart rate and variability are the most stable characteristics during gestation and remain correlated with infant heart rate and variability through at least the first year of life (DiPietro, Costigan, Pressman, & Doussard-Roosevelt, Reference DiPietro, Costigan, Pressman and Doussard-Roosevelt2000; Lewis, Wilson, Ban, & Baumel, Reference Lewis, Wilson, Ban and Baumel1970), and a small but statistically significant association with 10-year-old children has been reported (Thomas, Haslum, MacGillivray, & Golding, Reference Thomas, Haslum, MacGillivray and Golding1989). Significant associations with fetal heart rate have been reported between infants with the highest and the lowest reactivity thresholds to novelty (Snidman, Kagan, Riordan, & Shannon, Reference Snidman, Kagan, Riordan and Shannon1995). Fetal heart rate has been linked with maternally reported infant emotional tone (DiPietro, Hodgson, Costigan, & Johnson, Reference DiPietro, Hodgson, Costigan and Johnson1996) and positive reactivity (Werner et al., Reference Werner, Myers, Fifer, Cheng, Fang, Allen and Monk2007). Fetal heart rate variability predicts performance on developmental assessments including Bayley psychomotor scores at 18 months (Ratcliffe, Leader, & Heller, Reference Ratcliffe, Leader and Heller2002) and both Bayley mental and psychomotor scores, as well as language development and symbolic play, through the 3rd year of life (Bornstein et al., Reference Bornstein, DiPietro, Hahn, Painter, Haynes and Costigan2002; DiPietro, Bornstein, Hahn, Costigan, & Achy-Brou, Reference DiPietro, Bornstein, Hahn, Costigan and Achy-Brou2007).
With respect to motor activity, more active fetuses tend to become more active neonates (Groome et al., Reference Groome, Swiber, Holland, Bentz, Atterbury and Trimm1999), infants (Degani, Leibovitz, Shapiro, & Ohel, Reference Degani, Leibovitz, Shapiro and Ohel2009), and toddlers (DiPietro, Bornstein, et al., Reference DiPietro, Bornstein, Costigan, Pressman, Hahn, Painter and Yi2002), although the latter finding was true only for boys. Fetuses that are more active score higher on behavioral and neurological indicators of motor maturity as neonates (DiPietro et al., Reference DiPietro, Kivlighan, Costigan, Rubin, Shiffler, Henderson and Pillion2010) and infants (Richards & Newbery, Reference Richards and Newbery1938). Fetuses that display consistently high levels of motor activity are rated by mothers as more fussy, unadaptable, and unpredictable through 6 months (DiPietro, Hodgson, Costigan, & Johnson, Reference DiPietro, Hodgson, Costigan and Johnson1996). Several studies have also documented consistency in aspects of fetal state organization with infant sleep (DiPietro, Costigan, & Pressman, Reference DiPietro, Costigan and Pressman2002; DiPietro, Hodgson, Costigan, & Johnson, Reference DiPietro, Hodgson, Costigan and Johnson1996; Groome et al., Reference Groome, Singh, Bentz, Holland, Atterbury, Swiber and Trimm1997). The longest follow-up of the predictive validity of fetal measures to date reported that near-term fetuses exhibiting more mature transitions between behavioral states were reported by mothers to have better effortful control in late childhood and early adolescence (van den Bergh & Mulder, Reference van den Bergh and Mulder2012) despite the small sample size (n = 25). This circumscribed set of findings suggests modest consistency from the prenatal to postnatal periods within specific developmental domains despite the wide contextual variation in which fetal and child measurements are taken.
The literature on fetal reactivity as a predictive construct is even smaller. Two studies have linked fetal responsiveness to maternal physiological arousal, induced through exposure to psychologically challenging or emotionally evocative stimuli, with infant emotion regulation. Greater fetal heart rate responsiveness to a cognitive challenge (i.e., Stroop Color–Word task) presented to pregnant woman predicted greater motor reactivity to a standard novelty paradigm and a trend for greater maternally reported infant negativity at 4 months (Werner et al., Reference Werner, Myers, Fifer, Cheng, Fang, Allen and Monk2007). Similarly, fetuses displaying greater heart rate reactivity (as well as motor reactivity) to maternal viewing of a labor and delivery film were more irritable infants in response to the manipulations encountered in a neurodevelopmental exam administered 6 weeks after birth (DiPietro et al., Reference DiPietro, Ghera and Costigan2008). A recent report notes that lower fetal heart rate variability in response to recorded speech is associated with reduced neurobehavioral maturation in neonates (Figueiredo, Pinto, Pacheco, & Field, Reference Figueiredo, Pinto, Pacheco and Field2017). Greater fetal heart rate reactivity to a vibrating device placed on the maternal abdomen has also been associated with higher maternal ratings of fussy/difficultness in early infancy (DiPietro, Hodgson, Costigan, & Johnson, Reference DiPietro, Hodgson, Costigan and Johnson1996).
The Current Studies
This report will extend this limited knowledge base by examining whether tonic and reactive components of fetal functioning are prospectively associated with outcomes in late childhood. Based on the modest but provocative set of findings described above and in concert with the view that individual differences are first manifest in the fetal period, two studies were conducted. The first is based on neurobehavioral and physiological data generated from over 300 maternal–fetal pairs collected during an undisturbed condition at multiple gestational ages; the second utilizes fetal neurobehavioral reactivity and regulation data collected at 24 and 36 weeks from a subset of those cases. We focus on the most conspicuous and measurable facets of fetal neurobehavior: heart rate and motor activity.
Study 1 examines whether fetal neurobehaviors portend child temperament. The temperament dimension of behavioral inhibition, or the tendency to withdraw and behave warily to new people, objects, and situations, was selected because it is perhaps the most extensively studied temperament dimension with well-documented longitudinal stability from early life and underlying physiological correlates (Fox, Snidman, Haas, Degnan, & Kagan, Reference Fox, Snidman, Haas, Degnan and Kagan2015; Kagan, Reznick, & Gibbons, Reference Kagan, Reznick and Gibbons1989; Kagan, Snidman, Kahn, & Towsley, Reference Kagan, Snidman, Kahn and Towsley2007). Behavioral inhibition is expressed in infancy and early childhood as a characteristic pattern of negative reactivity to novelty; continued behavioral inhibition toward people, both familiar and unfamiliar, is subsequently characterized as shyness. Studies using both laboratory and parental-report measures demonstrate that early behavioral inhibition is predictive of subsequent childhood shyness (Fox, Henderson, Marshall, Nichols, & Ghera, Reference Fox, Henderson, Marshall, Nichols and Ghera2005; Volbrecht & Goldsmith, Reference Volbrecht and Goldsmith2010), which further confirms temperament stability given that the operational definitions vary with age-dependent developmental capabilities. Although somewhat less commonly investigated, uninhibited behavior has also been investigated as a temperament construct with exuberance as the developmental counterpoint to shyness in later childhood (Dollar, Stifter, & Buss, Reference Dollar, Stifter and Buss2017; Stifter, Putnam, & Jahromi, Reference Stifter, Putnam and Jahromi2008).
Given the limited literature on continuity in temperament from the fetus to child, there is minimal extant data on which to base a directional hypothesis. The two most germane existing findings include a modest link between being classified as a highly reactive infant at 4 months and faster fetal heart rate (Snidman et al., Reference Snidman, Kagan, Riordan and Shannon1995) and detected associations between fetal motor activity and laboratory-assessed behavioral inhibition at age 2 (DiPietro, Bornstein, et al., Reference DiPietro, Bornstein, Costigan, Pressman, Hahn, Painter and Yi2002). In that report, higher fetal motor activity between 24 and 36 weeks gestation was associated with lower behavioral inhibition. Each of these reports is limited by small sample sizes (n = 35 in both) and relatively short developmental reach. Based on these empirical findings, and the general maturational principles underlying fetal development research, we predict that faster heart rate, greater heart rate variability, and more motor activity will predict less behavioral inhibition in childhood.
Whereas Study 1 is focused on baseline, spontaneous fetal behavior, Study 2 seeks to establish whether fetal reactivity and subsequent regulation elicited by perturbation of the intrauterine environment reveal key attributes of individual differences as they extend to child behavioral difficulties. We rely on evoked maternal autonomic arousal, provided by a standard challenge task, to provide the eliciting stimulus for fetal responsivity. A range of temperament attributes has been linked to perceived behavioral difficulties in children, but here we focus on a general composite of behavioral difficulties as reported by mothers using a structured interview. Again, limited empirical information hinders our ability to generate hypotheses, but we expect that lack of regulation following reactivity in the fetus will extend to indicators of behavioral dysregulation in childhood.
General Method: Overview
This report is based on a childhood follow-up of 385 children distributed over four cohorts of maternal–fetal pairs that provided prenatal data commencing midway through gestation between June 1997 and July 2007. Women were subsequently surveyed about their child's behavior by phone and mailed questionnaires when children were in late childhood and early adolescence (ages 7 to 14). Although study aims and methods for each prenatal cohort varied, a standard protocol was embedded in each allowing aggregation of baseline fetal data across cohorts; these data form the basis for Study 1. The protocol for the largest of the cohorts included a maternal manipulation implemented to assess fetal reactivity and recovery, providing the Study 2 data. Because the research questions, methods, and dependent and independent measures differ for each study, they are each presented separately.
Study 1: Fetal Neurobehavior and Inhibited/Uninhibited Temperament
Participants
A total of 508 eligible maternal–fetal pairs participated in the fetal studies. Prenatal enrollment for all cohorts was limited to nonsmoking healthy women with singleton pregnancies and without significant preexisting conditions that would jeopardize normal progression of pregnancy at enrollment. Women were self-referred volunteers recruited through local university and hospital publications or referrals from from other participants. Pregnancy dectection was based on early first trimester testing (M gestational age at pregnancy diagnosis = 4.7, SD = 1.3), and dating was confirmed by examination and/or ultrasound shortly thereafter (M gestational age at first prenatal visit = 7.7 weeks, SD = 1.9). Over time, and after children reached school age, families were contacted through information provided during study enrollment or via public records. The final sample (n = 385) includes women who completed the full protocol (interview and questionnaire; n = 333) or the interview alone (n = 53). Loss to follow-up was due to inability to locate participants over time (n = 90), lack of response to attempted contact (n = 27), and declining participation (n = 3).
Maternal characteristics, based on data collected prenatally, reflects a population of predominantly well-educated (M maternal education = 16.8 years), married (93.8%), and mature (M age = 31.6 years), respondents. Infants were predominantly normally grown (M weight at birth = 3432 g), full-term (M gestational age at delivery = 39.1 weeks), and had normal Apgar scores (M = 8.0 and 8.9 at 1 and 5 min). Half of the offpring were girls (49.5%), and at follow-up most were either the oldest (47.9%) or youngest (27.6%) member of the family. Women who provided data about their children were slightly older, M difference = 1.4 years, t (506) = 3.08, p < .01, with more years of education, M difference = 1.4 years, t (506) = 3.08, p < .01, than those who were eligible but did not participate (n = 120). As a result of our long-standing research program in the community, a number of women participated with more than one pregnancy. There were 44 sets of siblings within the total follow-up sample of 385.
Materials and procedure
Prenatal data collection overview
The standard fetal monitoring protocol involved a baseline, unperturbed 50-min recording of fetal neurobehavioral and maternal psychophysiological measures. Longitudinal designs were implemented for all cohorts, ranging from two to six visits. A full description of the cohorts, prenatal protocol, and the manner in which the fetal data set was compiled is provided elsewhere (DiPietro et al., Reference DiPietro, Costigan and Voegtline2015). That report, in the form of a monograph, is based solely on documenting prenatal development without any postnatal follow-up. The current study is based on three cohorts with data collection during or near the 24th, 32nd, and 36th weeks of gestation (i.e., cohorts I, III, and VI) and one with only the last two periods (IV). The last two cohorts (VII and VIII) concluded more recently (2013), and those children are not included in this report because they were not of school age at the time of analysis.
Prenatal data collection
Visits were generally conducted in early afternoons (13:00 to 15:00) to control for potential diurnal effects. Fetal position and amniotic fluid volume using a standard index, the amniotic fluid index (AFI), were ascertained through brief ultrasound scans. Women were monitored lying down with head elevated, and tilted slightly to the left to avoid venous compression. Monitoring proceeded for 50 undisturbed minutes. Fetal data were collected from the output port of a Toitu (MT320, Tokyo Japan) fetal actocardiograph, which detects fetal heart rate and motor activity through a single wide array transabdominal Doppler transducer. Data were sampled at 1000 Hz via an internal analog to digital board using streaming software; analysis proceeded offline using customized software (GESTATE; James Long Company, Caroga Lake NY). Digitized heart rate data were filtered for error rejection based on moving averages using a previously established algorithm; fetal motor activity was calibrated in arbitrary units ranging from 0 to 100.
Fetal variables were quantified as follows: (a) mean fetal heart rate (FHR), computed in 1-min epochs and averaged over the full recording period; (b) fetal heart rate variability (FHRV), calculated as the standard deviation of FHR values per 1-min epoch, and averaged over the recording; (c) total motor activity (FACTIVE), calculated as the number of bouts identified as each time the actograph signal equaled or exceeded a predetermined threshold (15 a.u.s.), and remained at or above this amplitude for at least 10 contiguous seconds multiplied by the mean duration of each bout(s), yielding the total time spent moving, in seconds, per 50-min recording. This approach for defining actograph-detected movements was validated against ultrasound visualization near the initiation of our research program (DiPietro, Costigan, & Pressman, Reference DiPietro, Costigan and Pressman1999). In addition, a composite measure of somatic–cardiac (FM-FHR) coupling was defined as occurring each time a fetal movement bout was accompanied by an excursion in FHR ≥ 5 beats per minute (bpm) for ≥5 s above the FHR baseline, within 5 s prior to the movement onset, or 15 s after it, based on previously developed criteria (Baser, Johnson, & Paine, Reference Baser, Johnson and Paine1992; DiPietro, Hodgson, Costigan, Hilton, & Johnson, Reference DiPietro, Hodgson, Costigan, Hilton and Johnson1996). FM-FHR coupling index (COUP-IND) was computed as the number of coupled fetal movements divided by all fetal movements during the observation period. When coupling was detected, the latency between the onset of the fetal movement relative to the onset of the FHR change was calculated in seconds and the mean latency across instances was computed (COUP-LATEN). Because fetal monitoring took place over a 10-year period, fetal measures were standardized (i.e., Z scored) by cohort to rule out potential signal drift in the fetal monitoring device.
Maternal physiological signals were amplified using a multichannel, electrically isolated, bioamplifier (Model JAD-04; James Long Company, Caroga Lake, NY). An electrocardiogram was recorded from three carbon fiber disposable electrodes in triangulated placement and compiled as maternal heart rate (MHR), computed in 1-min epochs and averaged over the recording. Electrodermal activity was monitored from two silver–silver chloride electrodes with a gelled skin contact area placed on the distal phalanxes of the first and index fingers of the nondominant hand. Skin conductance was measured by administering a constant 0.5-volt root-mean-square 30 Hz excitation signal and detecting the current flow and quantified in terms of skin conductance level (SCL), scaled from 0 to 25 microsiemens (μS). Data quantification proceeded offline using the PHY General Physiology System and IBI Analysis Systems (James Long Company).
Childhood follow-up
General characteristics about the child (e.g., age and birth order) were collected during the initial phone interview followed by a structured interview about child behavior (see Study 2). At the time of phone contact, women were asked to complete and return a temperament questionnaire by mail. M child age at follow-up was 10.1 years with a relatively small standard deviation (SD = 1.4), but there was significant range (7.3 years to 14.8). Given that it took a decade to accrue the fetal data, follow-up data collection was a largely unfunded venture that was implemented over time as resources and available personnel allowed. This necessitated the use of two different age-based temperament questionnaires. Women completed either the Temperament in Middle Childhood Questionnaire (TMCQ; version 3.0; Simonds & Rothbart, Reference Simonds and Rothbart2004); n = 269, 81%, or the Early Adolescent Temperament Questionnaire—Revised (EATQ, Parent; Ellis & Rothbart, Reference Ellis and Rothbart2001); n = 64, 19%.
Depending on the dimension, between 5 and 11 individual items scores were summed according to scale instructions, with reverse scoring as appropriate, to yield values for target constructs of shyness, fearfulness, and surgency, which include items consistent with the constructs of behavioral inhibition and exuberance, respectively. Each item is scored on a 5-point scale from 1 (almost always untrue) to 5 (almost always true). Both questionnaires assess the same temperament dimensions of interest using similar items although with some variations based on age-based social and behavioral repertoire. For example, both questionnaires contain very similar items for the construct of shyness (e.g., “Feels shy meeting new people”), but surgency items correspond to age (e.g., “Likes going down high slides or other adventurous activities” is included in the TMCQ while “Wouldn't be afraid to try a risky sport like deep sea diving” is in the EATQ). However, there was an age gradient in the degree to which mothers characterized their children as shy, r (331) = .12, p < .05, or surgent, r (331) = .11, p < .05, so values were standardized (i.e., Z scored) for the EATQ and the TMCQ separately.
Data analysis
Hierarchical linear modeling was used to confirm the developmental trends in prenatal measures (Raudenbush & Byrk, Reference Raudenbush and Byrk2002). Exploratory analyses included examining sex differences in dependent (child temperament ratings) and independent (fetal heart rate measures [FHR and FHRV], fetal motor activity [FACTIVE], and their relationship [COUP-IND and COUP-LATEN]), along with maternal context variable associations. Bivariate correlations were performed to establish whether there was sufficient indication of associations between prenatal and child measures to proceed with additional analyses. Detection of significant unadjusted associations resulted in stepwise multiple regressions predicting child temperament measures for each fetal measure, controlling for maternal contextual variables. Entry of the fetal measure at the last step provides a conservative approach by evaluating whether it adds significant unique variance to the maternal measures. In order to ascertain whether shared variance between siblings and mothers contributed to the findings, analyses were rerun excluding the second sibling for each pair. Analysis of variance was used to ascertain nonlinear associations following distribution of the temperament measures into three categories based on quartiles (i.e., lowest, highest, and middle 50%).
Results and discussion of Study 1
Descriptive values for fetal measures of those cases with temperament questionnaire data (n = 333) are presented in Table 1. As the data generated by these cohorts reflects a subsample of the data presented previously (DiPietro et al., Reference DiPietro, Costigan and Voegtline2015), mean values are presented only to provide measurement context for the current report. Developmental trends follow those reported on the full sample, which included significant declines in FHR and COUP-LATEN, increases in FHRV and COUP-IND (ps < .0001), and no change in overall motor activity. Data are presented by protocolized gestational period, but actual gestational age at testing, derived from early pregnancy dating, was used in analyses. M gestational ages (weeks) were 24.7 (SD = 0.7), 32.2 (SD = 0.7), and 36.6 (SD = 0.6) at the three periods studied. Note that the two indicators of coupling between FHR and FM signify different maturational expectations: COUP-IND provides information of the degree to which fetal movements inspire changes in heart rate; COUP-LATEN characterizes how tightly these two events are linked in time.
Note: FHR, fetal heart rate. FHRV, fetal heart rate variability. FACTIVE, total motor activity. COUP-IND, fetal motor activity–FHR coupling index. COUP-LATEN, in coupling instances mean latency between onset of fetal movement relative to the FHR change. Variation in sample sizes between gestational ages are the results of both missed visits and protocol differences between cohorts. Variation within gestational age are due to occasional data quality difficulties for specific measures.
With respect to temperament ratings, more educated women were less likely to characterize their children as fearful, r (331) = –.26, p < .001, but not shy, r (331) = –.07, or surgent, r (331) = .03. There were no sex differences in maternal ratings. Fear and shyness scale Z scores were significantly correlated, r (331) = .30, p < .001. Given that items in each reflect the core construct of behavioral inhibition, and following others (Volbrecht & Goldsmith, Reference Volbrecht and Goldsmith2010), they were combined (summed) into a composite variable. As expected, surgency was negatively related to fear/shyness, r (331) = –.39, p < .001, but analyzed as a separate construct. There was no association between the final Z-scored fear/shyness variable and child age at questionnaire administration, r (331) = –.01.
Preliminary unadjusted bivariate correlations revealed no significant associations between fetal heart rate, motor activity, or coupling measures and behavioral inhibition (i.e., fear/shyness composite), or surgency and data collected during the first (24 week) or last (36 week) gestational period. Correlations between fetal measures and behavioral inhibition ranged from r = .04 to r = –.10 for these periods. Surgency values ranged from r = –.04 to r = .11; the latter value reflecting the association between surgency and COUP-IND neared, but did not attain significance at 36 weeks (p = .06). As a result, data collected from these periods were not considered further. In contrast, all measured aspects of fetal functioning at the middle gestational period (i.e., 32 weeks) were significantly associated with behaviorally inhibited temperament. These associations were modest but significant: FHR, r (298) = –.17, p < .01; FHRV, r (298) = –.12, p < .05; FACTIVE, r (296) = –.16, p < .01; and COUP-LATEN (but not COUP-IND), r (298) = –.14, p < .05. Surgency was not significantly correlated with any fetal measure, rs ranged from .05 to .08, and was not considered further in continuous models.
In addition to maternal education level (M presented earlier), prenatal maternal context variables include body mass index at the start of pregnancy (M = 24.0, SD = 4.5), MHR (M = 86.1 bpm, SD = 9.1), and skin conductance (M = 7.0 μS, SD = 3.6) collected at 32 weeks. The 32-week AFI (M AFI = 14.3 cm, SD = 2.9) was also included because of its association with fetal motor activity. Consistent with prior findings (DiPietro et al., Reference DiPietro, Costigan and Voegtline2015), fetuses with more amniotic fluid at 32 weeks moved more than those with less, r (296) = .19, p < .001. Associations among maternal contextual variables are presented in Table 2. Most (58%, 86%, and 94%) fetuses had assumed a vertex (i.e., head down) position by 24, 32, and 36 weeks, respectively; fetal lie was not associated with fetal motor activity so not included in the models.
aMaternal heart rate and skin conductance level at 32 weeks gestation. *p < .05. **p < .001.
Separate regression analyses were constructed for each fetal measure as follows: maternal education was entered in the first step to control for the detected association with behavioral inhibition; intrauterine context variables (MHR, SCL, and body mass index, along with AFI for FACTIVE) were entered in the next step; and the fetal measure was entered on the final step. Table 3 presents the regression analysis results, and roman numerals indicate the Step 3 results for each separate regression of the individual fetal measures. Maternal education was significantly and negatively associated with temperament ratings while the intrauterine variables, either separately or together, did not add additional variance. Note that Steps 1 and 2 results as presented in Table 3 are based on the equation for FHR. Minor differences of one or two cases with missing data for FACTIVE or COUP-LATEN generated slightly different estimates for Steps 1 and 2 in those equations. With the exception of FHRV, which neared but did not attain significance (p = .10), all fetal measures contributed significant unique variance in the predication of behavioral inhibition at the final step of the equation. Exclusion of siblings that contributed both temperament and 32 week fetal data (n = 32) did not alter bivariate or multivariate results (not shown), with the exception that the FHRV association with behavioral inhibition attained significance, F (R 2 change) = 3.77, p = .05.
A final regression incorporating all of the fetal measures at the final step yielded the following results: multiple R = .35, R 2Δ = .06, F (8, 286) = 4.92, p < .001. Individual variables that retained significance in the final equation predicting childhood behavioral inhibition included: maternal education, t = –4.18, p < .001; FHR, t = –3.00, p < .01; and FACTIVE, t = –2.12, p < .05.
aEstimates based on equation for fetal heart rate. bAmniotic fluid index entered in Step 2 of motor activity equation. †p = .10. *p < .01. **p < .001.
Figure 2 provides visual depiction of these results using distribution of behavioral inhibition into quartile-based categories (lowest quartile, n = 79; highest quartile, n = 73, and middle half, n = 146). Categorical analyses were conducted to examine potential nonlinear relations not detected by regression analysis and sex differences. Separate 3 (Inhibition Category) × 2 (Sex) analyses of variance did not reveal any sex by fetal measure interactions. Significant post hoc contrasts for behavioral inhibition categories are provided. Note that analyses were conducted using Z scores, but nonstandardized variable values are provided in the figures to provide measurement context. Examination of the mean values suggests that all associations are linear, confirms the correlational and regression findings based on continuous values, and illustrates the particular contrasts between the lowest and highest behavioral inhibition groups. For both cardiac measures (FHR and FHRV) post hoc contrasts revealed significant differences only between the lowest and highest behavioral inhibition groups; for FACTIVE and COUP-LATEN, the highest group differed from both the lowest and middle groups. A similar approach taken for a categorical surgency variable did not reveal any significant differences (not presented).
Results from Study 1 provide fairly compelling support for the premise that temperamental variation in behavioral inhibition, defined here as maternal reports of child fearfulness and shyness, is established before birth. Fetuses exhibiting slower FHR, lower FHRV, and less FM at 32 weeks gestation were described by the mothers as more behaviorally inhibited in late childhood. In addition, the negative association with FM-FHR coupling latency implies that when the fetus does move and the movement is sufficient enough to generate perturbation in FHR, the system responds more quickly in fetuses who go on to display behavioral inhibition in childhood. The data analytic method selected is a conservative one because it controls for the relatively large contribution of maternal education on temperament rating, as well as potential maternal physiological influences that could conceivably be related to both the dependent and independent measures. Note that we are assuming that the association between higher maternal education and behavioral inhibition simply reflects a reporting bias, although maternal education may reflect environmental contributors to the expression of child fear/shyness.
Our failure to detect associations with fetal neurobehaviors and maternally reported surgency suggests two related possibilities. The first is that there may be a mismatch between data provided by questionnaires versus the types of measures assessed earlier in childhood in laboratory-based protocols. The second possibility is that, given that surgency and behavioral inhibition scores are negatively correlated, children who score low on behavioral inhibition may be characterized as more “uninhibited” and possess some characteristics of surgent temperament. These individuals occupy the lower end of the continuum reflected in the detected linear associations between behavioral inhibition and fetal measures. Thus, we are hesitant to rule out a prenatal origin for this aspect of temperament based on our assessment method.
The restriction of these results to 32 weeks but not earlier or closer to term may be related to the observed developmental shift in all measured parameters, with the exception of motor activity, that occurs at this gestational age. This period of neural reorganization may maximize detection of individual differences. Before this time, neural immaturity may constrain consolidation of underlying individual differences. As the fetus gets closer to term, physical constraints of the intrauterine environment may limit expression of these differences as factors such as fetal size relative to amniotic fluid, and the intrauterine space can dampen endogenously generated motor activity and resultant changes in heart rate. We will revisit this issue again in the final discussion.
Study 2
Participants
Study 2 was based on a subset (n = 130) of the larger sample from a cohort that included a fetal reactivity protocol with childhood follow-up data. Maternal (i.e., age, education, and marital status) and fetal (sex, gestational age at delivery, birth weight, and Apgar scores) characteristics were consistent with the larger sample described in Study 1. There were no siblings in this subset.
Procedure
Fetal and maternal monitoring proceeded as described for Study 1. The longitudinal protocol for one cohort included administration of the Stroop Color–Word test (MacLeod, Reference MacLeod1991) following completion of the 50-min undisturbed baseline recordings at 24 and 36 weeks gestation. The task, which requires disassociating word meaning from printed word color under time pressure, evokes a sympathetic response; the version used included pregnancy-specific stimuli as well as standard color words. Details of the protocol have been previously described (DiPietro, Costigan, & Gurewitsch, Reference DiPietro, Costigan and Gurewitsch2003). Fetal and maternal data were streamed continuously, and event marking generated three segments: pre-Stroop baseline, Stroop period, and post-Stroop. Child outcome measures were provided during a maternal telephone interview, as described in Study 1.
Measures
Maternal–fetal reactivity and recovery
To limit the number of analyses, fetal reactivity and regulation to induced maternal arousal was limited to the two principal fetal measures: FHR and FM. Because the experimental segments were too brief for the identification of discrete fetal movement bouts necessary for the total motor activity (and FM-FHR coupling) variables used in Study 1, FM was defined as the summed value of all actograph data points divided by the number of data points per epoch. Mean values were computed by subtracting the Stroop segment from the pre-Stroop baseline (i.e., reactivity) and subtracting the post-Stroop period from the Stroop period (i.e., recovery). MHR and electrodermal (SCL) reactivity to the Stroop were computed for inclusion as contextual variables.
Child behavior at follow-up
Mothers reported on children's behavior using the Strengths and Difficulties Questionnaire (SDQ; Goodman, Reference Goodman2001) administered during a telephone interview. The SDQ is a relatively straightforward appraisal of a child's typical behavioral profile that includes 25 items rated on 3-point scales from not true to certainly true, which are grouped into 5-item subscales: emotional symptoms, conduct problems, hyperactivity, peer problems, and prosocial behavior. The subscales, with the exception of prosocial behavior, are summed to provide a total difficulties score. Psychometric properties of the SDQ have been established in the current sample age range (Goodman & Goodman, Reference Goodman and Goodman2009; van Roy, Veenstra, & Clench-Aas, Reference van Roy, Veenstra and Clench-Aas2008).
Data analytic plan
Hierarchical linear models were used to confirm the fetal and maternal response to Stroop as previously presented in the full sample. Pearson correlations were used to evaluate unadjusted bivariate associations between measures of fetal reactivity/recovery and total difficulties (SDQ-Tot) and prosocial behavior scores (SDQ-Pro), because the latter is not included in the composite score. In addition, given the findings in Study 1, the peer problems score (SDQ-Peer) was also analyzed due to its focus on social interaction. Multivariate regression models were constructed to identify whether fetal responsivity measures were associated with child outcomes using a similar analytic approach as in Study 1. Separate analyses were conducted for each fetal variable at both gestational ages. Categorical analyses based on the level of fetal responsiveness were undertaken using mixed models to examine potential nonlinear associations and interactions with child sex and SDQ-Tot values; contrast estimates tested paired comparisons.
Results and discussion of Study 2
As described in the original report, the Stroop was effective in eliciting maternal physiological activation consisting of an increase in heart rate and SCL followed by a decrease after the manipulation ceased (DiPietro et al., Reference DiPietro, Costigan and Gurewitsch2003). Because the current prenatal analysis is based on slightly fewer cases due to exclusion of participants without child follow-up, there is slight variation in the values used in the current analyses, but the overall mean responses remain consistent. The manipulation did not generate a significant mean change in FHR at either gestational age, but did generate suppression of FM at 36 weeks in reaction to the Stroop followed by recovery that returned to or exceeded baseline levels afterward: reactivity, β = –1.65, SE = 0.38, t = –4.34, p < .0001; recovery, β = 1.44, SE = 0.40, t = 3.55, p < .001. A similar rebound phenomenon was observed at 24 weeks, recovery, β = 1.40, SE = 0.32, t = 4.44, p < .0001, although the initial reaction to the Stroop did not attain significance, reactivity, β = –0.31, SE = 0.25, t = –1.24, p = .22.
The original report was focused on main effects for condition and did not consider individual variation. However, Table 4 illustrates the wide individual variation in FHR responsivity. For example, at 36 weeks, FHR reactivity in response to the Stroop ranged from a reduction of 18 bpm to an increase of 22 bpm; similar reactivity ranges were found at 24 weeks and also for both recovery values. FM values were also variable although somewhat more constrained. Thus, we examined individual differences in responses throughout this section regardless of central tendency findings.
Note: Reactivity values were constructed by subtracting the 2nd epoch (i.e., Stroop) from the baseline 1st epoch (pre-Stroop baseline); recovery values were constructed by subtracting the 3rd epoch (i.e., post-Stroop) from the 2nd epoch (i.e., Stroop). As such, positive change scores indicate a decrease in the Stoop or post-Stroop epoch relative to the prior epoch; negative change scores indicate an increase in the Stroop or post-Stroop epochs relative to the prior epoch.
Scores for SDQ-Tot (M = 6.82, SD = 4.65), SDQ-Pro (M = 8.84, SD = 1.51), and SDQ-Peer (M = 1.39, SD = 1.45) were unrelated to child age at follow-up, rs (128) = –.10 to .06, ps = .26 to .88. However, child sex differences were detected for all SDQ outcome variables. Mothers rated boys as having higher SDQ-Tot scores, t (128) = 3.20, p < .01, lower SDQ-Pro scores, t (128) = –2.68, p < .01, and trend-level higher SDQ-Peer scores, t (128) = 1.95, p = .05. In addition, more highly educated women were less likely to report behavioral problems in their children, SDQ-Tot, r (128) = –.29, p < .001, including peer problems specifically, SDQ-Peer, r (128) = –.18, p < .05, but did not rate their children as more prosocial, SDQ-Pro, r (128) = .04, p = .65.
Bivariate associations of fetal response to induced maternal arousal and child behavior
No significant associations emerged between FHR or FM reactivity (i.e., delta from baseline to Stroop periods) and SDQ scores with one exception: a significant association between FM at 24 weeks and SDQ-Peer, r (128) = .18, p < .05. In contrast, there were a number of significant unadjusted associations between fetal recovery (i.e., delta from Stroop to post periods) and SDQ scores. At 24 weeks, FHR recovery was significantly associated with both SDQ-Tot and SDQ-Peer, rs (128) = –.19, ps < .05; this relationship was true for all three SDQ measures at 36 weeks, rs (109) range from –.28 to .23, ps < .05. That is, larger decreases in FHR following the Stroop were associated with lower SDQ problem scores and higher prosocial scores. Associations with fetal motor activity recovery were limited to 24 weeks, such that larger increases in fetal movement following the Stroop were associated with less SDQ-Pro behavior, r (119) = .26, p < .01, and a trend toward more SDQ-Peer problems, r (119) = –.16, p = .08.
General linear models of fetal response to induced maternal arousal and child behavior
Tables 5 and 6 provide regression results for 24 and 36 weeks. Because maternal education and child sex were both associated with SDQ ratings, these were entered on the first step to control for their effects. Maternal reactivity (MHR and SCL) was entered in the second step, and fetal parameters (reactivity and recovery) entered in the final step. In addition, to control for the law of initial values, fetal baseline values were included in the final step. Maternal education and/or child sex were significantly associated with each SDQ score, multiple Rs range from .22 to .39. While maternal physiological reactivity was not associated with SDQ scores at 24 weeks, at 36 weeks there were significant contributions to SDQ-Peer and SDQ-Pro such that women who reacted to the Stroop with greater sympathetic withdrawal (i.e., greater electrodermal decrease) reported their children as having more peer problems and less prosocial behavior. A larger increase in MHR was also associated with less prosocial behavior. Note that the values provided for Steps 1 and 2 in the tables are based on the equations for FHR; values for FM are similar and follow the same patterns of significance.
Note: SDQ, Strengths and Difficulties Questionnaire. MHR, maternal heart rate. SCL, skin conductance level. Fetal heart rate (FHR) and fetal motor activity (FM) reactivity and recovery values presented for Step 3 reflect separate equations. †p < .10. *p < .05. **p< .01.
Note: SDQ, Strengths and Difficulties Questionnaire. MHR, maternal heart rate. SCL, skin conductance level. Fetal heart rate (FHR) and fetal motor activity (FM) reactivity and recovery values presented for Step 3 reflect separate equations. †p < .10. *p < .05. **p < .01.
With respect to the fetus, at both 24 and 36 weeks, FHR recovery was significantly predictive of SDQ-Tot and SDQ-Peer, when controlling for all other variables in the equation; at 36 weeks, this association extended to SDQ-Pro as well. That is, fetuses that recovered with greater FHR decline following the Stroop were more likely to be rated as having fewer behavioral difficulties, including peer-related ones, and more prosocial behavior. FM responsivity remained significant only for SDQ-Prosocial skills at 24 weeks, with a trend-level association in the opposite direction for SDQ-Tot. At 36 weeks, FM responsivity was unrelated to child ratings.
Categorical recovery patterns and child behavior
Results presented so far are based on continuous values for change scores; categorical analyses were conducted to better understand the directionality of the fetal recovery response. In addition, review of bivariate correlations between FHR and FM recovery and child measures by fetal sex suggested disparities in the pattern of correlations. Top and bottom quartile groups were constructed based on the direction and magnitude of the change in FHR at 36 weeks from Stroop to post-Stroop periods as follows: activators, FHR increase ≥ +3 bpm, 24.3%, n = 27; suppressors, FHR decrease, < –3 bpm, 26.1%, n = 29. A similar strategy was applied to FM recovery change scores at 36 weeks: FM increase, 24.8%, n = 25; FM decrease, 24.8%, n = 25. Figure 3 depicts SDQ-Tot by FHR/FM recovery groups and sex. Using mixed models, there was a significant FHR Group × Sex interaction, F (1, 51) = 4.84, p < .05. Pairwise comparisons revealed SDQ-Tot differed for boys versus girls among FHR activators, t (51) = –3.19, p <.01. A similar finding, using ±3 bpm cutoff values, was replicated at 24 weeks (not shown), Group × Sex interaction, F (1, 60) = 7.56, p < .01, t (60) = −4.16, p < .001. Although the interaction did not attain significance for FM at 36 weeks, a sex difference was detected among FM suppressors, t (45) = –2.68, p < .05.
These results suggest the primacy of fetal poststimulation recovery, as opposed to reactivity, in the prediction from fetal neurobehavioral measures to child outcomes. The SDQ provides a relatively undifferentiated indicator of child tendencies that can be generalized in terms of dysregulation that results in emotional, behavioral, and social disruptions. The results suggest that the degree to which the fetus responds following termination of an environmental challenge, regardless of reactivity to it, provides information about individual differences in neural organization that are manifest in childhood as generalized regulatory problems. For example, at 24 weeks greater rebound in fetal movement following the Stroop is ultimately associated with diminished prosocial behavior, perhaps reflective of a lesser regulatory “brake” that extends to social situations. This seems to be particularly true for boys who displayed a characteristic pattern of post-Stroop activation in heart rate coupled with motor activity suppression. Although the total SDQ score was the principal outcome measure of interest, prosocial ratings were analyzed separately because they were not included in the total score. Findings showing associations with this specific domain of child function, along with its inverse, peer problems, were unexpected. Examination of the items that contribute to those scales suggests they access the degree to which the child gets along with and is helpful to others. This may be an especially particularly salient issue for mothers of children at this age to observe and may be indicative of the broader consequences of behavioral regulation and dysregulation.
General Discussion
The late-term human fetus and the 10-year-old child inhabit different worlds. The fetus is constrained upside-down, knees near ears, in an obscure, fluid-filled environment; the child is a fully sentient and self-aware being who locomotes freely as she goes about her business. Although both continue on their developmental trajectory, the behavioral repertoires and maturational proficiencies of each are vastly different. Despite this, findings from these two studies indicate that there are attributes that are fundamental to individual differences that can be detected before birth. In Study 1, children who were rated as exhibiting greater fearfulness and shyness were more likely to have slower fetal heart rates and less variability, exhibited less motor activity, and when they did move responded with heart rate reactivity to that motor activity more quickly than fetuses rated as less inhibited in childhood. Taken together, this pattern of findings suggests overall lower autonomic sympathetic and parasympathetic tone interspersed with short latency cardiac reactivity to endogenously generated movements in those fetuses.
The literature on baseline heart rate and behavioral inhibition in early and middle childhood is somewhat mixed, with some reports finding no association while those that do report the opposite of what was detected here, that behaviorally inhibited children display faster heart rate (i.e., generally reported as lower heart period; Fox et al., Reference Fox, Henderson, Marshall, Nichols and Ghera2005). The discrepancy may lie in the context in which a fetus and child are measured given that behaviorally inhibited children who are assessed in a laboratory situation may already be displaying sympathetic activation as a result of study participation; the fetus is unaware of being monitored. The detected association between higher spontaneous fetal motor activity and less behavioral inhibition confirmed our earlier report based on laboratory assessment at age 2 (DiPietro, Bornstein, et al., Reference DiPietro, Bornstein, Costigan, Pressman, Hahn, Painter and Yi2002).
In contrast to reliance on tonic (i.e., undisturbed) measurement in Study 1, Study 2 illustrates how a perturbation methodology can be applied to the fetal period to help standardize a window of observation. While conceptually this may be a more appealing approach, it introduces more complexities in interpretation because fetal stimulation is a downstream consequence of maternal reactivity. Although pregnancy is associated with blunted physiological responsiveness, sufficient individual variation is retained (deWeerth & Buitelaar, Reference deWeerth and Buitelaar2005). Nonetheless, when a fetus does not display a robust response, it is difficult to ascertain whether this may be the result of constitutional differences in reactivity and regulation of the fetus, the pregnant woman, or both. This may help reconcile, in part, why recovery patterns were more consistently predictive of child outcomes than reactivity ones. Moreover, while we focused on maternal autonomic indicators, there are many unrecorded signals (e.g., changes to the intrauterine acoustic environment) generated by induced maternal arousal that may be transduced to the fetus, making it difficult to fully characterize the maternal response. Despite these challenges, these findings confirm our expectation that individual differences in fetal regulatory function following stimulation are associated with indicators of generalized child dysregulation, which include the social context.
The unique contributions of individual fetal measures to report of child temperament and behavior detected here are significant but not large, accounting for approximately 2% to 3% of total explained variance in behavioral inhibition after controlling for the large associations with maternal education on reporting. However, combining all fetal indicators together generated 6% of explanatory variance, accounted for primarily by fetal heart rate and motor activity, which is comparable to the within-subject stability reported in many longitudinal studies restricted to infancy and childhood. In Study 2, 10% of unique variance in childhood behavioral difficulties was accounted for by 36-week fetal heart rate reactivity and recovery combined. Note that baseline (prechallenge) fetal heart rate was also predictive in this instance, such that higher fetal heart rate as well as the postchallenge regulatory response was also associated with total behavior problems in general, and peer problems in particular. This dovetails with the association detected between faster heart rate and less behavioral inhibition in Study 1, particularly as the baseline period used prechallenge was not the same as used in Study 1.
The other obvious distinction between the fetus and child, in addition to those in posture and capabilities described above, is that one is housed within another individual. While emotional engagement remains in late childhood, the child is no longer physiologically enmeshed. Although baseline maternal physiological indicators have been previously observed to be correlated with fetal heart rate and motor activity within periods of observation, the directionality of this association is not straightforward because the fetus also exerts influence on maternal physiological functioning (DiPietro et al., Reference DiPietro, Caulfield, Irizarry, Chen, Merialdi and Zavaleta2006). In Study 1, neither maternal prenatal autonomic measure contributed to child temperament. In Study 2, the observed associations between both measures of physiological reactivity to challenge and child prosocial behavior at 36 weeks suggests that the reactive maternal component may exert physiological priming of the fetus that translates into childhood behavior. Alternatively, women displaying certain reactivity patterns to challenge (specifically, electrodermal activation coupled with heart rate suppression) are more likely to rate their children positively or provide a child-rearing environment that encourages prosocial development. The latter possibilities presume that maternal reactivity shows stability from pregnancy through the ensuing 10 years, which is not implausible nor mutually exclusive with fetal priming.
Now we turn to what we did not find, associations between fetal measures at the earlier (24 week) and later (36 week) gestational ages and child outcomes in Study 1. This is a not uncommon problem with longitudinal studies with repeated assessments, and interpretation as to why predictive relations are found at one age but not another is not always straightforward. Conversely, had data been collected only at the midpoint of 32 weeks, we might have been tempted to overgeneralize to the entire fetal period. Developmental discontinuities, resulting in temporary reassortment of rank ordering, may play a role. The fetal period is not monolithic and just as there are developmental shifts during the first years of life that are presumed to reflect key periods of neural reorganization (Kagan, Reference Kagan1979; Zeanah, Boris, & Larrieu, Reference Zeanah, Boris and Larrieu1997), at least one of these has been established during the prenatal period near 32 weeks gestation, as described and confirmed in the current analysis. Although periods of reorganizational instability may dampen the ability to detect associations, they may also reflect variation in the rate at which individuals mature, thereby distinguishing individuals at a given point in time. In addition, the physical constraints exerted on a developing fetus within a limited intrauterine space may diminish the ability to detect associations after this point. However, the preeminence of prediction from 32 weeks was unexpected because our earlier work and that of others has reported associations with early (i.e., first or second year of life) outcomes at 36 weeks or later, and at times, at earlier points. It is possible that the less differentiated behavioral repertoire of younger children makes it easier to identify links to neurobehaviors.
The other possibility is that similar associations exist at the other gestational ages, but based on the considerable signal-to-noise ratio in the fetal data, we were unable to detect them in this sample. That is, given the circumstantial differences between the fetus and the child, variation in the intrauterine context during gestation, and the degree of measurement error inherent to collecting data from a study participant that cannot be directly viewed or handled, true shared variance between fetal indicators and child outcomes may be obscured. Stability in fetal cardiac measures and motor activity has been detected as early as 20 or 24 weeks and persists through term (DiPietro et al., Reference DiPietro, Bornstein, Hahn, Costigan and Achy-Brou2007, Reference DiPietro, Costigan and Voegtline2015), making the inability to detect associations with undisturbed functioning at the earlier or later periods more puzzling. In Study 2, although significant associations were not consistently found for both gestational ages studied, significant findings were detected at the earlier (24 weeks) and later (36 weeks) periods. This includes replication of the association between recovery of fetal heart rate following induced maternal arousal in relation to total behavioral difficulties and peer difficulties at both 24 and 36 weeks, along with the significant sex difference. While this may reveal a benefit of standardizing the window of observation through perturbation, we have no ready explanation for the lack of findings at these gestational ages in Study 1.
The clear methodological limitation of this report lies in its reliance on maternal report for both studies as opposed to measuring child temperament or behavior in a laboratory setting. While this approach tends to be the rule rather than the exception in studies predicting outcomes from the prenatal period, until the data can be confirmed by observational methods, we regard the findings as provocative but preliminary. However, given the difficulty in collecting data on large prenatal cohorts and the ensuing interim until they reach childhood, we are not optimistic that such an opportunity of equivalent follow-up duration or sample size will arise in the near future. This report was able to leverage the large extant fetal data set, generated by collapsing across multiple cohorts, to extend the reach of prediction from the prenatal period to late childhood and early adolescence in over 300 maternal–fetal pairs. In addition to the benefit conferred by the sample size, validity of the current findings is bolstered by the fact that women were unaware of the independent measures of fetal functioning, thereby precluding reporting biases on that basis. This includes fetal motor activity, as even if women could accurately recall how vigorously a child moved before birth 10 years later, felt fetal movements constitute only a small proportion of fetal movements. Without a systematic source of bias, reliance on maternal report in this context would tend toward a Type II error of interpretation, such that it introduces measurement error that would diminish the ability to detect significant associations that exist.
The findings presented here represent the most comprehensive evaluation to date of the fetal origins of childhood temperament and behavioral outcomes. Change and constancy in human development has been a long-running theme in developmental science (Bornstein & Suess, Reference Bornstein and Suess2000; Kagan, Reference Kagan1979; Putnam, Reference Putnam2011; Rothbart, Ahadi, & Evans, Reference Rothbart, Ahadi and Evans2000), and here we show that over fairly protracted differences in time and place, core features of individual differences, are preserved. The substrate of individual autonomic and behavioral variation in reactivity and subsequent regulation undergirds the current focus on behavioral inhibition (i.e., fear and shyness) and behavioral regulation or dysregulation (i.e., problem behaviors). The model presented in Figure 1 includes the construct of canalization, indicated by the widening scoop at bottom, connoting the decrease in constraints imposed by species-typical processes earlier in development toward expression of individual differences as development progresses (McCall, Reference McCall1981). The period of measurement in this report reflects a relatively canalized period of development, yet sufficient individual variation exists to detect longer term extensions. While the current findings confirm the supposition of the constitutionality of these core dimensions, they do not reveal its source. Contributors to individual differences in the fetus likely include genetic influences, those introduced to the intrauterine environment by endogenous and exogenous maternal factors, and more diffuse environmental influences that may affect the fetus directly or though epigenetic alterations. The degree and manner in which these intrauterine and environmental influences displace an individual's developmental trajectory from normative levels of reactivity and regulation toward dysregulational ones continues to be a key area of developmental inquiry.