Researchers and public health experts are increasingly interested in modelling longitudinal data to determine relationships between early life exposures and future health outcomes. Such relationships are particularly relevant for dietary exposures where poor diet quality over a period of time may be associated with an increased risk of developing obesity. Previous longitudinal dietary intake analyses have estimated population means over time using mixed effect and average growth models(Reference Winpenny, van Sluijs and White1,Reference Goff, Huang and Silva2) . However, classifying individuals into subgroups (latent classes) who follow similar trajectories over time is potentially more informative than previous analytical approaches. These trajectories describe longitudinal patterns of diet quality and could therefore help identify when and in whom to intervene. Although several methods are available for defining latent classes(Reference van der Nest, Passos and Candel3), these have yet to be applied to longitudinal measures of dietary intake across early life.
The aim of this study was to provide an overview of two latent class methods: growth mixture models (GMM) and group-based trajectory modelling (GBTM) using the gllamm and traj Stata commands, respectively. GBTM and GMM were selected over other models because they are able to handle missing data and unevenly spaced assessments over time. GBTM also allows the user to define the shape of the individual trajectories (e.g. linear/quadratic). These methods were applied to data from 2963 mother-child dyads from the Southampton Women's Survey (SWS). In the SWS cohort continuous diet quality indices (DQIs) were derived using principal component analysis from an interviewer-administered food frequency questionnaire collected in the mother preconception and at 11 and 34 weeks’ gestation, and in the child at ages 6 months, 12 months, 3, 6–7 and 8–9 years. To identify the optimal number of latent classes for the DQIs, we used a forward modelling approach, as advised by the GRoLTS-Checklist, from one to six classes(Reference Lennon, Kelly and Sperrin5). Each model was assessed using the following criteria for classifying and interpreting latent class analyses: the Bayesian information criteria, probability of class assignment, the ratio of the odds of a correct classification, group membership, and relative entropy.
According to the classification criteria, the optimal model included 5-trajectories, with results from the GMM and GBTM being most similar for the 5-trajectory model (Spearman's correlation = 0.98). Both methods characterised the dietary trajectories as stable with horizontal lines, and the five trajectories defined as low (GMM = 4.1%, GBTM = 4.8%), low-medium (23.0%, 22.7%), medium (39.0%, 39.0%), medium-high (27.4%, 27.9%) and high (6.5%, 5.6%).
Although both methods provide a similar interpretation of dietary trajectories, we propose using GBTM command for modelling longitudinal dietary intake because GMM was computationally more intensive. Additionally, the findings of this study provide a rationale to promote dietary changes preconceptionally as the results show that diet quality tracks from pre-conception into childhood.