Hostname: page-component-745bb68f8f-b95js Total loading time: 0 Render date: 2025-01-08T09:47:08.339Z Has data issue: false hasContentIssue false

Regularized Finite Mixture Models for Probability Trajectories

Published online by Cambridge University Press:  01 January 2025

Kerby Shedden*
Affiliation:
Department of Statistics, University of Michigan
Robert A. Zucker
Affiliation:
Departments of Psychiatry and Psychology, University of Michigan
*
Requests for reprints should be sent to Kerby Shedden, Department of Statistics, University of Michigan, Ann Arbor, MI 48109-1107, USA. E-mail: [email protected]

Abstract

Finite mixture models are widely used in the analysis of growth trajectory data to discover subgroups of individuals exhibiting similar patterns of behavior over time. In practice, trajectories are usually modeled as polynomials, which may fail to capture important features of the longitudinal pattern. Focusing on dichotomous response measures, we propose a likelihood penalization approach for parameter estimation that is able to capture a variety of nonlinear class mean trajectory shapes with higher precision than maximum likelihood estimates. We show how parameter estimation and inference for whether trajectories are time-invariant, linear time-varying, or nonlinear time-varying can be carried out for such models. To illustrate the method, we use simulation studies and data from a long-term longitudinal study of children at high risk for substance abuse.

Type
Theory and Methods
Copyright
Copyright © 2008 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

This work was supported in part by NIAAA grants R37 AA07065 and R01 AA12217 to RAZ.

References

Achenbach, T.M., & Edelbrock, C. (1983). Manual for the child behavior checklist and revised child behavior profile, Burlington: University of Vermont, Department of Psychiatry.Google Scholar
Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions, 19, 716723.Google Scholar
Bishop, C. (1995). Neural networks for pattern recognition, Oxford: Clarendon.CrossRefGoogle Scholar
Dunson, D.B., & Herring, A.H. (2005). Bayesian latent variable models for mixed discrete outcomes. Biostatistics, 6(1), 1125.CrossRefGoogle ScholarPubMed
Chen, H., Chen, J., & Kalbfleisch, J.D. (2004). Testing for a finite mixture model with two components. Journal Royal Statistical Society B, 66, 96116.CrossRefGoogle Scholar
Burnham, K.P., & Anderson, D.R. (2002). Model selection and multimodel inference: a practical information-theoretical approach, (2nd ed.). New York: Springer.Google Scholar
Burnham, K.B., & Anderson, D.R. (2004). Multimodel inference: understanding AIC and BIC in model selection. Sociological Methods & Research, 33, 261305.CrossRefGoogle Scholar
Fan, J., & Li, R. (2004). New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. Journal of American Statistical Association, 99, 710723.CrossRefGoogle Scholar
Fergusson, D., Horwood, L.J.L., & Nagin, D.S. (2000). Offending trajectories in a New Zealand cohort. Criminology, 38, 525552.CrossRefGoogle Scholar
Galindo-Garre, F., & Vermunt, J.K. (2006). Avoiding boundary estimates in latent class analysis by Bayesian posterior mode estimation. Behaviormetrika, 33, 4359.CrossRefGoogle Scholar
Hastie, T., Tibshirani, R., & Friedman, J. (2001). The elements of statistical learning, Berlin: Springer.CrossRefGoogle Scholar
Jacqmin-Gadda, H., Joly, P., Commenges, D., Binquet, C., & Chene, G. (2002). Penalized likelihood approach to estimate a smooth mean curve on longitudinal data. Statistics in Medicine, 21(16), 23912402.CrossRefGoogle ScholarPubMed
Little, R.J.A., & Rubin, D.B. (1987). Statistical analysis with missing data, New York: Wiley.Google Scholar
Loeber, R., & Hay, D. (1997). Key issues in the development of aggression and violence from childhood to early adulthood. Annual Review of Psychology, 48, 371410.CrossRefGoogle ScholarPubMed
McLachlan, G.J., & Peel, D. (2000). Finite mixture models, New York: Wiley.CrossRefGoogle Scholar
Muthén, B., & Shedden, K. (1999). Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics, 55(2), 463469.CrossRefGoogle ScholarPubMed
Nagin, D.S. (1999). Analyzing developmental trajectories: a semiparametric, group-based approach. Psychological Methods, 4(2), 139157.CrossRefGoogle Scholar
Nagin, D.S. (2005). Group-based modeling of development, Cambridge: Harvard University Press.CrossRefGoogle Scholar
Nagin, D.S., & Tremblay, R.E. (2001). Analyzing developmental trajectories of distinct but related behaviors: a group-based method. Psychological Methods, 6(1), 1834.CrossRefGoogle ScholarPubMed
Pu, W.J., & Niu, X.F. (2006). Selecting mixed-effects models based on a generalized information criterion. Journal of Multivariate Analysis, 97, 733758.CrossRefGoogle Scholar
Ramsay, J., & Silverman, B.W. (2005). Functional data analysis, (2nd ed.). Berlin: Springer.CrossRefGoogle Scholar
Rice, J.A., & Silverman, B.W. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves. Journal of the Royal Statistical Society B, 53, 233243.CrossRefGoogle Scholar
Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461464.CrossRefGoogle Scholar
Shen, X.T., & Ye, J.M. (2002). Adaptive model selection. Journal of the American Statistical Association, 97, 210221.CrossRefGoogle Scholar
Titterington, D.P., Smith, A.F.M., & Makov, U.E. (1985). Statistical analysis of finite mixture distributions, New York: Wiley.Google Scholar
Verbeke, G., & Lesaffre, E. (1996). A linear mixed effects model with heterogeneity in the random effects population. Journal of the American Statistical Association, 91, 217221.CrossRefGoogle Scholar
Vermunt, J.K., & Magidson, J. (2004). Technical guide for Latent GOLD 4.0. Statistical Innovations.Google Scholar
Wahba, G. (1990). Spline models for observational data. CBMS-NSF regional conference series, Philadelphia: SIAM.CrossRefGoogle Scholar
Winsberg, S., & De Soete, G. (1999). Latent class models for time series analysis. Applied Stochastic Models in Business and Industry, 15, 3, 183-194.3.0.CO;2-T>CrossRefGoogle Scholar
Yung, Y.F. (1997). Finite mixtures in confirmatory factor-analysis models. Psychometrika, 62, 297330.CrossRefGoogle Scholar
Zucker, R.A., Ellis, D.A., Bingham, C.R., & Fitzgerald, H.E. (1996). The development of alcoholic subtypes: Risk variation among alcoholic families during early childhood years. Alcohol Health & Research World, 20, 4654.Google ScholarPubMed
Zucker, R.A., Fitzgerald, H.E., Refior, S.K., Puttler, L.I., Pallas, D.M., & Ellis, D.A. (2000). The clinical and social ecology of childhood for children of alcoholics: description of a study and implications for a differentiated social policy. In Fitzgerald, H.E., Lester, B.M., & Zuckerman, B.S. (Eds.), Children of addiction: research, health and policy issues, London: Routledge. (Chap. 4, pp. 109–141)Google Scholar