Hostname: page-component-745bb68f8f-mzp66 Total loading time: 0 Render date: 2025-01-08T10:26:02.609Z Has data issue: false hasContentIssue false

A Bayesian Modeling Approach for Generalized Semiparametric Structural Equation Models

Published online by Cambridge University Press:  01 January 2025

Xin-Yuan Song
Affiliation:
Department of Statistics, The Chinese University of Hong Kong
Zhao-Hua Lu
Affiliation:
Department of Statistics, The Chinese University of Hong Kong
Jing-Heng Cai*
Affiliation:
Department of Statistics, Sun Yat-sen University
Edward Hak-Sing Ip
Affiliation:
Department of Biostatistical Sciences, Division of Public Health Sciences, Wake Forest University Health Sciences
*
Requests for reprints should be sent to Jing-Heng Cai, Department of Statistics, Sun Yat-sen University, Guangzhou, China. E-mail: [email protected]

Abstract

In behavioral, biomedical, and psychological studies, structural equation models (SEMs) have been widely used for assessing relationships between latent variables. Regression-type structural models based on parametric functions are often used for such purposes. In many applications, however, parametric SEMs are not adequate to capture subtle patterns in the functions over the entire range of the predictor variable. A different but equally important limitation of traditional parametric SEMs is that they are not designed to handle mixed data types—continuous, count, ordered, and unordered categorical. This paper develops a generalized semiparametric SEM that is able to handle mixed data types and to simultaneously model different functional relationships among latent variables. A structural equation of the proposed SEM is formulated using a series of unspecified smooth functions. The Bayesian P-splines approach and Markov chain Monte Carlo methods are developed to estimate the smooth functions and the unknown parameters. Moreover, we examine the relative benefits of semiparametric modeling over parametric modeling using a Bayesian model-comparison statistic, called the complete deviance information criterion (DIC). The performance of the developed methodology is evaluated using a simulation study. To illustrate the method, we used a data set derived from the National Longitudinal Survey of Youth.

Type
Original Paper
Copyright
Copyright © 2013 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Arminger, G., & Küsters, U. (1988). Latent trait models with indicators of mixed measurement level. In Langeheine, R., & Rost, J. (Eds.), Latent trait and latent class models (pp. 5173). New York: Plenum.CrossRefGoogle Scholar
Bartholomew, D.J., & Knott, M. (1999). Latent variable models and factor analysis. (2nd ed.). London: Arnold Publishers.Google Scholar
Bentler, P.M., & Stein, J.A. (1992). Structural equation models in medical research. Statistical Methods in Medical Research, 1, 159181.CrossRefGoogle ScholarPubMed
Berry, S.M., Carroll, R.J., & Ruppert, D. (2002). Bayesian smoothing and regression splines for measurement error problems. Journal of the American Statistical Association, 97, 160169.CrossRefGoogle Scholar
Bollen, K.A. (1989). Structural equations with latent variables. New York: Wiley.CrossRefGoogle Scholar
Cai, J.H., Song, X.Y., & Hser, Y.I. (2010). A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates. Statistics in Medicine, 29, 18611874.CrossRefGoogle ScholarPubMed
Cai, J.H., Song, X.Y., Lam, K.H., & Ip, H.S. (2011). A mixture of generalized latent variable models for mixed mode and heterogeneous data. Computational Statistics & Data Analysis, 55, 28892907.CrossRefGoogle Scholar
Caldwell, B.M., & Bradley, R.H. (1984). Home observation for measurement of the environment. Little Rock: University of Arkansas at Little Rock, Center for Child Development and Education.Google Scholar
Celeux, G., Forbes, F., Robert, C.P., & Titterington, D.M. (2006). Deviance information criteria for missing data models. Bayesian Analysis, 1, 651674.CrossRefGoogle Scholar
Center for Human Resource Research (2004). The national longitudinal surveys NLSY user’s guide, 1979–2004. Columbus: U.S. Department of Labor, Bureau of Labor Statistics.Google Scholar
Cowles, M.K. (1996). Accelerating Monte Carlo Markov chain convergence for cumulative-link generalized linear models. Statistics and Computing, 6, 101111.CrossRefGoogle Scholar
De Boor, C. (1978). A practical guide to splines. New York: Springer.CrossRefGoogle Scholar
DiMatteo, I., Genovese, C.R., & Kass, R.E. (2001). Bayesian curve fitting with free-knot splines. Biometrika, 88, 10551071.CrossRefGoogle Scholar
Dunn, L.M., & Markwardt, F. (1970). Peabody individual achievement test manual. Circle Pines: American Guidance Services.Google Scholar
Dunson, D.B. (2000). Bayesian latent variable models for clustered mixed outcomes. Journal of the Royal Statistical Society. Series B, 62, 355366.CrossRefGoogle Scholar
Eilers, P.H.C., & Marx, B.D. (1996). Flexible smoothing with B-splines and penalties. Statistical Science, 11, 89121.CrossRefGoogle Scholar
Fahrmeir, L., & Raach, A. (2007). A Bayesian semiparametric latent variable model for mixed responses. Psychometrika, 72, 327346.CrossRefGoogle Scholar
Fan, J., & Gijbels, I. (1996). Local polynomial modelling and its applications. London: Chapman & Hall.Google Scholar
Fox, J.P., & Glas, C.A.W. (2003). Bayesian modeling of measurement error in predictor variables using item response theory. Psychometrika, 68, 169191.CrossRefGoogle Scholar
Gelman, A., & Meng, X.L. (1998). Simulating normalizing constant: from importance sampling to bridge sampling to path sampling. Statistical Science, 13, 163185.CrossRefGoogle Scholar
Geman, S., & Geman, D. (1984). Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 721741.CrossRefGoogle Scholar
Green, P.J., & Silverman, B.W. (1994). Nonparametric regression and generalized linear models: a roughness penalty approach. London: Chapman & Hall.CrossRefGoogle Scholar
Hastings, W.K. (1970). Monte Carlo sampling methods using Markov chains and their application. Biometrika, 57, 97109.CrossRefGoogle Scholar
Huber, P., Scaillet, O., & Victoria-Feser, M.P. (2009). Assessing multivariate predictors of financial market movements: a latent factor framework for ordinal data. Annals of Applied Statistics, 3, 249271.CrossRefGoogle Scholar
Ibrahim, J.G., Chen, M.H., & Sinha, D. (2001). Criterion-based methods for Bayesian model assessment. Statistica Sinica, 11, 419443.Google Scholar
Imai, K., & van Dyk, D.A. (2005). Bayesian analysis of the multinomial probit model using marginal data augmentation. Journal of Econometrics, 124, 311334.CrossRefGoogle Scholar
Jara, A., Quintana, F., & San Martín, E. (2008). Linear mixed models with skew-elliptical distributions: a Bayesian approach. Computational Statistics & Data Analysis, 52, 50335045.CrossRefGoogle Scholar
Jöreskog, K.G., & Moustaki, I. (2001). Factor analysis for ordinal variables: a comparison of three approaches. Multivariate Behavioral Research, 36, 347387.CrossRefGoogle ScholarPubMed
Kass, R.E., & Raftery, A.E. (1995). Bayes factors. Journal of the American Statistical Association, 90, 773795.CrossRefGoogle Scholar
Kenny, D.A., & Judd, C.M. (1984). Estimating the nonlinear and interactive effects of latent variables. Psychological Bulletin, 96, 201210.CrossRefGoogle Scholar
Lang, S., & Brezger, A. (2004). Bayesian P-splines. Journal of Computational and Graphical Statistics, 13, 183212.CrossRefGoogle Scholar
Lee, S.Y. (2007). Structural equation modeling: a Bayesian approach. Chichester: Wiley.CrossRefGoogle Scholar
Lu, Z.H., & Song, X.Y. (2012). Finite mixture varying coefficient models for analyzing longitudinal heterogeneous data. Statistics in Medicine, 31, 544560.CrossRefGoogle Scholar
McCullagh, P., & Nelder, J.A. (1989). Generalized linear models. New York: Chapman & Hall.CrossRefGoogle Scholar
Meng, X.L. (1994). Posterior predictive p-values. The Annals of Statistics, 22, 11421160.CrossRefGoogle Scholar
Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., & Teller, E. (1953). Equations of state calculations by fast computing machine. Journal of Chemical Physics, 21, 10871091.CrossRefGoogle Scholar
Moustaki, I. (1996). A latent trait and a latent class model for mixed observed variables. British Journal of Mathematical & Statistical Psychology, 49, 313334.CrossRefGoogle Scholar
Moustaki, I., & Victoria-Feser, M.-P. (2006). Bounded-influence robust estimation in generalized linear latent variable models. Journal of the American Statistical Association, 101, 644653.CrossRefGoogle Scholar
Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika, 49, 115132.CrossRefGoogle Scholar
Nylund, K.L., & Muthén, B. (2007). Deciding on the number of classes in latent class analysis and growth mixture modeling: a Monte Carlo simulation study. Structural Equation Modeling. A Multidisciplinary Journal, 14, 535569.CrossRefGoogle Scholar
Owen, A.B. (2001). Empirical likelihood. Boca Raton: Chapman & Hall.Google Scholar
Pan, J.H., Song, X.Y., & Ip, H.S. (2013, in press). A Bayesian analysis of generalized latent curve mixture models. Statistics and Its Interface.Google Scholar
Panagiotelis, A., & Smith, M. (2008). Bayesian identification, selection and estimation of semiparametric functions in high-dimensional additive models. Journal of Econometrics, 143, 291316.CrossRefGoogle Scholar
Pugesek, B.H., Tomer, A., & von Eye, A. (2003). Structural equation modeling applications in ecological and evolutionary biology. New York: Cambridge University Press.CrossRefGoogle Scholar
Ruppert, D., Wand, M.P., & Carroll, R.J. (2003). Semiparametric regression. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
San Martín, E., Jara, A., Rolin, J.M., & Mouchart, M. (2011). On the Bayesian nonparametric generalization of IRT-type models. Psychometrika, 76, 385409.CrossRefGoogle Scholar
Sanchez, B.N., Budtz-Jorgenger, E., Ryan, L.M., & Hu, H. (2005). Structural equation models: a review with applications to environmental epidemiology. Journal of the American Statistical Association, 100, 14431455.CrossRefGoogle Scholar
Schumacker, R.E., & Marcoulides, G.A. (Eds.) (1998). Interaction and nonlinear effects in structural equation models. Mahwah: Lawrence Erlbaum Associates, Publishers.Google Scholar
Shi, J.Q., & Lee, S.Y. (2000). Latent variable model with mixed continuous and polytomous data. Journal of the Royal Statistical Society. Series B, 62, 7787.CrossRefGoogle Scholar
Skrondal, A., & Rabe-Hesketh, S. (2004). Generalized latent variable modeling: multilevel, longitudinal, and structural equation models. Boca Raton: Chapman & Hall/CRC.CrossRefGoogle Scholar
Song, X.Y., & Lee, S.Y. (2007). Bayesian analysis of latent variable models with nonignorable missing outcomes from exponential family. Statistics in Medicine, 26, 681693.CrossRefGoogle ScholarPubMed
Song, X.Y., Lee, S.Y., Ng, M.C.Y., So, W.Y., & Chan, J.C.N. (2007). Baysian analysis of structural equation models with multinomial variables and an application to type 2 diabetic nephropathy. Statistics in Medicine, 26, 23482369.CrossRefGoogle Scholar
Song, X.Y., & Lu, Z.H. (2010). Semiparametric latent variable models with Bayesian P-splines. Journal of Computational and Graphical Statistics, 19, 590608.CrossRefGoogle Scholar
Song, X.Y., & Lu, Z.H. (2012). Semiparametric transformation models with Bayesian P-splines. Statistics and Computing, 22, 10851098.CrossRefGoogle Scholar
Song, X.Y., Pan, J.H., Kwok, T., Vandenput, L., Ohlsson, C., & Leung, P.C. (2010). A semiparametric Bayesian approach for structural equation models. Biometrical Journal, 52, 314332.CrossRefGoogle ScholarPubMed
Spiegelhalter, D.J., Best, N.G., Carlin, B.P., & van der Linde, A. (2002). Bayesian measures of model complexity and fit (with discussion). Journal of the Royal Statistical Society. Series B, 64, 583639.CrossRefGoogle Scholar
Spiegelhalter, D.J., Thomas, A., Best, N.G., & Lunn, D. (2003). WinBUGS user manual. Version 1.4. Cambridge: MRC Biostatistics Unit.Google Scholar
Yang, M.G., & Dunson, D.B. (2010). Bayesian semiparametric structural equation models with latent variables. Psychometrika, 75, 675693.CrossRefGoogle Scholar
Yang, M.G., Dunson, D.B., & Baird, D. (2010). Semiparametric Bayes hierarchical models with mean and variance constraints. Computational Statistics & Data Analysis, 54, 21722186.CrossRefGoogle ScholarPubMed
Zill, N. (1985). Behavior problem scales developed from the 1981 child health supplement to the national health interview survey. Washington: Child Trends, Inc..Google Scholar