Hostname: page-component-cd9895bd7-gbm5v Total loading time: 0 Render date: 2024-12-27T18:48:57.107Z Has data issue: false hasContentIssue false

GENERALIZED ADDITIVE PARTIAL LINEAR MODELS WITH HIGH-DIMENSIONAL COVARIATES

Published online by Cambridge University Press:  07 August 2013

Heng Lian
Affiliation:
Nanyang Technological University
Hua Liang
Affiliation:
University of Rochester

Abstract

This paper studies generalized additive partial linear models with high-dimensional covariates. We are interested in which components (including parametric and nonparametric components) are nonzero. The additive nonparametric functions are approximated by polynomial splines. We propose a doubly penalized procedure to obtain an initial estimate and then use the adaptive least absolute shrinkage and selection operator to identify nonzero components and to obtain the final selection and estimation results. We establish selection and estimation consistency of the estimator in addition to asymptotic normality for the estimator of the parametric components by employing a penalized quasi-likelihood. Thus our estimator is shown to have an asymptotic oracle property. Monte Carlo simulations show that the proposed procedure works well with moderate sample sizes.

Type
ARTICLES
Copyright
Copyright © Cambridge University Press 2013 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

1

We sincerely thank Professor Oliver Linton and two anonymous reviewers for their insightful and constructive comments, which have improved the manuscript significantly. Lian’s research was supported by Singapore MOE Tier 1 RG 62/11. Liang’s research was partially supported by NSF grants DMS-1007167 and DMS-1207444 and by Award 11228103 from the National Natural Science Foundation of China. Address correspondence to Hua Liang, Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY 14642, USA; e-mail: [email protected].

References

REFERENCES

Andrews, D. (1991) Asymptotic normality of series estimators for nonparametric and semiparametric regression models. Econometrica 59, 307345.CrossRefGoogle Scholar
Andrews, D. & Whang, Y. (1990) Additive interactive regression models: Circumvention of the curse of dimensionality. Econometric Theory 6, 466479.CrossRefGoogle Scholar
Belloni, A. & Chernozhukov, V. (2012) Post-l1-penalized estimators in high-dimensional linearregression models. Bernoulli, forthcoming.Google Scholar
Bickel, P., Ritov, Y., & Tsybakov, A. (2009) Simultaneous analysis of LASSO and Dantzig selector. Annals of Statistics 37, 17051732.CrossRefGoogle Scholar
Buja, A., Hastie, T., & Tibshirani, R. (1989) Linear smoothers and additive models. Annals of Statistics 17, 453510.CrossRefGoogle Scholar
Burda, M.C., Härdle, W., Müller, M., & Werwatz, A. (1998) Semiparametric analysis of German East-West migration intentions: Facts and theory. Journal of Applied Econometrics 13, 525541.Google Scholar
Chen, H. (1988) Convergence rates for parametric components in a partly linear model. Annals of Statistics 16, 136146.CrossRefGoogle Scholar
Chen, X. (2007) Large sample sieve estimation of semi-nonparametric models. Handbook of Econometrics 6, 55495632.CrossRefGoogle Scholar
Donald, S. & Newey, W. (1994) Series estimation of semilinear models. Journal of Multivariate Analysis 50, 3040.CrossRefGoogle Scholar
Du, P., Ma, S.G., & Liang, H. (2010) Penalized variable selection procedure for Cox models with semiparametric relative risk. Annals of Statistics 38, 20922117.Google ScholarPubMed
Engle, R., Granger, C., Rice, J., & Weiss, A. (1986) Semiparametric estimates of the relation between weather and electricity sales. Journal of the American Statistical Association 81, 310320.CrossRefGoogle Scholar
Fan, J.Q. & Li, R. Z. (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 13481360.CrossRefGoogle Scholar
Fan, J.Q. & Peng, H. (2004) Nonconcave penalized likelihood with a diverging number of parameters. Annals of Statistics 32, 928961.CrossRefGoogle Scholar
Friedman, J., Hastie, T., Höfling, H., & Tibshirani, R. (2007) Pathwise coordinate optimization. Annals of Applied Statistics 1, 302332.CrossRefGoogle Scholar
Härdle, W., Liang, H., & Gao, J.T. (2000) Partially Linear Models. Springer Physica.CrossRefGoogle Scholar
Härdle, W., Mammen, E., & Müller, M. (1998) Testing parametric versus semiparametric modeling in generalized linear models. Journal of the American Statistical Association 93, 14611474.Google Scholar
Heckman, N.E. (1986) Spline smoothing in partly linear models. Journal of the Royal Statistical Society, Series B 48, 244248.Google Scholar
Hiirdle, W., Müller, M., Sperlich, S., & Werwatz, A. (2004) Nonparametric and Semiparametric Models. Springer-Verlag.CrossRefGoogle Scholar
Horowitz, J. (1998) Semiparametric Methods in Econometrics. Springer-Verlag.CrossRefGoogle Scholar
Huang, J. (1998) Functional ANOVA models for generalized regression. Journal of Multivariate Analysis 67, 4971.CrossRefGoogle Scholar
Huang, J., Horowitz, J. L., & Wei, F. (2010) Variable selection in nonparametric additive models. Annals of Statistics 38, 22822313.CrossRefGoogle ScholarPubMed
Huang, J.H.Z., Wu, C.O., & Zhou, L. (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Statistica Sinica 14, 763788.Google Scholar
Juhl, T. & Xiao, Z. (2005) Partially linear models with unit roots. Econometric Theory 21, 877906.CrossRefGoogle Scholar
Kneib, T., Konrath, S., & Fahrmeir, L. (2011) High dimensional structured additive regression models: Bayesian regularization, smoothing and predictive performance. Journal of the Royal Statistical Society, Series C 60, 5170.CrossRefGoogle Scholar
Lam, C. & Fan, J. (2008) Profile-kernel likelihood inference with diverging number of parameters. Annals of Statistics 36, 22322260.Google ScholarPubMed
Leeb, H. & Pötscher, B. (2005) Model selection and inference: Facts and fiction. Econometric Theory 21, 2159.CrossRefGoogle Scholar
Leeb, H. & Pötscher, B. (2008) Sparse estimators and the oracle property, or the return of Hodges’ estimator. Journal of Econometrics 142, 201211.CrossRefGoogle Scholar
Li, Q. (2000) Efficient estimation of additive partially linear models. International Economic Review 41, 10731092.CrossRefGoogle Scholar
Li, Q. & Stengos, T. (1996) Semiparametric estimation of partially linear panel data models. Journal of Econometrics 71, 389397.CrossRefGoogle Scholar
Li, Q. & Wooldridge, J.M. (2002) Semiparametric estimation of partially linear models for dependent data with generated regressors. Econometric Theory 18, 625645.CrossRefGoogle Scholar
Li, R. & Liang, H. (2008) Variable selection in semiparametric regression modeling. Annals of Statistics 36, 261286.CrossRefGoogle ScholarPubMed
Linton, O. & Härdle, W. (1996) Estimation of additive regression models with known links. Biometrika 83, 529540.CrossRefGoogle Scholar
Linton, O. & Nielsen, J. (1995) A kernel method of estimating structured nonparametric regression based on marginal integration. Biometrika 82, 93100.CrossRefGoogle Scholar
Liu, X., Wang, L., & Liang, H. (2011) Estimation and variable selection for semiparametric additive partial linear models. Statistica Sinica 21, 12251248.CrossRefGoogle ScholarPubMed
Marx, B. & Eilers, P. (1998) Direct generalized additive modeling with penalized likelihood. Computational Statistics & Data Analysis 28, 193209.CrossRefGoogle Scholar
Müller, M. & Rönz, B. (2000) Credit Scoring Using Semiparametric Methods. Springer Lecture Notes in Statistics. Springer-Verlag.CrossRefGoogle Scholar
Newey, W. (1997) Convergence rates and asymptotic normality for series estimators. Journal of Econometrics 79, 147168.CrossRefGoogle Scholar
Robinson, P.M. (1988) Root n-consistent semiparametric regression. Econometrica 56, 931954.Google Scholar
Ruppert, D., Wand, M., & Carroll, R. (2003) Semiparametric Regression. Cambridge University Press.CrossRefGoogle Scholar
Severini, T.A. & Staniswalis, J.G. (1994) Quasi-likelihood estimation in semiparametric models. Journal of the American Statistical Association 89, 501511.CrossRefGoogle Scholar
Speckman, P.E. (1988) Kernel smoothing in partial linear models. Journal of the Royal Statistical Society, Series B 50, 413436.Google Scholar
Stone, C. (1986) The dimensionality reduction principle for generalized additive models. Annals of Statistics 14, 590606.CrossRefGoogle Scholar
Stone, C. (1994) The use of polynomial splines and their tensor products in multivariate function estimation. Annals of Statistics 22, 118171.CrossRefGoogle Scholar
Su, L. & Jin, S. (2010) Profile quasi-maximum likelihood estimation of partially linear spatial autoregressive models. Journal of Econometrics 157, 1833.CrossRefGoogle Scholar
Tibshirani, R. (1996) Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B, 58, 267288.Google Scholar
van der Geer, S.A. (2000) Applications of Empirical Process Theory. Cambridge University Press.Google Scholar
Wang, H., Li, R., & Tsai, C.L. (2007) Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 94, 553568.CrossRefGoogle ScholarPubMed
Wang, H.S. & Xia, Y.C. (2009) Shrinkage estimation of the varying coefficient model. Journal of the American Statistical Association 104, 747757.CrossRefGoogle Scholar
Wang, L., Liu, X., Liang, H., & Carroll, R. (2011) Estimation and variable selection for generalized additive partially linear models. Annals of Statistics 39, 18271851.CrossRefGoogle Scholar
Wood, S. (2004) Stable and efficient multiple smoothing parameter estimation for generalized additive models. Journal of the American Statistical Association 99, 673686.CrossRefGoogle Scholar
Wu, G. & Xiao, Z. (2002) A generalized partially linear model of asymmetric volatility. Journal of Empirical Finance 9, 287319.CrossRefGoogle Scholar
Xue, L. (2009) Consistent variable selection in additive models. Statistica Sinica 19, 12811296.Google Scholar
Xue, L. & Yang, L. (2006) Additive coefficient modeling via polynomial spline. Statistica Sinica 16, 14231446.Google Scholar
Yatchew, A. (2003) Semiparametric Regression for the Applied Econometrician. Cambridge University Press.CrossRefGoogle Scholar
Yu, K., Park, B., & Mammen, E. (2008) Smooth backfitting in generalized additive models. Annals of Statistics 36, 228260.CrossRefGoogle Scholar
Yuan, M. & Lin, Y. (2006) Model selection and estimation in regression with grouped variables. Journal of the Royal Statistical Society, Series B 68, 4967.CrossRefGoogle Scholar
Zhang, C.H. & Huang, J. (2008) The sparsity and bias of the lasso selection in high-dimensional linear regression. Annals of Statistics 36, 15671594.CrossRefGoogle Scholar
Zhang, H., Cheng, G., & Liu, Y. (2011) Linear or nonlinear? Automatic structure discovery for partially linear models. Journal of the American Statistical Association 106, 10991112.CrossRefGoogle ScholarPubMed
Zou, H. (2006) The adaptive LASSO and its oracle properties. Journal of the American Statistical Association 101, 14181429.CrossRefGoogle Scholar
Zou, H. & Li, R.Z. (2008) One-step sparse estimates in nonconcave penalized likelihood models. Annals of Statistics 36, 15091533.CrossRefGoogle ScholarPubMed