GLMs Part III – Zero-Inflated and Hurdle Models

Joseph M. Hilbe; Rafael S. de Souza; Emille E. O. Ishida

doi:10.1017/CBO9781316459515.008

7 - GLMs Part III – Zero-Inflated and Hurdle Models

Published online by Cambridge University Press: 11 May 2017


Joseph M. Hilbe: Jet Propulsion Laboratory, California Institute of Technology
Rafael S. de Souza: Eötvös Loránd University, Budapest
Emille E. O. Ishida: Université Clermont-Auvergne (Université Blaise Pascal), France


Summary

Zero-inflated models are mixture models. In the domain of count models, a zero-inflated model mixes a binary model for zero counts with a count model. It is a mixture because zeros are generated by both components: the binary component and the count component each contribute to the probability of a zero.
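The two sources of zeros can be made concrete with a small simulation. The following is a minimal sketch, not the chapter's own code: the Poisson count component, the function names, and the values 0.3 (probability of a zero from the binary component) and 2.0 (Poisson mean) are all illustrative assumptions.

```python
import math
import random


def draw_poisson(mu, rng):
    """Draw one Poisson(mu) variate with Knuth's multiplication method."""
    threshold = math.exp(-mu)
    k, product = 0, rng.random()
    while product > threshold:
        k += 1
        product *= rng.random()
    return k


def simulate_zero_inflated(n, p_structural_zero, mu, seed=1):
    """Simulate n counts; return (counts, n_structural_zeros, n_count_zeros)."""
    rng = random.Random(seed)
    counts, n_structural, n_count = [], 0, 0
    for _ in range(n):
        if rng.random() < p_structural_zero:
            # The binary component produces a zero outright.
            counts.append(0)
            n_structural += 1
        else:
            # The count component may also produce a zero.
            y = draw_poisson(mu, rng)
            counts.append(y)
            if y == 0:
                n_count += 1
    return counts, n_structural, n_count


# Illustrative (assumed) settings: 30% binary-component zeros, Poisson mean 2.
counts, n_structural, n_count = simulate_zero_inflated(10_000, 0.3, 2.0)
zero_fraction = counts.count(0) / len(counts)
print(n_structural, n_count, round(zero_fraction, 3))
```

Under these assumed settings the observed zeros split into those forced by the binary component and those that arise naturally from the count component, which is why a plain count model underpredicts zeros in such data.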

The logic of a zero-inflated model can be expressed as

Pr(Y = 0) = Pr(Bin = 0) + [1 − Pr(Bin = 0)] × Pr(Count = 0)

Pr(Y = y) = [1 − Pr(Bin = 0)] × PDFcount(y), for y > 0

Thus, the probability of a zero in a zero-inflated model equals the probability of a zero in the binary model component (e.g., the logistic) plus one minus that probability times the probability of a zero count in the count model component (e.g., the Poisson). The probability of a positive count y equals one minus the probability of a zero in the binary component times the count model probability of y; on the log scale this product becomes the sum log[1 − Pr(Bin = 0)] + log PDFcount(y). The above formulae are valid for all zero-inflated models.
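As a quick check on this logic, the mixture probabilities can be computed directly. This is a sketch under stated assumptions: a Poisson count component, illustrative values for Pr(Bin = 0) and the Poisson mean, and hypothetical function names. For a positive count, the code multiplies the count-model probability by 1 − Pr(Bin = 0), the standard mixture form; the corresponding sum appears only on the log scale.

```python
import math


def poisson_pmf(y, mu):
    """Poisson probability of count y with mean mu."""
    return math.exp(-mu + y * math.log(mu) - math.lgamma(y + 1))


def zero_inflated_pmf(y, p_bin_zero, count_pmf):
    """Pr(Y = y) for a zero-inflated model.

    p_bin_zero is Pr(Bin = 0); count_pmf maps a count to its probability.
    """
    if y == 0:
        # Zero from the binary component, or a zero from the count component.
        return p_bin_zero + (1.0 - p_bin_zero) * count_pmf(0)
    # Positive counts can only come from the count component.
    return (1.0 - p_bin_zero) * count_pmf(y)


p_bin_zero, mu = 0.3, 2.0  # illustrative (assumed) values


def count_pmf(y):
    return poisson_pmf(y, mu)


probs = [zero_inflated_pmf(y, p_bin_zero, count_pmf) for y in range(100)]
print(probs[0])    # probability of observing a zero
print(sum(probs))  # should be very close to 1
```

Because the probabilities sum to one over the support, the two-branch definition is a valid probability distribution for any count model plugged in as `count_pmf`.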

Bayesian Zero-Inflated Poisson Model

We can apply the above formulae to a zero-inflated Poisson–logit model, generally referred to as ZIP. The count component is a Poisson model and the binary component is a Bernoulli logistic model. Aside from the Poisson PDF, the key formulae for the ZIP model are the probability of a zero for a logistic model, 1/[1 + exp(xβ)], and the probability of a zero Poisson count, exp(−μ). Given that μ = exp(xβ), the probability of a zero Poisson count with respect to the linear predictor xβ is exp[−exp(xβ)]. The probability of a nonzero count is 1 − Pr(0), that is, 1 − exp(−μ) or equivalently 1 − exp[−exp(xβ)]. Here xβ stands for the linear predictor of whichever component is being described; the binary and count components each have their own coefficients. The zero-inflated Poisson–logit model log-likelihood is given by the following expressions:[…]
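The chapter's own log-likelihood expressions are truncated in this excerpt. The sketch below is therefore not a reproduction of them; it only combines the quantities quoted above (the logistic zero probability 1/[1 + exp(xβ)], the Poisson zero probability exp(−μ), and the Poisson PDF) in the usual mixture form. The names `eta_bin` and `eta_count` for the two linear predictors, and the numeric values used, are assumptions made for illustration.

```python
import math


def zip_loglik(y, eta_bin, eta_count):
    """Log-likelihood of one observation under a ZIP (Poisson-logit) sketch.

    eta_bin   : linear predictor of the logistic component (assumed name)
    eta_count : linear predictor of the Poisson component (assumed name)
    """
    p_bin_zero = 1.0 / (1.0 + math.exp(eta_bin))  # Pr(Bin = 0)
    mu = math.exp(eta_count)                      # Poisson mean
    if y == 0:
        # A zero can come from either component.
        return math.log(p_bin_zero + (1.0 - p_bin_zero) * math.exp(-mu))
    # A positive count must come from the Poisson component.
    return (math.log(1.0 - p_bin_zero)
            - mu + y * eta_count - math.lgamma(y + 1))


# Illustrative (assumed) data and predictor values.
ys = [0, 0, 3, 1, 0, 5]
total = sum(zip_loglik(y, -0.5, 0.7) for y in ys)
print(total)
```

In a regression setting each observation would have its own pair of linear predictors computed from its covariates; constant values are used here only to keep the sketch short. Software packages differ in which binary outcome they label as the structural zero, so the sign convention for `eta_bin` should be checked against the tool in use.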

Type: Chapter
Information: Bayesian Models for Astrophysical Data: Using R, JAGS, Python, and Stan, pp. 184–214

DOI: https://doi.org/10.1017/CBO9781316459515.008

Publisher: Cambridge University Press

Print publication year: 2017


