Skip to main content Accessibility help
×
Hostname: page-component-848d4c4894-jbqgn Total loading time: 0 Render date: 2024-07-04T22:11:09.287Z Has data issue: false hasContentIssue false

7 - GLMs Part III – Zero-Inflated and Hurdle Models

Published online by Cambridge University Press:  11 May 2017

Joseph M. Hilbe
Affiliation:
Jet Propulsion Laboratory, California Institute of Technology
Rafael S. de Souza
Affiliation:
Eötvös Loránd University, Budapest
Emille E. O. Ishida
Affiliation:
Université Clermont-Auvergne (Université Blaise Pascal), France
Get access

Summary

Zero-inflated models are mixture models. In the domain of count models, zero-inflated models involve the mixtures of a binary model for zero counts and a count model. It is a mixture model because the zeros are modeled by both the binary and the count components of a zero-inflated model.

The logic of a zero-inflated model can be expressed as

Pr(Y = 0) : Pr(Bin = 0) + [1 − Pr(Bin = 0)] × Pr(Count = 0)

Pr(Y ≥ 0) : 1 − Pr(Bin = 0) + PDFcount

Thus, the probability of a zero in a zero-inflated model is equal to the probability of a zero in the binary model component (e.g., the logistic) plus one minus the probability of a zero in the binary model times the probability of a zero count in the count model component. The probability that the response is greater than or equal to zero (as in e.g. the Poisson model) is equal to one minus the probability of a zero in the binary component plus the count model probability distribution. The above formulae are valid for all zero-inflated models.

Bayesian Zero-Inflated Poisson Model

We can apply the above formulae for a zero-inflated Poisson–logit model. The count component is a Poisson model and the binary component is a Bernoulli logistic model. Aside from the Poisson PDF, the key formulae for the zero-inflated Poisson–logit model, generally referred to as ZIP, include the probability of zero for a logistic model, 1/[1 + exp()], and the probability of a zero Poisson count, exp(−μ). Given that μ = exp(), the probability of a zero Poisson count with respect to the linear predictor is exp[− exp()]. The probability of all but zero counts is 1 − Pr(0), or 1 − exp[− exp()] or 1 − exp(−μ). The zero-inflated Poisson–logit model log-likelihood is given by the following expressions:[…]

Type
Chapter
Information
Bayesian Models for Astrophysical Data
Using R, JAGS, Python, and Stan
, pp. 184 - 214
Publisher: Cambridge University Press
Print publication year: 2017

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Cameron, E. (2011). “On the estimation of confidence intervals for binomial population proportions in astronomy: the simplicity and superiority of the Bayesian approach.” Publ. Astronom. Soc. Australia 28, 128–139. DOI: 10.1071/AS10046. arXiv:1012.0566 [astro-ph.IM].
de Souza, R. S., E., Cameron, M., Killedar, J. M., Hilbe, R., Vilalta, U., Maio, V., Biffi et al. (2015). “The overlooked potential of generalized linear models in astronomy, I: Binomial regression.” Astron. Comput. 12, 21–32. DOI: http://dx.doi.org/10.1016/j.ascom.2015.04.002.
Elliott, J., R. S., de Souza, A., Krone-Martins, E., Cameron, E. O., Ishida, and J. M., Hilbe (2015). “The overlooked potential of generalized linear models in astronomy, II: gamma regression and photometric redshifts.” Astron. Comput. 10, 61–72. DOI: 10.1016/j.ascom.2015.01.002. arXiv: 1409.7699 [astro-ph.IM].
Hardin, J. W. and J. M., Hilbe (2012). Generalized Linear Models and Extensions, Third Edition. Taylor & Francis.
Hilbe, J. M. (2011). Negative Binomial Regression, Second Edition. Cambridge University Press.
Hilbe, J. M. (2014). Modeling Count Data. Cambridge University Press.
Hilbe, J. M. (2015). Practical Guide to Logistic Regression. Taylor & Francis.
McElreath, R. (2016). Statistical Rethinking: A Bayesian Course with Examples in R and Stan. Chapman & Hall/CRC Texts in Statistical Science. CRC Press.
Smithson, M. and E. C., Merkle (2013). Generalized Linear Models for Categorical and Continuous Limited Dependent Variables. Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences. Taylor & Francis.
Zuur, A. F., J. M., Hilbe, and E. N., Ieno (2013). A Beginner's Guide to GLM and GLMM with R: A Frequentist and Bayesian Perspective for Ecologists. Highland Statistics.

Save book to Kindle

To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×