Overview of Linear Models

doi:10.1017/CBO9781139342674.002

2 - Overview of Linear Models

from II - Predictive Modeling Foundations

Published online by Cambridge University Press: 05 August 2014

Marjorie Rosenberg and

James Guszcza

Edited by

Edward W. Frees ,

Richard A. Derrig and

Glenn Meyers

Show author details

Marjorie Rosenberg: Affiliation:
University of Wisconsin-Madison
James Guszcza: Affiliation:
University of Wisconsin-Madison
Edward W. Frees: Affiliation:
University of Wisconsin, Madison
Richard A. Derrig: Affiliation:
Temple University, Philadelphia
Glenn Meyers: Affiliation:
ISO Innovative Analytics, New Jersey

Book contents

Get access

Summary

Chapter Preview. Linear modeling, also known as regression analysis, is a core tool in statistical practice for data analysis, prediction, and decision support. Applied data analysis requires judgment, domain knowledge, and the ability to analyze data. This chapter provides a summary of the linear model and discusses model assumptions, parameter estimation, variable selection, and model validation around a series of examples. These examples are grounded in data to help relate the theory to practice. All of these practical examples and exercises are completed using the open-source R statistical computing package. Particular attention is paid to the role of exploratory data analysis in the iterative process of criticizing, improving, and validating models in a detailed case study. Linear models provide a foundation for many of the more advanced statistical and machine-learning techniques that are explored in the later chapters of this volume.

Introduction

Linear models are used to analyze relationships among various pieces of information to arrive at insights or to make predictions. These models are referred to by many terms, including linear regression, regression, multiple regression, and ordinary least squares. In this chapter we adopt the term linear model.

Linear models provide a vehicle for quantifying relationships between an outcome (also referred to as dependent or target) variable and one or more explanatory (also referred to as independent or predictive) variables.

Type: Chapter
Information: Predictive Modeling Applications in Actuarial Science , pp. 13 - 64

DOI: https://doi.org/10.1017/CBO9781139342674.002 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control 19(6), 716–723.CrossRef Google Scholar

American Diabetes Association (2013a). Diabetes information. http://www.diabetes.org/diabetes-basics/?loc=GlobalNavDB.

American Diabetes Association (2013b). Diabetes information. http://www.diabetes.org/living-with-diabetes/complications/.

Brien, R. (2007). A caution regarding rules of thumb for variance inflation factors. Quality & Quantity 41, 673–690.Google Scholar

Centers with Disease Control and Prevention (2013). Body mass index information. http://www.cdc.gov/healthyweight/assessing/bmi/adult_bmi/index.html.

Dall, T. M., S. E., Mann, Y., Zhang, J., Martin, Y., Chen, and P., Hogan (2008). Economic costs of diabetes in the U.S. in 2007. Diabetes Care 31(3), 1–20.Google Scholar

Draper, N. R. and H. S., Smith (1998). Applied Regression Analysis. Wiley, New York.CrossRef Google Scholar

Freedman, D. A. (2005). Statistical Models: Theory and Practice. Cambridge University Press, New York.CrossRef Google Scholar

Frees, E. W. (2010). Regression Modeling with Actuarial and Financial Applications. Cambridge University Press, New York.Google Scholar

Gelman, A. and J., Hill (2008). Data Analysis Using Regression and Multilevel/Hierarchical Modeling. Cambridge University Press, New York.Google Scholar

Hastie, T., R., Tibshirani, and J., Friedman (2009). The Elements of Statistical Learning: Data Mining, Inference and Prediction (2nd edition). Springer-Verlag, New York.CrossRef Google Scholar

Little, R. and D., Rubin (2002). Statistical Analysis with Missing Data, Second Edition. Wiley, New Jersey.CrossRef Google Scholar

Medical Expenditure Panel Survey (2013). Meps website. www.meps.ahrq.gov/.

Rubin, D. (1976). Inference and missing data. Biometrika 63(3), 581–592.CrossRef Google Scholar

Stigler, S. M. (1986). The History of Statistics: The Measurement of Uncertainty before 1900. Belknap Press, Cambridge, MA.Google Scholar

R, Core Team (2013). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org.Google Scholar

Tukey, J. (1977). Exploratory Data Analysis. Addison-Wesley, Reading, Massachusetts.Google Scholar

Book contents

2 - Overview of Linear Models

Summary

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive