This class is all about applying regression analysis and linear models, including generalized linear models, mediation, and moderation, with a few machine learning techniques thrown in. The book we’ll use throughout the class, which also drives the structure of the lecture slides, is Regression Analysis and Linear Models by Richard Darlington and Andrew Hayes. This course uses R and RStudio for all data analyses.
The examples use a subset of the General Social Survey data set, a data set on high-risk youth used in Quas et al., and a data set on poverty, violence, and teen birth rates by state. We will also occasionally pull from FiveThirtyEight’s open data on GitHub (many of these data sets can also be used for your class project, provided they have both continuous and categorical predictors). Finally, a small (fictitious) data set about the television shows The Office (US) and Parks and Recreation is also available.
|Homework||HTML (Easier to Read)||RMD (To Work With)|
|Final Project||HTML||Example Final Project PDF and Word|