Before the first class meeting, read Chapter 1 Why Git? Why GitHub? from Happy Git and GitHub for the useR.
Read chapter 1 - Software for modeling from Tidy Modeling with R.
Read as needed Happy Git and GitHub for the useR
Start Chapters 1 and 2 in Introduction to Statistics in R on DataCamp—Due Jan 19 NLT 5:00 PM
Sign-up for a free account on GitHub. When you register for a free individual GitHub account, request a student discount to obtain a few private repositories as well as unlimited public repositories. Please use something similar to FirstNameLastName as your username when you register with GitHub. For example, my username on GitHub is alanarnholt. If you have a popular name such as John Smith, you may need to provide some other distinguishing characteristic in your username.
Send an email to arnholtat@appstate.edu with a Subject line of STT3860 - GitHub Username, and tell me in the body of your email your first name, last name, and your GitHub username.
Make sure RStudio is set up to communicate with Git by following the directions in HappyGitWithR for introducing yourself to Git.
Cache your credentials and set up a personal access token (PAT) by following the directions in HappyGitWithR.
TL;DR the chapters in Happy Git With R — follow this document to Set up Git and GitHub
Optional Resources:
Read the chapter 2 - A Tidyverse Primer from Tidy Modeling with R.
Start Chapters 3 and 4 in Introduction to Statistics in R on DataCamp—Due Jan 23 NLT 5:00 PM
Read chapter 3 - A Review of R Modeling Fundamentals from Tidy Modeling with R.
Start reproduction of Introduction to Statistics in R—Due Feb 2 NLT 5:00 PM
Finish reproduction of Introduction to Statistics in R—Due Feb 2 NLT 5:00 PM
Read chapter 4 - The Ames Housing Data from Tidy Modeling with R.
Read Chapters 1-5 of Correlation and Regression
Start Chapters 1 and 2 in Modeling with Data in the Tidyverse on DataCamp—Due Feb 6 NLT 5:00 PM
Read chapter 5 - Spending our Data from Tidy Modeling with R.
Start Chapters 3 and 4 in Modeling with Data in the Tidyverse on DataCamp—Due Feb 13 NLT 5:00 PM
Misc Regression topics
Read chapter 6 - Fitting Models with parsnip from Tidy Modeling with R
Read Appendix E.
Start reproduction of Modeling with Data in the Tidyverse—Due Feb 23 NLT 5:00 PM
In class review this document.
Finish reproduction of Modeling with Data in the Tidyverse—Due Feb 23 NLT 5:00 PM
Read chapter 7 - A Model Workflow from Tidy Modeling with R
Start Chapters 1 and 2 in Modeling with Tidymodels in R on DataCamp—Due Feb 27 NLT 5:00 PM
Read Not mtcars AGAIN
Read chapter 8 - Feature Engineering with recipes from Tidy Modeling with R
Start Chapters 3 and 4 in Modeling with Tidymodels in R on DataCamp—Due Mar 6 NLT 5:00 PM
Start reproduction of Modeling with Tidymodels in R - Due Mar 19 NLT 5:00 PM
Read chapter 9 - Judging Model Effectiveness from Tidy Modeling with R
Work on reproduction of Modeling with Tidymodels in R - Due Mar 19 NLT 5:00 PM
Directions for Project 1
Accept Project 1 - Presentations will be March 31.
Read chapter 10 - Resampling for Evaluating Performance from Tidy Modeling with R
Work on Project 1 - Presentations will be March 31.
Group presentations for Project 1—Mar 31
Read chapter 11 Comparing Models with Resampling from Tidy Modeling with R
Read Tree Based Methods
Read Chapters 2-4 in Tree based Models in R
Start Chapters 1 and 2 in Machine Learning with Tree Based Models in R on DataCamp—Due Apr 6 NLT 5:00 PM
Optional Resources:
Review as needed Train and Test
Review as needed Regression Trees
Review as needed Predictive Model Building
Read chapter 12 - Model Tuning and the Dangers of Overfitting from Tidy Modeling with R
Read Chapters 5-6 in Tree based Models in R
Start Chapters 3 and 4 in Machine Learning with Tree Based Models in R on DataCamp—Due Apr 13 NLT 5:00 PM
Start reproduction of Machine Learning with Tree Based Models in R—Due Apr 20 NLT 5:00 PM
Optional Resources:
Review as needed Train and Test
Review as needed Regression Trees
Review as needed Predictive Model Building
Read chapter 13 - Grid Search from Tidy Modeling with R
Work on reproduction of Machine Learning with Tree Based Models in R—Due Apr 20 NLT 5:00 PM
Read Stack Overflow
Read VOTERS
Read What do nuns think?
Finish reproduction of Machine Learning with Tree Based Models in R - Due Apr 20 NLT 5:00 PM
Directions for Project 2
Accept Project 2 —Group presentations will be May 7 during the final exam period (8 am—10:30 am).