Project Two Directions

Published

Last modified on March 08, 2025 07:18:33 Eastern Standard Time

The goal of this project is to create several different classification models and select the model that best predicts the response variable in a future data set the model has never seen.

  1. Form a group of no fewer than two and no greater than four classmates.
  2. Select one of the following data sets from the modeldata package:
Data Predict
credit_data Status
attrition Attrition
leaf_id_flavia species
mlc_churn churn
taxi tip
  1. Create at least three classification models using different specifications from the parsnip package.
  2. Create train and test sets.
  3. Use ten-fold cross-validation to tune all hyperparameters in your models.
  4. Report appropriate metrics (AUC, accuracy, etc.) for your trained models.
  5. Use the provided Quarto template as a start to creating a 6 to 10 minute presentation. Hide code you do not want to show during your presentation; but, make sure all the code you used to create the models is embedded in the slides.