Data | Predict |
---|---|
credit_data | Status |
attrition | Attrition |
leaf_id_flavia | species |
mlc_churn | churn |
taxi | tip |
Project Two Directions
The goal of this project is to create several different classification models and select the model that best predicts the response variable in a future data set the model has never seen.
- Form a group of no fewer than two and no greater than four classmates.
- Select one of the following data sets from the
modeldata
package:
- Create at least three classification models using different specifications from the
parsnip
package. - Create
train
andtest
sets. - Use ten-fold cross-validation to tune all hyperparameters in your models.
- Report appropriate metrics (AUC, accuracy, etc.) for your trained models.
- Use the provided Quarto template as a start to creating a 6 to 10 minute presentation. Hide code you do not want to show during your presentation; but, make sure all the code you used to create the models is embedded in the slides.
Your presentation should include:
- The models you evaluated and their performance metrics on the test set.
- The steps you performed in your
recipe
. - An evaluation of the time versus benefit of each model’s creation.