
Model Tuning Intuition 201

Explore model tuning with decision trees, using ten-fold cross-validation to balance bias and variance. Understand how to compare sets of hyperparameter values to improve prediction accuracy and estimate generalization error without leaking test data.

Back to the darts

This lesson combines several topics. Assume there is some training data, access to the CART classification tree algorithm, and a set of hyperparameter values. This is everything needed to perform ten-fold cross-validation (CV). In terms of the bias-variance tradeoff, each CV iteration is conceptually one dart thrown at the dartboard.

Best practice is to run CV once for each candidate set of hyperparameter values. Assuming there are four such sets, the following image visualizes the cross-validation runs in terms of the bias-variance tradeoff:
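The loop described above (for each hyperparameter set, run ten-fold CV and summarize the fold scores) can be sketched in pure Python. This is a minimal illustration, not the lesson's actual setup: the synthetic data, the `stump_score` helper (a one-split "stump" standing in for a real CART tree), and the two toy "hyperparameter sets" `use_split=False/True` are all assumptions made for the example.

```python
import random
from statistics import mean, stdev

def k_fold_indices(n, k=10, seed=0):
    """Shuffle the row indices, then deal them into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def stump_score(train, test, use_split):
    """Toy stand-in for a CART fit/score call (not the real CART algorithm).
    use_split=False predicts the training majority class (a high-bias model);
    use_split=True learns one threshold on x, like a single CART split."""
    if not use_split:
        ys = [y for _, y in train]
        majority = max(set(ys), key=ys.count)
        predict = lambda x: majority
    else:
        # pick the training threshold with the highest training accuracy
        best_t = max((x for x, _ in train),
                     key=lambda t: mean((x > t) == y for x, y in train))
        predict = lambda x: x > best_t
    return mean(predict(x) == y for x, y in test)

# Synthetic data: the true rule is "x > 0.5", with 10% label noise.
rng = random.Random(1)
data = []
for _ in range(200):
    x = rng.random()
    flipped = rng.random() < 0.1
    data.append((x, (x > 0.5) != flipped))

folds = k_fold_indices(len(data), k=10)
results = {}
for use_split in (False, True):  # the two candidate "hyperparameter sets"
    scores = []
    for i in range(10):
        test = [data[j] for j in folds[i]]
        train = [data[j] for f in folds[:i] + folds[i + 1:] for j in f]
        scores.append(stump_score(train, test, use_split))
    # mean ~ where the ten darts cluster; stdev ~ how widely they scatter
    results[use_split] = (mean(scores), stdev(scores))
    print(f"split={use_split}: mean acc={mean(scores):.2f}  sd={stdev(scores):.2f}")
```

Each of the ten fold scores plays the role of one dart: the mean across folds estimates generalization accuracy for that hyperparameter set, and the spread across folds hints at its variance, all without ever touching held-out test data.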

Visualizing cross-validation runs using the dartboard analogy

Notice in the illustration that each ...