Tuning Random Forests
Learn why tuning the random forest algorithm is relatively easy.
Random forest and the bias-variance tradeoff
The random forest algorithm was designed to address aspects of the bias-variance tradeoff without requiring direct hyperparameter tuning. This sets random forests apart from algorithms like CART decision trees and boosted decision trees (e.g., XGBoost). The following illustration maps the random forest algorithm’s design to the bias-variance tradeoff.
Here are a few things to consider:
First, the random forest’s bagging and feature randomization provide each ensemble tree with only a limited view of the training data. So, there’s little concern about individual ensemble trees overfitting (i.e., the lower right in the illustration).
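To make this first point concrete, here’s a minimal sketch of the data a single ensemble tree actually sees, using base R and the built-in iris data (both illustrative choices, not part of the lesson). In practice, the randomForest package performs both sampling steps internally.

```r
# Sketch of the row and feature sampling one ensemble tree receives.
set.seed(1234)

n_rows     <- nrow(iris)
predictors <- setdiff(names(iris), "Species")

# Bagging: sample rows with replacement, so each tree sees only
# about 63% of the unique training observations.
bag_indices <- sample(n_rows, size = n_rows, replace = TRUE)
tree_data   <- iris[bag_indices, ]

# Feature randomization: at each split, consider only a random subset
# of predictors (a common classification default is sqrt of the count).
mtry       <- floor(sqrt(length(predictors)))
split_vars <- sample(predictors, size = mtry)

length(unique(bag_indices)) / n_rows  # fraction of unique rows per tree
split_vars                            # candidate features for one split
```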
Second, because there’s no concern for overfitting, the random forest algorithm sets the CART minbucket hyperparameter to 1. This setting allows each ensemble tree to grow as deep and complex as its bootstrap training data allows. Deep, complex trees address underfitting (i.e., the upper left in the illustration). ...