Data Science with R: Decision Trees and Random Forests/

...

Feature Importance

Learn how the random forest algorithm determines the most important features for making accurate predictions.

We'll cover the following...

Finding the features that matter
Another use of OOB data
Determining important features
Listing important features

Finding the features that matter

When using machine learning, it’s natural to ask, “Which features are the most important for making accurate predictions?” The random forest implements permutation importance to help answer this question. Permutation importance works by randomly shuffling (permuting) feature data and assessing the impact of the shuffling on the quality of predictions.

Here’s the intuition of permutation importance:

If you permute the values of highly predictive features, tree accuracy should decrease a lot.
If you permute the values of features that aren’t predictive, tree accuracy shouldn’t decrease much.

Imagine the worst feature possible —a set of completely random values. Theoretically, tree accuracy would not decrease if you permute the feature values. ...

Welcome to the Course

Supervised Learning

Classification Tree Math

Using Classification Trees in R

Introducing the Bias-Variance Tradeoff

Model Tuning

Model Tuning with tidymodels

Feature Engineering

Regression Trees

The Random Forest Algorithm

Using Random Forests

Gradient Boosting Trees

Continuing Your Journey

Credit Card Fraud Detection using the R Language

Feature Importance

Finding the features that matter