What is a pruning algorithm?

The most commonly used pruning algorithm on decision trees is the Alpha-Beta pruning. You can have a quick look at it over here.

There are multiple ways to prune your decision tree. Some of which are:

Pruning by information gain
Pruning by classification performance on the validation set

Pruning by information gain makes use of the information initially available when the tree is built from the training data.

Pruning by classification performance on the validation set makes use of the validation dataset and prunes the decision tree according to the best classification on the validation dataset.

Pruning by information gain

The algorithm is as follows:

Catalog all twigsnodes whose children are all leaves.
Keep a total count of all the leaves in the tree.
Keep a threshold of the number of leaves in the tree needed.
Loop until the number of leaves in the tree exceeds the set threshold.
Find the twig which gives the least information gain.
Take the twig and remove its children.
We remove the children because we aren’t gaining enough information from the node, and hence the node can be declared irrelevant.
Now relabel the twig to be a leaf.
Change the leaf count.

Free Resources

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments

TRENDING TOPICS

Learn to Code

Tech Interview Prep

Generative AI

Data Science

Machine Learning

GitHub Students Scholarship

Early Access Courses

Blind 75

Layoffs

Pricing

For Individuals

Try for Free

Gift a Subscription

CONTRIBUTE

Become an Author

Become an Affiliate

Earn Referral Credits

RESOURCES

Blog

Cheatsheets

Webinars

Answers

ABOUT US

Our Team

Careers

Hiring

Frequently Asked Questions

Press

LEGAL

Cookie Policy

Business Terms of Service

Data Processing Agreement

INTERVIEW PREP COURSES

Grokking the Modern System Design Interview

Grokking the Product Architecture Design Interview

Grokking the Coding Interview Patterns

Machine Learning System Design

What is a pruning algorithm?

Introduction

Methods of pruning

Pruning algorithm

Pruning by information gain

Pruning by classification performance on the validation set