A decision tree is a supervised machine learning model used to predict class labels in a given dataset. It uses a flowchart-like structure in which each internal node represents a feature or attribute, and each branch represents a value of that feature. A leaf node represents the final decision, that is, the class the model predicts.
Entropy is a measure of impurity or disorder within a dataset. In the context of decision trees, entropy helps determine which feature to use as a decision node when performing a split in the tree.
The entropy of a dataset is determined by the distribution of its classes. If all the samples belong to a single class, the entropy is 0. If the samples are split evenly between two classes, the entropy is 1, its maximum value for a binary classification problem.
The entropy of a given dataset can be measured using the equation below:

$\text{Entropy}(S) = -\sum_{i} p_i \log_2(p_i)$

Where,
S = the dataset.
i = a unique class in S.
p_i = the proportion of examples in S belonging to class i.
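As an illustration, here is a minimal sketch of this formula in Python. The `entropy` helper and the example label lists are ours, not part of the original example:

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy (in bits) of a list of class labels."""
    total = len(labels)
    return sum(
        -(count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )

# A pure set has entropy 0; an even two-class split has entropy 1.
print(entropy(["yes", "yes", "yes", "yes"]))  # 0.0
print(entropy(["yes", "yes", "no", "no"]))    # 1.0
```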
Information gain measures the reduction in disorder (entropy) achieved when the dataset is split on a specific feature.
When performing a split, we calculate the information gain for each candidate feature and select the feature with the highest value, as it provides the greatest reduction in entropy.
Below is the formula used to calculate the information gain for a specific feature:

$\text{Gain}(S, F) = \text{Entropy}(S) - \sum_{v \in \text{Values}(F)} \frac{|S_v|}{|S|} \, \text{Entropy}(S_v)$

Where,
S = the dataset.
F = the feature to be used for the split.
v = a unique value of feature F, and S_v is the subset of S for which F has the value v.
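As a hedged sketch, this formula could be implemented in Python as follows. The `rows`/`labels` representation (a list of dicts with a parallel list of class labels) is an assumption made for illustration:

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy (in bits) of a list of class labels."""
    total = len(labels)
    return sum(
        -(count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )

def information_gain(rows, labels, feature):
    """Entropy of the full label set minus the weighted entropy of the
    label subsets produced by splitting on `feature`.

    `rows` is a list of dicts (feature name -> value) and `labels` holds
    the class label of each row, in the same order.
    """
    total = len(labels)
    gain = entropy(labels)
    for value in set(row[feature] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
        gain -= (len(subset) / total) * entropy(subset)
    return gain
```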
Below is a sample dataset on which we will build a decision tree.
In the dataset, we see that we are to determine whether or not there is going to be a storm, using the following features:
Weather
Temperature
Humidity
Wind Speed
Our first step would be to calculate the entropy of the given dataset. In the dataset, we can see that we have 5 rows that show a negative output and 9 rows that show a positive output.
So our entropy would be:

$\text{Entropy}(S) = -\frac{9}{14}\log_2\!\left(\frac{9}{14}\right) - \frac{5}{14}\log_2\!\left(\frac{5}{14}\right) \approx 0.940$
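Using the class counts above (9 positive and 5 negative rows), the same value can be reproduced with a few lines of Python:

```python
import math

positive, negative = 9, 5            # class counts from the dataset above
total = positive + negative          # 14 rows in total

dataset_entropy = (
    -(positive / total) * math.log2(positive / total)
    - (negative / total) * math.log2(negative / total)
)
print(round(dataset_entropy, 4))     # 0.9403
```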
Now that we have calculated the entropy of the dataset, the next task is to find the information gain of specific features against the overall dataset.
First, we would have to calculate the entropy for "Sunny," "Cloudy," and "Rain" in the given dataset. The final values can be seen below:
Now, we would need to find the information gain. Below, we can see the calculations required for finding the information gain for the weather column.
First, we would have to calculate the entropy for "Hot," "Warm," and "Cool" in the given dataset. The final values can be seen below:
Below, we can see the calculations required for finding the information gain for the temperature column:
First, we must calculate the entropy for High and Normal in the given dataset. The final values can be seen below:
Below, we can see the calculations required for finding the information gain for the humidity column:
First, we must calculate the entropy for "Strong" and "Weak" in the given dataset. The final values can be seen below:
Below, we can see the calculations required for finding the information gain for the wind speed column:
Now that we have calculated the information gain values for all the features in our dataset, it is time to determine the feature on which to perform the initial split.
Below is a table summarizing all the information gain values for different features.
| Feature | Information gain |
|---|---|
| Weather | 0.2464 |
| Temperature | 0.0289 |
| Humidity | 0.1516 |
| Wind Speed | 0.0478 |
As we can see, the weather column has the highest information gain of all the features; hence, it will be used to perform the split.
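As a quick sketch, picking the split feature amounts to taking the maximum over the table above (the `gains` dict below simply mirrors that table):

```python
# Information gain values from the summary table above
gains = {
    "Weather": 0.2464,
    "Temperature": 0.0289,
    "Humidity": 0.1516,
    "Wind Speed": 0.0478,
}

best_feature = max(gains, key=gains.get)
print(best_feature)  # Weather
```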
The decision tree would look like the diagram below.
Note: When we follow the cloudy route, we end up with 4 remaining rows {3, 7, 12, 13}. None of these rows has a negative output, so we create a leaf node there with a positive output.
Now that we have learned how to perform a split on a dataset using entropy values, it's time to complete the above diagram and find the remaining nodes in the decision tree.
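For reference, the split-and-recurse procedure we keep repeating can be sketched as an ID3-style function in Python. This is only an illustrative sketch, not the article's implementation; it assumes the same list-of-dicts representation used in the earlier snippets, and the dataset rows themselves are not reproduced here:

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy (in bits) of a list of class labels."""
    total = len(labels)
    return sum(-(c / total) * math.log2(c / total)
               for c in Counter(labels).values())

def information_gain(rows, labels, feature):
    """Reduction in entropy obtained by splitting `rows` on `feature`."""
    total = len(labels)
    gain = entropy(labels)
    for value in set(row[feature] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
        gain -= (len(subset) / total) * entropy(subset)
    return gain

def build_tree(rows, labels, features):
    """Recursively build an ID3-style decision tree.

    Returns either a class label (a leaf) or a dict of the form
    {feature: {value: subtree, ...}}.
    """
    # Pure subset: every row has the same label, so make a leaf.
    if len(set(labels)) == 1:
        return labels[0]
    # No features left to split on: predict the majority label.
    if not features:
        return Counter(labels).most_common(1)[0][0]
    # Split on the feature with the highest information gain.
    best = max(features, key=lambda f: information_gain(rows, labels, f))
    tree = {best: {}}
    for value in set(row[best] for row in rows):
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        tree[best][value] = build_tree(
            [rows[i] for i in idx],
            [labels[i] for i in idx],
            [f for f in features if f != best],
        )
    return tree
```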
Below, we can see the reference tables for the new sub-datasets along the "Sunny" and "Rain" paths, on which we will perform our calculations.
When we are done with the exercise, the final decision tree should look like the one in the diagram below: