Many Categories Impurity

Learn how CART decision trees handle categorical features with more than two categories.

Multivalue attributes

When building decision trees, the CART algorithm uses only two-way (i.e., binary) data splits. CART classification trees are constructed using the Gini gain calculation. This lesson expands this knowledge by teaching how the CART classification tree algorithm handles a widespread situation in business data—categorical features with more than two values.

Consider the following Adult Census Income data sample:

Get hands-on with 1400+ tech skills courses.