Many Categories Impurity
Learn how CART decision trees handle categorical features with more than two categories.
We'll cover the following
Multivalue attributes
When building decision trees, the CART algorithm uses only two-way (i.e., binary) data splits. CART classification trees are constructed using the Gini gain calculation. This lesson expands this knowledge by teaching how the CART classification tree algorithm handles a widespread situation in business data—categorical features with more than two values.
Consider the following Adult Census Income data sample:
Get hands-on with 1400+ tech skills courses.