Summary

Explore how to manage categorical data by distinguishing between quantitative and qualitative variables. Learn how to create dummy variables using pandas to prepare data for machine learning models, and discover techniques to eliminate redundant features for better model accuracy and reduced variance.

In this chapter, we learned about the following concepts.

Quantitative vs. qualitative data

  • Data can be quantitative or qualitative. Quantitative (numerical) data consists of numerical variables, which can be discrete or continuous. Qualitative (categorical) data consists of categorical variables, which describe characteristics and can be nominal or ordinal.

  • Since we can’t perform mathematical operations on categorical data, we must convert it to a numerical form before feeding it to our machine ...
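
As a rough illustration of the ideas summarized above, the following sketch uses `pandas.get_dummies` to turn a nominal column into dummy variables, and then drops one redundant dummy column. The DataFrame, its column names (`color`, `price`), and its values are made up for this example and do not come from the chapter.

```python
import pandas as pd

# Hypothetical toy data: "color" is qualitative (nominal), "price" is quantitative.
df = pd.DataFrame({
    "color": ["red", "green", "blue", "green"],
    "price": [10.0, 12.5, 15.0, 11.0],
})

# One dummy column per category: color_blue, color_green, color_red.
all_dummies = pd.get_dummies(df, columns=["color"], dtype=int)

# drop_first=True removes the first category's column (color_blue here).
# That column is redundant: a row with 0 in every remaining dummy must be blue.
reduced = pd.get_dummies(df, columns=["color"], drop_first=True, dtype=int)

print(all_dummies)
print(reduced)
```

With three categories, two dummy columns carry the same information as three, which is why dropping one is a common way to avoid a redundant feature.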