

Categorical Distribution Plots


Let’s learn about plots that show the distribution of observations at each level of a categorical variable.

As discussed in the previous lesson, several kinds of plots can show how observations are distributed at each level of a categorical variable.

Let’s look at a few more.

The boxplot()

The boxplot() function draws a box plot for the quantitative values observed at each level of a categorical variable. A box plot (also known as a box-and-whisker plot) shows the distribution of quantitative data in a way that facilitates comparisons between variables or across levels of a categorical variable.
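As a minimal sketch of such a comparison across levels, the snippet below draws one box per day for a small, made-up table of restaurant bills. The column names `day` and `total_bill` and all the values are assumptions chosen for illustration, not data from this lesson.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so the script also runs without a display

import pandas as pd
import seaborn as sns

# Hypothetical data: ten bills recorded on two different days
bills = pd.DataFrame({
    "day": ["Thur"] * 5 + ["Fri"] * 5,
    "total_bill": [12.5, 15.0, 18.2, 20.1, 24.9, 9.8, 14.3, 16.7, 22.4, 31.0],
})

# One box per level of the categorical column "day";
# the quantitative column "total_bill" goes on the y-axis
ax = sns.boxplot(data=bills, x="day", y="total_bill")
ax.set_title("Total bill by day")
```

In an interactive session, the figure can be displayed with matplotlib's `plt.show()` after switching to an interactive backend, or saved with `ax.get_figure().savefig(...)`.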

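The box shows the quartiles of the data set, while the whiskers extend to show the rest of the distribution, except for points that are classified as outliers. Outliers are identified using the inter-quartile range (IQR), the distance between the first and third quartiles: by default, seaborn treats any point that lies more than 1.5 times the IQR below the first quartile or above the third quartile as an outlier.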
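To make the relationship between the box, the whiskers, and the outliers concrete, here is a rough sketch that computes them by hand using only the Python standard library. It assumes the common convention of flagging points more than 1.5 times the inter-quartile range away from the box, and it uses the `"inclusive"` interpolation method of `statistics.quantiles`; plotting libraries may compute quartiles slightly differently. The sample values are made up.

```python
from statistics import quantiles

# Hypothetical sample; 40 is deliberately far from the other values
data = [7, 9, 10, 11, 12, 13, 14, 15, 16, 40]

# The three quartiles split the sorted data into four equal groups
q1, median, q3 = quantiles(data, n=4, method="inclusive")

# Inter-quartile range: the height of the box
iqr = q3 - q1

# Fences: points beyond these limits are treated as outliers
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

# Whiskers extend to the most extreme points that are still inside the fences
inside = [x for x in data if lower_fence <= x <= upper_fence]
whisker_low, whisker_high = min(inside), max(inside)

print(f"Q1={q1}, median={median}, Q3={q3}, IQR={iqr}")
print(f"whiskers: {whisker_low} to {whisker_high}, outliers: {outliers}")
```

For this sample, the box spans 10.25 to 14.75, the whiskers reach 7 and 16, and 40 is the only point drawn separately as an outlier.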

In statistics, the quartiles of a ranked set of data values are the three points that divide the data set into four equal groups, with each group comprising a quarter of the data. A quartile is a type of quantile. The first quartile, Q1, is defined as the middle ...