Boxplot
Learn how to build a boxplot and modify its elements using ggplot2.
We'll cover the following...
Overview of the boxplot
A box and whisker plot (commonly known as a boxplot) is used to visualize multiple distributions using summary statistics. A boxplot provides information on the five summary statistics of the given dataset variable, which include:
- The minimum value
- The maximum value
- The median
- The first and third quartiles
In short, a boxplot is popularly used to display data distribution. It helps detect outliers and compare distributions of the data.
Press + to interact
Let’s study the structure of the boxplot:
- The box, as seen from the figure, is the core of the boxplot.
- The lower side of the box represents the first quartile (or Q1), i.e., the
of the data. The upper end of the box represents the third quartile (or Q3), i.e., thepercentile A percentile is a number that compares one score to the scores of the rest of the group. It displays the percentage by which a given score exceeds the other scores. For example, if a user’s score is in the 90th percentile for a test, it means that the user scored better than 90% of people who took the test.