What are different statistical measures for analysis?

Data
5
9
3
6

2. Median

Median is the value that lies in the center of the data set. To calculate the median, we first sort the data in ascending order and then choose the middle value. If the number of entries is odd, then the median is simply the center value. If this number is even, we take the average of the two central values to get the median.

Example

For the data given above, if we wish to find the median, first we sort the data as follows:

3, 5, 6, 9

Then, since the number of entries is fouran even number, with no number in the middle, we take an average of the two central values:

( 5 + 6 ) / 2.

The benefit of the median is that it ignores outliers, and gives an accurate center of the data.

3. Mode

Mode represents the most frequent value of a data set. If no values are repeated in the data, then there is no mode.

Example

For example, in the data above, there is no mode. However, if we have the following data:

5, 5, 7, 8, 9, 1, 2, 5, 8

Then the mode would be 5, since it is repeated three times.

Here, we sum the square of the difference of each value from the mean, divide it by the total number of entries, and take the square root.

Example

Data:
5, 6, 3, 2, 9, 10

Mean = (5+6+3+2+9+10) / 6 = 5.83 

SD = sqrt( ( ( 5 - 5.83)^2 + ( 6 - 5.83 )^2 + .... + ( 10 - 5.83)^2 ) / 6 )

= 2.9107

5. Range

The range is the difference between the highest and lowest point of the data. It gives us an idea of how the data is spread.

6. Percentiles

A percentile is a value or a score below which a percentage of the data falls. For example, if you have 10 mangoes and the second heaviest mango weighs 150gm, 80% of the mangoes weigh less. 150gm is the 80th percentile weight.

Formula

To get this “80,” we use the following equation:
( 10 - 2 / 10 ) * 100

7. Regression

Regression shows the relationship between a dependent and an independent variable. It explains how changes in one variable affect the other. See the formula and example graph for regression below.

Formula

Free Resources

License: Creative Commons-Attribution-ShareAlike 4.0 (CC-BY-SA 4.0)

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments

TRENDING TOPICS

Learn to Code

Tech Interview Prep

Generative AI

Data Science

Machine Learning

GitHub Students Scholarship

Early Access Courses

Blind 75

Layoffs

Pricing

For Individuals

Try for Free

Gift a Subscription

CONTRIBUTE

Become an Author

Become an Affiliate

Earn Referral Credits

RESOURCES

Blog

Cheatsheets

Webinars

Answers

ABOUT US

Our Team

Careers

Hiring

Frequently Asked Questions

Press

LEGAL

Cookie Policy

Business Terms of Service

Data Processing Agreement

INTERVIEW PREP COURSES

Grokking the Modern System Design Interview

Grokking the Product Architecture Design Interview

Grokking the Coding Interview Patterns

Machine Learning System Design

What are different statistical measures for analysis?

1. Mean

Example

2. Median

Example

3. Mode

Example

4. Standard deviation

Example

5. Range

6. Percentiles

Formula

7. Regression

Formula

Graph example