Introduction to Data Analysis and Visualization with R/

...

Data Distribution

Learn about different types of data distributions and their use cases.

We'll cover the following...

Understanding data distribution
Normal distribution
T-distribution
Uniform distribution
Skewed distribution
- Skewness
- Kurtosis
Poisson distribution
Binomial distribution
Exponential distribution
Central limit theorem

Press + to interact

We need to know these two terms to understand the distribution types:

Continuous data refers to data that can take any value within a specified range. For example, the weight of an object can be considered continuous data because it can take an infinite number of values within a particular range. In R, the double data type represents continuous data.
Discrete data, on the other hand, refers to data that can only take specific values. These values are usually whole numbers (integers) or countable data points and are distinct from one another with no intermediate values. For example, the number of siblings a person has or the number of books in a library are examples of discrete data.

Press + to interact

The normal distribution is often used to model real-world phenomena, such as the distribution of heights or IQ scores in a population. Many statistical procedures assume that the data being analyzed follows a normal distribution, so understanding and identifying normal distributions is an essential part of statistical analysis.

The empirical rule states that for a normal distribution:

68% of the data falls within one standard deviation of the mean.
95% of the data falls within two standard deviations of the mean.
99.7% of the data falls within three standard deviations of the mean.

Press + to interact

Getting Started

File Management

Data Structures

Data Cleaning

Statistical Analysis

Data Transformation

Data Visualization

Uber Data Analysis Using the R Language

Conclusion

Evaluation

Netflix Shows

Data Distribution

Understanding data distribution

Normal distribution