Overview of Data Visualization, Variables, and their Types

Learn about the fundamentals of data visualizations and variables.

We'll cover the following...

Why do we need data visualization?
What is a variable?
Variables in Python
Primitive variable types in Python
Types of variables in statistical analysis

Data is an essential component of any organization, and it is all around us. For example, when we go grocery shopping, the store records information about our purchases, such as the items we buy and their quantities. This information can help companies make critical decisions, such as identifying which products are most popular with customers and which earn the most profit.

The retailer can then analyze the data in their databases, going through the records row by row to understand the relationships between different products. This may be feasible for a small local store, but what about large e-commerce retailers like Amazon and eBay, which handle millions of customer transactions?

In that case, the dataset is so large that it would be practically impossible to draw any meaningful conclusion by going through each record. Suppose we have a restaurant dataset and need to determine whether customers with higher bills tend to give higher tips.

The graph above illustrates the relationship between the total bill and tips. If we closely observe the graph, we can conclude that customers with higher bills tend to give higher tips.
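As a minimal sketch of how a graph like this might be produced, the snippet below draws a scatter plot with seaborn's `scatterplot` function. The column names `total_bill` and `tip`, and the six rows of values, are made up for illustration and are not the lesson's actual data.

```python
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical sample records; the values are invented for illustration only.
bills = pd.DataFrame({
    "total_bill": [10.0, 15.0, 20.0, 25.0, 30.0, 40.0],
    "tip": [1.5, 2.0, 3.0, 3.5, 4.0, 6.0],
})

# One point per customer: total bill on the x-axis, tip on the y-axis.
fig, ax = plt.subplots()
sns.scatterplot(data=bills, x="total_bill", y="tip", ax=ax)
ax.set_title("Tip vs. total bill (illustrative data)")

# Pearson correlation as a numeric check of the trend seen in the plot.
correlation = bills["total_bill"].corr(bills["tip"])
print(f"Correlation between total bill and tip: {correlation:.2f}")

# In a script, call plt.show() to display the figure.
```

A correlation close to 1 would agree with the upward trend in the plot. With real data the points are usually more scattered, so the plot and the number are best read together.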

What if we wanted to find out how many customers in the dataset given above are smokers? We would need to review the data again. What if the number of records grows from 20 into the hundreds? Reading the data row by row would become impractical, and the problem only gets worse as the size of the data increases.

In scenarios like these, data visualizations come in handy. We can create several complex visualizations using Python’s seaborn library. Compared with raw data, a visualization conveys information more effectively. As the saying goes, “A picture is worth a thousand words.” Additionally, visualizations capture the audience’s attention.

The data visualization above shows the relationship between the total bill and tips given by the customers. The visualization also categorizes the customers as smokers and nonsmokers. If we observe the above visualization closely, we can conclude that most customers are nonsmokers.
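A categorized plot of this kind could be sketched with the `hue` parameter of `scatterplot`, which colors each point by a categorical column. The `smoker` column name and all values below are assumptions made for illustration, not the lesson's data. The `value_counts` call adds an exact count to back up the visual impression.

```python
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical sample records; the values are invented for illustration only.
customers = pd.DataFrame({
    "total_bill": [10.0, 15.0, 20.0, 25.0, 30.0, 40.0],
    "tip": [1.5, 2.0, 3.0, 3.5, 4.0, 6.0],
    "smoker": ["No", "No", "Yes", "No", "No", "Yes"],
})

# hue="smoker" gives smokers and nonsmokers different colors and adds a legend.
fig, ax = plt.subplots()
sns.scatterplot(data=customers, x="total_bill", y="tip", hue="smoker", ax=ax)

# Count the customers in each category instead of reading rows by eye.
smoker_counts = customers["smoker"].value_counts()
print(smoker_counts)

# In a script, call plt.show() to display the figure.
```

Encoding the category as color keeps both questions (bill versus tip, smoker versus nonsmoker) in a single figure, which is why one plot can replace a second pass over the rows.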

In this course, we’ll learn to construct various visualizations and discuss important concepts (such as variables, types of statistical analyses, and so on) in order to understand and use Python’s seaborn library for data visualization and analysis.
