...

/

Identifying Exploratory Questions with EDA

Identifying Exploratory Questions with EDA

Learn to identify the questions you need to ask for exploratory data analysis.

What is EDA?

Exploratory data analysis, or EDA, is the process of exploring and analyzing our data through visualizations, statistics, and other methods of data storytelling. One of the first steps to EDA is identifying the types of questions we can explore with the dataset.

Initial data analysis

Let's start with loading a sample dataset, called the Gapminder dataset, helpfully preloaded into a pandas DataFrame using the Plotly package.

Press + to interact
import plotly
#Import the gapminder dataset
gapminder_data = plotly.data.gapminder()
#Print the feature names and head of the dataframe
print(gapminder_data.columns.tolist())
print(gapminder_data.head(10))

Let's take a look at the individual variables, and for those with acronyms, ...