Identifying Exploratory Questions with EDA
Learn to identify the questions you need to ask for exploratory data analysis.
What is EDA?
Exploratory data analysis, or EDA, is the process of exploring and analyzing our data through visualizations, statistics, and other methods of data storytelling. One of the first steps in EDA is identifying the types of questions we can explore with the dataset.
Initial data analysis
Let's start by loading a sample dataset, the Gapminder dataset, which the Plotly package conveniently provides as a pandas DataFrame.
import plotly

# Import the Gapminder dataset
gapminder_data = plotly.data.gapminder()

# Print the feature names and head of the DataFrame
print(gapminder_data.columns.tolist())
print(gapminder_data.head(10))
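Beyond printing the column names and the first rows, a compact per-column summary can hint at which questions a dataset supports: categorical columns suggest comparisons between groups, while numeric columns suggest trends and distributions. The following is a minimal sketch of that idea, not part of the lesson's own code. The helper name `summarize_columns` and the toy DataFrame are hypothetical and used only for illustration; the toy values are not taken from Gapminder.

```python
import pandas as pd


def summarize_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column: dtype, distinct-value count, missing count."""
    return pd.DataFrame(
        {
            "dtype": df.dtypes.astype(str),  # storage type of each column
            "n_unique": df.nunique(),        # distinct non-missing values
            "n_missing": df.isna().sum(),    # missing entries per column
        }
    )


# Toy stand-in data (hypothetical values, not taken from Gapminder).
toy = pd.DataFrame(
    {
        "group": ["a", "a", "b", "b"],
        "year": [2000, 2005, 2000, 2005],
        "value": [1.5, 2.0, None, 4.0],
    }
)

summary = summarize_columns(toy)
print(summary)
```

Assuming the Gapminder DataFrame has been loaded as above, the same helper could be applied with `summarize_columns(gapminder_data)`. Columns with few distinct values are natural candidates for grouping, and columns with missing entries may need attention before any question can be answered reliably.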
Let's take a look at the individual variables, and for those with acronyms, ...