Kaggle Challenge - Exploratory Data Analysis

We'll cover the following...

1. Exploratory Data Analysis

Understand the Data Structure
Explore Numerical Attributes
Correlations Among Numerical Attributes
Explore Categorical Attributes
Jupyter Notebook

Our project is based on the Kaggle Housing Prices Competition. In this challenge, we are given a dataset of houses described by a range of attributes, together with their sale prices. Our goal is to develop a model that predicts a house's price from those attributes.

At this point, we already know two things:

We are given labeled training examples
We are asked to predict a value

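What do these tell us in terms of framing our problem? The first point tells us that this is a supervised learning task, because every training example comes with its label (the sale price). The second tells us that it is a regression task, because the value we are asked to predict is a continuous number rather than a category.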
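
The framing above can be sketched in a few lines of code. The rows below are made-up placeholders, not values from the competition data, and split_features_target is a hypothetical helper; only the column names come from the data description.

```python
# Hypothetical rows: the numbers are invented purely for illustration.
rows = [
    {"LotArea": 8000, "OverallQual": 7, "YearBuilt": 2001, "SalePrice": 200000.0},
    {"LotArea": 9500, "OverallQual": 6, "YearBuilt": 1975, "SalePrice": 150000.0},
]

TARGET = "SalePrice"


def split_features_target(rows, target=TARGET):
    """Split labeled examples into inputs (X) and labels (y)."""
    X = [{k: v for k, v in row.items() if k != target} for row in rows]
    y = [row[target] for row in rows]
    return X, y


X, y = split_features_target(rows)

# Supervised learning: each input in X has a matching label in y.
# Regression: the labels in y are continuous numbers, not class names.
print(len(X), len(y))
```

Here X holds what the model is allowed to see and y holds what it has to predict; keeping the two apart from the start makes it harder to leak the target into the inputs by accident.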

If we look at the data description file, data_description.txt, we can see which attributes are recorded for the houses we are working with. Here is a sneak peek at some of the interesting attributes and their descriptions from that file:

SalePrice: The property's sale price in dollars. This is the target variable that we are trying to predict.
MSSubClass: The building class.
LotFrontage: Linear feet of street connected to property.
LotArea: Lot size in square feet.
Street: Type of road access.
Alley: Type of alley access.
LotShape: General shape of property.
LandContour: Flatness of the property.
LotConfig: Lot configuration.
LandSlope: Slope of property.
Neighborhood: Physical locations within Ames city limits.
Condition1: Proximity to main road or railroad.
HouseStyle: Style of dwelling.
OverallQual: Overall material and finish quality.
OverallCond: Overall condition rating.
YearBuilt: Original construction date.
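
To keep these descriptions at hand while exploring, the "Name: description" lines can be collected into a dictionary. This is only a minimal sketch: the excerpt is a few entries copied from the list above rather than read from disk, parse_descriptions is a hypothetical helper, and it assumes each attribute sits on its own line with its name before the first colon. The actual data_description.txt may contain other lines that need extra handling.

```python
# A few entries copied from the list above (not the full file).
EXCERPT = """\
MSSubClass: The building class.
LotFrontage: Linear feet of street connected to property.
OverallQual: Overall material and finish quality.
YearBuilt: Original construction date.
"""


def parse_descriptions(text):
    """Map each attribute name to its one-line description."""
    descriptions = {}
    for line in text.splitlines():
        # Split on the first colon only, so colons in the description survive.
        name, sep, description = line.partition(":")
        if not sep or not name.strip():
            continue  # skip lines without a "Name:" prefix
        descriptions[name.strip()] = description.strip()
    return descriptions


descriptions = parse_descriptions(EXCERPT)
print(descriptions["YearBuilt"])
```

A lookup table like this is handy later on, when a column name such as MSSubClass shows up in a plot and its meaning is not obvious from the name alone.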
