Our project is based on the Kaggle Housing Prices Competition. In this challenge we are given a dataset with different attributes for houses and their prices. Our goal is to develop a model that can predict the prices of houses based on this data.

At this point we already know two things:

  1. We are given labeled training examples
  2. We are asked to predict a value

What do these tell us in terms of framing our problem? The first point tells us that this is clearly a typical supervised learning task, while the second one tells us that this is a typical regression task.

Get hands-on with 1400+ tech skills courses.