...

/

The Dataset and Exploratory Data Analysis

The Dataset and Exploratory Data Analysis

Learn how to read the dataset and perform exploratory data analysis.

Let's explore one of the most famous and benchmark datasets of the Titanic disaster history. This dataset is considered a first step toward classification in machine learning.

Dataset

In the Titanic dataset, we have the following features. We want to predict if the passenger survived or not. Therefore, the target will be the Survived column.

Data dictionary

  • PassengerId: Passenger ID

  • Pclass: Ticket class, where 1 = 1st, 2 = 2nd, and 3 = 3rd

  • Name: Passenger name

  • Sex: Male/femaleAge: Age in years

  • SibSp: Number of siblings and/or spouses aboard the Titanic

  • Parch: Number of parents and/or children aboard the Titanic

  • Ticket: Ticket number

  • Fare: Passenger fare

  • Cabin: Cabin number

  • Embarked: Port of embarkation, where C = Cherbourg, Q = Queenstown, and S = Southampton ...

Access this course and 1400+ top-rated courses and projects.