Data Preprocessing: Missing Values

Understand how to preprocess data effectively for binary classification by managing missing values. This lesson guides you through techniques like dropping rows or columns and imputing missing data to prepare clean datasets for machine learning algorithms.

We'll cover the following...

Data preparation and cleaning
Missing values

Cope with missing values

Data preparation and cleaning

Our data have different types. There are numerical data, such as “Age,” “SibSp,” “Parch,” and “Fare.” Then there are categorical data. Some of the categories are represented by numbers (“Survived,” “P-class”). Some are represented by text (“Sex” and “Embarked”). And finally, there is textual data (“Name,” “Ticket,” and “Cabin”).

This is quite a mess for data that we want to feed into a computer. Furthermore, when looking at train.info(), we can see that the counts vary for different columns. While we have 891 values for most columns, we only have 714 for “Age,” 204 for “Cabin,” and 889 for “Embarked”.

Before we can feed our data into any machine learning algorithm, we need ...

1.Getting Started

2.Binary Classification

3.Qubit and Quantum States

4.Probabilistic Binary Classifier

5.Working with Qubits

6.Working with Multiple Qubits

Project

7.Quantum Naïve Bayes

8.Quantum Computing Is Different

9.Quantum Bayesian Networks

10.Bayesian Inference

11.The World Is Not a Disk

12.Working with the Qubit Phase

13.Search for Relatives

14.Sampling

15.Conclusion

16.APPENDIX

Assessment

Data Preprocessing: Missing Values

Data preparation and cleaning