Supervised Learning: Classification
Learn the fundamental principles behind classification and implement classification using the sklearn library.
A common supervised learning problem is classification, applicable to data with discrete output labels. Examples of classification problems include:
Spam email detection
Face recognition
Plant species prediction
Human action recognition
In all examples mentioned above, we have a small set of classes or discrete labels to predict.
The following figure shows a binary classification problem having two classes:
The first class is represented by green circle points.
The second class is represented by blue square points.
Data points of both classes have two input features,
The commonly used classification algorithms include:
Logistic regression
Naïve Bayes classification
Nearest neighbor classification
Decision trees (applicable to both classification and regression)
Here we discuss logistic regression only, which works on the logistic function.
Logistic regression
In contrast to its name, logistic regression is a classification method. It’s a type of linear classifier that resembles linear regression. Logistic regression uses a sigmoid function which is given as: