Supervised Learning: Classification
Learn the fundamental principles behind classification and implement classification using the sklearn library.
A common supervised learning problem is classification, applicable to data with discrete output labels. Examples of classification problems include:
- Spam email detection 
- Face recognition 
- Plant species prediction 
- Human action recognition 
In all examples mentioned above, we have a small set of classes or discrete labels to predict.
The following figure shows a binary classification problem having two classes:
- The first class is represented by green circle points. 
- The second class is represented by blue square points. 
Data points of both classes have two input features, 
The commonly used classification algorithms include:
- Logistic regression 
- Naïve Bayes classification 
- Nearest neighbor classification 
- Decision trees (applicable to both classification and regression) 
Here we discuss logistic regression only, which works on the logistic function.
Logistic regression
In contrast to its name, logistic regression is a classification method. It’s a type of linear classifier that resembles linear regression. Logistic regression uses a sigmoid function which is given as:
Here,