Kernel Logistic Regression

Learn how to implement kernel logistic regression along with its derivation.


In the previous lessons, we mastered logistic regression, a powerful discriminative classifier, and understood how to optimize it using gradient descent on the binary cross-entropy (BCE) loss. However, as a linear model, standard logistic regression is fundamentally limited to problems where the classes are linearly separable.

To overcome this limitation and enable logistic regression to tackle complex, non-linear data (like concentric circles or interlocking spirals), we must employ the kernel trick.

We can kernelize logistic regression just like other linear models by observing that the parameter vector $\bold w$ is a linear combination of the feature vectors in $\Phi(X)$, that is:

$$
\bold w = \Phi(X) \bold a
$$

Here, $\bold a$ is the dual parameter vector, and the loss function now depends on $\bold a$.
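To make the role of the kernel explicit, we can substitute this expression into the model's score for a training point $\bold x_i$, writing $K = \Phi(X)^T\Phi(X)$ for the kernel (Gram) matrix with entries $K_{ij} = \phi(\bold x_i)^T\phi(\bold x_j) = k(\bold x_i, \bold x_j)$:

$$
\bold w^T \phi(\bold x_i) = (\Phi(X)\bold a)^T \phi(\bold x_i) = \bold a^T \Phi(X)^T \phi(\bold x_i) = (K\bold a)_i
$$

This quantity is exactly the pre-activation $z_i$ used in the loss below, so the model only ever needs inner products $k(\bold x_i, \bold x_j)$ and never the explicit feature map $\phi$.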

Minimizing the BCE Loss

To minimize the BCE loss, we need to find the dual parameters $\bold a$ that result in the smallest loss value. The BCE loss is defined as:

$$
\begin{align*}
L_{BCE}(\bold{a}) &= \sum_{i=1}^n L_i\\
L_i &= -\left(y_i\log(\hat y_i)+(1-y_i)\log(1-\hat y_i)\right) \\
\hat{y}_i &= \sigma(z_i)=\frac{1}{1+e^{-z_i}} \\
z_i &= \bold a^T \Phi(X)^T\phi(\bold{x}_i)
\end{align*}
$$
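As a concrete illustration, here is a minimal NumPy sketch of kernel logistic regression trained with batch gradient descent on this loss. The RBF kernel, the helper names (`rbf_kernel`, `fit_kernel_logreg`, `predict_proba`), and the hyperparameters `gamma`, `lr`, and `n_iters` are illustrative assumptions rather than code from this lesson:

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """RBF (Gaussian) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * sq_dists)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_kernel_logreg(X, y, gamma=1.0, lr=0.1, n_iters=1000):
    """Gradient descent on the BCE loss over the dual parameters a.

    With z = K a and y_hat = sigmoid(z), the gradient of the (summed) BCE
    loss with respect to a is K (y_hat - y), since dL/dz_i = y_hat_i - y_i
    and K is symmetric. Averaged gradient steps are used here for stability.
    """
    K = rbf_kernel(X, X, gamma)        # n x n kernel (Gram) matrix
    a = np.zeros(X.shape[0])           # one dual parameter per training point
    for _ in range(n_iters):
        y_hat = sigmoid(K @ a)         # predicted probabilities for all points
        grad = K @ (y_hat - y)         # gradient of the BCE loss w.r.t. a
        a -= lr * grad / len(y)        # averaged gradient-descent step
    return a

def predict_proba(X_train, a, X_new, gamma=1.0):
    """Probabilities for new points: sigmoid of their kernel scores."""
    K_new = rbf_kernel(X_new, X_train, gamma)   # m x n cross-kernel matrix
    return sigmoid(K_new @ a)

# Example usage on non-linearly separable data (assumes scikit-learn is installed):
# from sklearn.datasets import make_circles
# X, y = make_circles(n_samples=200, noise=0.05, factor=0.5, random_state=0)
# a = fit_kernel_logreg(X, y.astype(float), gamma=2.0)
# probs = predict_proba(X, a, X, gamma=2.0)
```

Because the scores depend on the data only through kernel evaluations, the same gradient-descent loop can now separate data such as the concentric circles mentioned earlier.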
