Overview of Probabilistic Models

Explore generative and discriminative models for classification and regression within the realm of probabilistic modeling.

Imagine a weather app that predicts the temperature tomorrow will be exactly $20^\circ\text{C}$. What if it could tell you there’s an $80\%$ chance it will be $20^\circ\text{C}$, a $15\%$ chance it will be $22^\circ\text{C}$, and a $5\%$ chance it will be $18^\circ\text{C}$? Which prediction is more useful?

In the context of supervised learning, it’s far more advantageous to predict the distribution of the target variable rather than a specific fixed value. This distribution provides a measure of uncertainty or confidence alongside the prediction.

What are probabilistic models?

A probabilistic model in machine learning is any model that provides an output in terms of probabilities. Instead of directly predicting a class label (like “Dog”) or a single value (like “Price is $500”), it predicts a probability distribution (a mathematical function that describes the likelihood of different outcomes occurring in a random experiment, quantifying the uncertainty associated with various events or values) over all possible outcomes.

  • Classification example: Instead of outputting “Class A,” a probabilistic model might output: $P(\text{Class A}) = 0.90$, $P(\text{Class B}) = 0.08$, $P(\text{Class C}) = 0.02$. The fixed prediction is then typically the value with the highest probability (Class A).

  • Regression example: Instead of outputting “Price = $500,” the model might output a Gaussian distribution (a bell-shaped curve that shows which values are most likely: values near the center are common, and values far away are unlikely) centered at $500, indicating that prices close to $500 are highly likely, while prices far from it are unlikely.
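The two examples above can be sketched in a few lines of Python. This is a minimal illustration with made-up numbers, not output from a real trained model; the class probabilities and the Gaussian parameters (mean $500, standard deviation 25) are assumptions for demonstration.

```python
import numpy as np

# Classification: the model's output is a distribution over classes.
class_probs = {"Class A": 0.90, "Class B": 0.08, "Class C": 0.02}

# The fixed prediction is typically the class with the highest probability.
prediction = max(class_probs, key=class_probs.get)
print(prediction)  # Class A

# Regression: the model's output is a Gaussian centered at $500.
mean, std = 500.0, 25.0  # hypothetical parameters

def gaussian_pdf(x, mu, sigma):
    """Density of a Gaussian distribution at point x."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Prices near the center of the distribution are more likely than distant ones.
print(gaussian_pdf(500.0, mean, std) > gaussian_pdf(450.0, mean, std))  # True
```

Note that in both cases the model’s real output is the whole distribution; the single “prediction” is just a summary of it.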

In fact, numerous classification and regression models implicitly calculate the underlying distribution and then produce a fixed target from it. The most common approach is simply to select the value with the highest probability (the mode of the distribution), although genuinely sampling a value at random according to the distribution is also possible.

The role of probability in learning

Probabilistic modeling forces us to explicitly calculate and model the relationship between our input features ($\mathbf{x}$) and the target variable ($y$). This formal approach to uncertainty is mathematically grounded and highly interpretable.

However, certain models don’t lend themselves easily to estimating a probability distribution for the target variable. In situations where obtaining samples of the target variable proves difficult, we turn to conventional machine learning models as an alternative solution.

To formally understand how these models calculate probabilities, we must turn to the foundational tool of probabilistic machine learning: Bayes’ rule. Bayes’ rule provides the mathematical framework for updating the probability of a hypothesis as new evidence or data is gathered, forming the basis of many powerful probabilistic algorithms.

Bayes’ rule

We can express the computation of the probability distribution of a target variable $y$ given an input feature vector $\mathbf{x}$, written $p(y|\mathbf{x})$, using Bayes’ rule in the following manner:

$$p(y|\mathbf{x}) = \frac{p(\mathbf{x}|y)\,p(y)}{p(\mathbf{x})}$$
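Bayes’ rule is easy to verify numerically. The sketch below assumes a hypothetical binary spam/ham classification for a fixed input $\mathbf{x}$; the likelihood and prior values are made-up numbers chosen for illustration.

```python
# Hypothetical values for a fixed input x:
likelihood = {"spam": 0.60, "ham": 0.10}  # p(x | y)
prior      = {"spam": 0.20, "ham": 0.80}  # p(y)

# Evidence p(x) = sum over y of p(x|y) p(y) (law of total probability).
evidence = sum(likelihood[y] * prior[y] for y in prior)

# Posterior p(y|x) via Bayes' rule.
posterior = {y: likelihood[y] * prior[y] / evidence for y in prior}
print(posterior)  # {'spam': 0.6, 'ham': 0.4}
```

Notice how the evidence term $p(\mathbf{x})$ acts as a normalizer: it guarantees that the posterior probabilities sum to one.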

Here, $p(y|\mathbf{x})$ ...
