Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
How Restricted Boltzmann Machines Work? | Deep learning for Recommendation System
Feb 6, 2025
306 views
Written by Prashant Basnet
👋 Welcome, You’ve Landed on My Signature Page.
I’m a Software Development Engineer passionate about building scalable systems and solving problems.
Beyond engineering, I enjoy sharing ideas and documenting lessons so others can learn and build on them.This space is my digital notebook, a place where I reflect on what I’m learning and creating.
Here's a standard Boltzmann Machines:
image credit:
In theory it's a great problem, where reach node is connected to every other node.
A Restricted Boltzmann Machine:
image credit:
Let's look at example of movies recommendation system.
image credit:
What allows Restricted Boltzmann Machines to learn?
How does RBM adjust it's weight?
We know in other neural network we had gradient descent process which allowed back propagation of error.
But in this network, we don't have directed network. We have undirected network. How does the weights gets adjusted?
Gibs Sampling:
image credit
Let's see in graph:
image credit
In RBM, what does energy mean?
Weights are considered energy. Weights dictates the shape of this energy curve. Through the contrastive divergence process, we are finding what's the values (input & hidden layers) for the system to be in the lowest energy state possible.
At the end of convergence, this brings our system to minimal energy state at the end of this Contrastive Divergence process.
We have to keep repeating the Gibs sampling for converging our RBM?
In 1998, Jeffrey Hinton discovered a shortcut, we don't have to wait until it convergences to the end. We can only take 2 steps i.e first 2 passes. Which will be sufficient to understand how to adjust your curve in the initial stage.
We design the system with aim to always get to the minimum energy state possible. Through this, system is such, when we input our training value, our system is already going to be at the lowest state possible.