Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
What is GPT ? Transformer | Machine Learning Model
Prashant Basnet
Dec 30, 2024
23 views
What GPT-3 Means:
GPT means generative pre-trained transformer, 3 means 3rd generation.
GPT 3 is just one example of transformer. Something that transforms from one sequence into another. Language translation is just a great example.
Machine Learning Basics:
A machine learning is very flexible structure with tun-able parameters like bunch of knob & dials, then somehow we use many examples of what the output should look like give an input, tweaking and tuning the values of these parameters to mimic this behaviour.
A simplest form of machine learning is linear regression, where input and output are a single number. For example sq.ft of a house and a price, we want a line of best fit throughout the data, that can predict future house prices.
Deep Learning:
When you create a giant model with huge number of parameters without it either grossly overfitting the training data or being too complex & unmanageable to train.
Deep learning describes a class of models that in the last decade have proven to scale remarkably well. What unifies them is they all use the same training algorithm, called back-propagation.
Transformers as a Model Class:
Input Data Formatting:
Transformers' Operation:
The only way parameters/weights interact with the data being processed is through weighted sums. We also sprinkle some simple non linear function throughout.These weighted sums are packaged together as various component in a matrix vector product.
Parameters in GPT-3:
Those 175 billion parameters/weights in GPT-3 are organised into 27,938 matrices.
These 27,983 matrices falls in 8 different categories.
These weights are the actual brain. They are the things learned during training. And they determine how a model behaves.
In the next thread we will look into details of these categories.
#AI #MachineLearning #DeepLearning #GPT3 #ArtificialIntelligence #Transformers #NeuralNetworks #DataScience #NaturalLanguageProcessing #NLP #DeepLearningModels #TechExplained #AIModels #MLBasics #GenerativeAI #TechEducation #AIInnovation #DataScienceCommunity #AIResearch #LearnAI #AIExplained #TechKnowledge #MLTechniques
this images & notes are taken from the following video, all credit goes to 3Blue1Brown