Transformers, explained: Understand the model behind ChatGPT

7,642 views

Leon Petrou

1 day ago

🚀 Learn AI Prompt Engineering: bit.ly/3v8O4Vt
In this technical overview, we dissect the architecture of Generative Pre-trained Transformer (GPT) models, drawing parallels between artificial neural networks and the human brain.
From the foundational GPT-1 to the advanced GPT-4, we explore the evolution of GPT models, focusing on their learning processes, the significance of data in training, and the revolutionary Transformer architecture.
This video is designed for curious non-technical viewers who want an accessible explanation of how GPT models work.
🔗 SOCIAL LINKS:
🌐 Website/Blog: www.futurise.com/
🐦 Twitter/X: / joinfuturise
🔗 LinkedIn: / futurisealumni
📘 Facebook: profile.php?...
📣 Subscribe: www.youtube.com/@leonpetrou?s...
⏰ Timestamps:
0:00 - Intro
0:27 - The Importance of Modeling The Human Brain
1:10 - Basics of Artificial Neural Networks (ANNs)
2:26 - Overview of GPT Models Evolution
3:34 - Training Large Language Models
7:05 - Transformer Architecture
7:45 - Understanding Tokenization
10:19 - Explaining Token Embeddings
17:03 - Deep Dive into Self-Attention Mechanism
18:53 - Multiheaded Self-Attention Explained
19:55 - Predicting the Next Word: The Process
22:33 - De-Tokenization: Converting Token IDs Back to Words
#llm #ml #chatgpt #nvidia #elearning #futurise #promptengineering #futureofwork #leonpetrou #anthropic #claude #claude3 #gemini #openai #transformers #techinsights

Comments: 44
@JRKyt00 2 days ago
Agreed--best explanation I've found. Now I get it (well...)!
@ravindranshanmugam782 3 months ago
Excellent, went thro' multiple videos on basic understanding of Transformers. This is the best one I could quickly grasp. Effortlessly explained, Well done !!
@LeonPetrou 3 months ago
Thank you Ravindran! I try my best to teach things the same way that I'd like to be taught, which is simple and step-by-step. Let me know what other videos you'd like to see from my channel.
@ravindranshanmugam782 3 months ago
Hi Leon, it would be great if you can make videos on Langchain and its application which are trending now. You can also add topics like Vectordatabase, Embedding, word2vec and so on. Anything on GenAI is hot now in tech space. Thanks.
@ovidioe.cabeza4750 1 month ago
Same for me, I am a python backend dev and getting transformer was being tough, but you helped me a lot, thank you!
@Yaser-z9j 6 hours ago
Me too @ravindranshanmugam782
@wp1300 2 months ago
1:12 ANN
2:26 GPT-1 ~ GPT-4
3:34 LLM
7:09 Transformer architecture
7:45 Tokenization & Detokenization
8:17 Step 1
10:14 Step 2
10:20 Token embeddings
14:48 Step 3
15:10 Position Embedding
16:58 Step 4
17:17 Self-Attention
18:52 Multi-headed self-attention
19:55 Step 5
20:27 Feed-Forward
22:02 Step 6
22:32 Step 6
@Yaser-z9j 6 hours ago
Awesome 👌 thank you so much, You are amazing
@michaelzap8528 1 month ago
best. Finally i understand how gpt work now. Thanks male, u the champion.
@KonstantinosEvangelides 19 days ago
Can you do a separate video exploring further what are embeddings and what does the vector embeddings represent more thoroughly. Great video!!
@LeonPetrou 16 days ago
Great idea! I'll do this next.
@anibeto7 3 months ago
It was indeed a very informative video. It cleared a lot of the important ideas. Thanks a lot.
@JohnCohen-ur5hk 1 month ago
Very Good Explanation. Thank You
@vj7668 1 month ago
Excellent !!! Thanks for simplifying it. Loved it !
@LeonPetrou 1 month ago
Appreciate that, thank you!
@programminglover2976 1 month ago
thank you so much.. really reallly well explained.
@Clammer999 2 months ago
Wow, this is one of the easiest-to-understand videos on how transformers work. You also explained tokens and embeddings very well, which is what I was searching for. I'm a complete newbie and I keep hearing about neurons and neural networks. Is a neuron a physical device/hardware, or is it actually an algorithm? And a neural network is not a physical network?
@LeonPetrou 2 months ago
Thank you! Neural networks, and everything else explained in this video, are software (the exception is biological neurons, which are in the human brain). It is all algorithms, basically just code. The hardware that the code runs on mainly needs high processing power and plenty of RAM; it can be a CPU or a GPU.
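To make the "it's basically just code" point concrete, here is a minimal sketch of a single artificial neuron written as a plain Python function. The function name `neuron`, the sigmoid activation, and the example numbers are all assumptions chosen for illustration; they are not taken from the video or from any GPT model.

```python
import math

def neuron(inputs, weights, bias):
    """One artificial neuron: a weighted sum of the inputs plus a bias,
    squashed into the range (0, 1) by a sigmoid activation."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))

# Illustrative values only; in a real network the weights and bias
# are learned during training rather than typed in by hand.
output = neuron([0.5, -1.0, 2.0], [0.8, 0.2, -0.5], bias=0.1)
print(output)
```

A neural network is many of these functions wired together, so nothing here is a physical device; the hardware only determines how fast the arithmetic runs.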
@sudhanshusaxena8134 1 month ago
Great explanation.
@LeonPetrou 1 month ago
Thank you very much!
@karannesh7700 1 month ago
thx for this great video !
@LeonPetrou 1 month ago
Appreciate it!
@MotulzAnto 1 month ago
THANK YOU! easy explanation..
@LeonPetrou 1 month ago
Appreciate it!
@Bachanginh 15 days ago
cool man, im from vietnam
@rhktech 1 month ago
very well explained (Y)
@Omniassassin7 4 months ago
This is amazing, thanks a lot man! Quick question, how are the self-attention layers produced? Does the model dynamically “decide” which contextual layer to use depending on the prompt, or is the set of layers learnt during training?
@LeonPetrou 4 months ago
My pleasure man, glad you like it. That's a great question. The structure and behavior of these self-attention layers are determined during the model's training phase, not during inference. Simply put, the model learns which words in a sentence should pay attention to which other words to better understand the sentence's meaning. This learning process is fixed once the model is fully trained.. it does not change or decide on a different structure when it's given new prompts to process.
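As a rough illustration of that answer, the sketch below runs single-head scaled dot-product self-attention over two different "prompts" using the same frozen weight matrices. Everything named here (`W_Q`, `W_K`, `W_V`, `self_attention`, the 2-dimensional embeddings) is hypothetical and hand-picked for readability; real models learn far larger matrices during training, and GPT-style models also apply a causal mask that this sketch leaves out.

```python
import math

def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical "trained" projection weights. They stay fixed at inference.
W_Q = [[1.0, 0.0], [0.0, 1.0]]
W_K = [[0.5, 0.5], [-0.5, 0.5]]
W_V = [[1.0, 0.0], [0.0, 1.0]]

def self_attention(embeddings):
    """Return (outputs, attention_weights) for a list of token embeddings."""
    queries = [matvec(W_Q, e) for e in embeddings]
    keys = [matvec(W_K, e) for e in embeddings]
    values = [matvec(W_V, e) for e in embeddings]
    scale = math.sqrt(len(keys[0]))
    outputs, all_weights = [], []
    for q in queries:
        # Score this token's query against every token's key.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        all_weights.append(weights)
        # Output is the attention-weighted average of the value vectors.
        outputs.append([
            sum(w * v[d] for w, v in zip(weights, values))
            for d in range(len(values[0]))
        ])
    return outputs, all_weights

# Two different prompts (made-up embeddings) pass through the same weights.
prompt_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
prompt_b = [[0.2, 0.9], [0.7, 0.1]]
out_a, attn_a = self_attention(prompt_a)
out_b, attn_b = self_attention(prompt_b)
print(attn_a)
print(attn_b)
```

The attention weights printed for each prompt differ because the inputs differ, but `W_Q`, `W_K`, and `W_V` never change between the two calls, which is the sense in which the layers are fixed after training.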
@abooaw4588 3 months ago
Bravo 🇨🇵 It's a pity that this very good level of explanation is reserved only for those of us who understand English. LeCun and Bengio have a lot to do with it. Fortunately the nutshell was not translated by some rubbish GPT!
@LeonPetrou 3 months ago
Merci beaucoup for your thoughtful comment! I'm glad you found the video informative. Your point about language accessibility is very important to us. We're actively exploring options to include subtitles in multiple languages in our future videos to ensure more viewers can benefit from our content.
@kamal9991999 1 month ago
This video is a lot better one ☝️
@LeonPetrou 1 month ago
Appreciate that!
@d96002 1 month ago
not 175 trillion parameters but 1.75 trillion
@LeonPetrou 1 month ago
Thanks for clarifying, my bad.
@Keshi-lz3ef 4 months ago
Great session!
@LeonPetrou 4 months ago
Thank you!
@dragonwood-hc4sw 1 month ago
Ed Stafford?
@LeonPetrou 1 month ago
I see it! haha
@NavdeepVarshney-ep4ck 2 months ago
Sir are u a researcher or ml enthusiast
@LeonPetrou 2 months ago
I'm a ml enthusiast with an engineering background. :)
@MaduraiKallan 1 month ago
1.76 trillion for GPT 4
@LeonPetrou 1 month ago
indeed, thanks for clarifying!
@saeidnazemi1312 4 months ago
What happened to your hair?
@LeonPetrou 4 months ago
New year new me 😂