Confused which Transformer Architecture to use? BERT, GPT-3, T5, Chat GPT? Encoder Decoder Explained

28,967 views

Datafuse Analytics

1 day ago

This video explains the major Transformer architectures and the differences between the most important Transformer models.
It explains, in a simplified manner, which Transformer architecture to use for a particular problem in Natural Language Understanding (NLU) and Natural Language Generation (NLG).
Over the past 6 years, Transformers, a neural network architecture, have completely transformed state-of-the-art natural language processing and the way we approach different problem statements in NLG and NLU.
Chapters:
0:00 Introduction
1:21 Encoder Branch
1:57 BERT
2:37 DistilBERT
3:19 RoBERTa
3:59 XLM
4:50 XLM-RoBERTa
5:32 ALBERT
6:40 ELECTRA
7:19 DeBERTa
8:13 Decoder Branch
8:50 GPT
9:13 CTRL
9:54 GPT-2
10:31 GPT-3
11:30 GPT-Neo/GPT-J-6B
11:50 Encoder-Decoder Branch
12:00 T5
13:05 BART
13:46 M2M-100
14:22 BigBird
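The chapter list groups the models into three branches. As a rough illustration, that grouping can be written as a small lookup table. The task-to-branch mapping below is a common rule of thumb and an assumption, not something stated in the video description, and the names BRANCHES, TASK_TO_BRANCH, and suggest_models are hypothetical.

```python
# Model families, grouped in the order they appear in the chapter list above.
BRANCHES = {
    "encoder": [
        "BERT", "DistilBERT", "RoBERTa", "XLM",
        "XLM-RoBERTa", "ALBERT", "ELECTRA", "DeBERTa",
    ],
    "decoder": ["GPT", "CTRL", "GPT-2", "GPT-3", "GPT-Neo/GPT-J-6B"],
    "encoder-decoder": ["T5", "BART", "M2M-100", "BigBird"],
}

# ASSUMPTION: a common rule of thumb for matching tasks to branches.
# This mapping is illustrative and is not taken from the video.
TASK_TO_BRANCH = {
    "text-classification": "encoder",
    "named-entity-recognition": "encoder",
    "text-generation": "decoder",
    "translation": "encoder-decoder",
    "summarization": "encoder-decoder",
}


def suggest_models(task):
    """Return the candidate models from the branch assumed to fit `task`."""
    if task not in TASK_TO_BRANCH:
        raise ValueError(f"No rule of thumb recorded for task: {task!r}")
    return BRANCHES[TASK_TO_BRANCH[task]]


if __name__ == "__main__":
    # Print the candidate models for every task in the assumed mapping.
    for task in TASK_TO_BRANCH:
        print(f"{task}: {', '.join(suggest_models(task))}")
```

Treat the result as a starting shortlist only; the right model for a real project also depends on language coverage, input length, and compute budget.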
#datascience #neuralnetwork #machinelearning #naturallanguageprocessing

Comments: 64
@datafuseanalytics 1 year ago
In this video, I tried to explain all the major Transformer architectures. I have also explained the differences between them and the training objective of each one. If you feel this video adds value to your life, please like, share, and comment on it and subscribe to this channel. If you have any suggestions or feedback, please drop them in the comment box.
@aurkom 11 months ago
It would have been awesome if all the models had their release year mentioned as well. It helps to get a bird's-eye view of the timeline.
@datafuseanalytics 11 months ago
Hello. Yes, I am making a separate video on a similar topic. It will be uploaded soon. Stay tuned, my friend.
@hemantwani4757 2 months ago
Very nicely explained ❤👍
@snehotoshbanerjee1938 1 month ago
Great summary!!
@santoshpanigrahi5711 1 year ago
Thanks for sharing. It's very informative. Keep up this work.
@datafuseanalytics 1 year ago
Thank you, Santosh, for watching the video.
@milindkubal2738 1 year ago
Amazing. Great work👍
@datafuseanalytics 1 year ago
Thanks Milind
@kevon217 1 year ago
thanks for the excellent, well-explained summary!
@datafuseanalytics 1 year ago
Thank you Kevin
@falknfurter 9 months ago
I just found this video and it's very good. I'm currently trying to understand when to use what type of model. Looking at Hugging Face is just overwhelming. That's where this video jumps in and provides an excellent overview of the major models. I wish there were a similar video explaining the various pretraining objectives.
@datafuseanalytics 9 months ago
Hello. I will definitely make a video on the same. Thanks a lot. 😀
@ajitkumar15 1 year ago
Very nice and to-the-point video, thank you!!!
@datafuseanalytics 1 year ago
Hey thanks a lot Ajit 😃 🙏
@SagarBhalke-td3vy 1 year ago
Great explanation. Thank you very much
@datafuseanalytics 1 year ago
Glad it was helpful for you Sagar...
@SaketKumar-wy1wb 1 year ago
This is good. Keep up the good work. 🙂
@datafuseanalytics 1 year ago
Thank you Saket, I will
@sagar3482 1 year ago
Informative content. Thanks for sharing this
@datafuseanalytics 1 year ago
Glad you liked it!
@ahmedelsabagh6990 4 months ago
Great video!
@datafuseanalytics 4 months ago
Thanks a lot. Please do share it with your friends 😁
@sanjaybhalke8032 1 year ago
Thanks for sharing
@datafuseanalytics 1 year ago
My pleasure
@ganeshkharad 10 months ago
this is a really nice explanation!!!
@datafuseanalytics 10 months ago
Thanks a lot Ganesh 😃 🙏
@adityakshirsagar1391 1 year ago
Informative 👍
@datafuseanalytics 1 year ago
Glad it was helpful and informative for you Aditya. Please do share it with your friends. More interesting videos will be uploaded soon
@exxzxxe 1 year ago
Well done!
@datafuseanalytics 1 year ago
Thanks David.
@user-os1xi8qf4y 8 months ago
thank you sir! Fantastic method of explanation
@datafuseanalytics 7 months ago
Hey buddy. Thanks a lot. 😀
@datafuseanalytics 7 months ago
Hey buddy. Thanks a lot
@rembautimes8808 5 months ago
Excellent video and I joined as a sub. I like this style of going through the various architectures and the use cases. Maybe you can also update it with GPT-4 since it’s new out there.
@datafuseanalytics 4 months ago
Thanks a lot for this amazing comment. I have uploaded the latest video using the ChatGPT model: kzfaq.info/get/bejne/g7F4eMSpydXVqHU.html Please go through it and feel free to comment
@sarc007 1 year ago
Excellent
@datafuseanalytics 1 year ago
Thanks a lot Suhail.
@WillBeebe 1 year ago
Superb 🎉
@datafuseanalytics 1 year ago
Hey thanks William
@amortalbeing 9 months ago
thanks a lot❤
@datafuseanalytics 9 months ago
You are most welcome 😃 Do check out the other videos on AI on this channel too.
@mhaya1 10 months ago
Kudos🎉
@datafuseanalytics 10 months ago
Thank you 😃
@d4munche3z 1 year ago
Can you create a tutorial on Longformer and the concepts/code used to adapt an LLM for larger token sizes?
@datafuseanalytics 1 year ago
Hello David. I haven't made it yet. But I will definitely make one on Longformer etc., which takes a whopping 4096 tokens as input. Thanks for your feedback.
@markfallu2389 1 year ago
Great summary. It would be good if you did an update
@datafuseanalytics 1 year ago
Sure. I will make an updated video comprising all the possible model architectures
@tilkesh 1 year ago
Thx
@datafuseanalytics 1 year ago
Most welcome 😃 😊
@ianboyles2425 1 year ago
there are some new important ones like the newer gpt Neo models, alpaca, llama, cereus, vicuna
@datafuseanalytics 1 year ago
Hello Ian. Yes. At the time of this session, these models weren't available. Thank you for your feedback. I will definitely make one video (part 2) which will cover these models in a simpler fashion
@projectbit2248 1 year ago
Hello, how do I contact/connect with you regarding a project?
@datafuseanalytics 1 year ago
Hello, please contact us via our email. datafuseanalytics@gmail.com
@chenpeter7428 1 year ago
It seems it does not cover BERT in computer vision.
@datafuseanalytics 1 year ago
Yes, you are right, Chen Peter
@ko-Daegu 1 year ago
this sounds like it was copy-pasted from online articles and just read aloud without any extra info at all
@datafuseanalytics 1 year ago
Hey Ko-Jap. I referred to multiple books for this and then wrote the content in my own words. I did not refer to any online blogs or articles; books are the only reference. But thank you for your valuable feedback. I will improve so that it doesn't sound as if I am reading. 🙏😀
@yosup125 7 months ago
for the algo
@datafuseanalytics 6 months ago
Thank you
@gregbugaj 11 months ago
Nice overview
@datafuseanalytics 11 months ago
Hey, thanks a lot 😃
@saketkr 1 year ago
This is good. Keep up the good work. 🙂
@datafuseanalytics 1 year ago
Hey, thanks Saket