Mamba Language Model Simplified In JUST 5 MINUTES!

5,610 views

Analytics Camp


#mamba #ai #llm
Here’s a super simplified explanation of the Mamba language model and its Selective State Space Model (Selective SSM) architecture. In the previous videos, I used the example of word sequences to show how transformers use the Attention Mechanism to process natural language and predict the next word in a sequence, e.g., a sentence. In this video, I show you how Mamba’s architecture uses the Selective State Space Model to figure out which parts of the data (e.g., which words in a word sequence) are connected and how they might affect what happens next (e.g., which word comes next).
Don’t forget to subscribe and watch these related videos:
Transformer Language Models Simplified in JUST 3 MINUTES!
• Transformer Language M...
This Is How Exactly Language Models Work in AI - NO background needed!
• This is how EXACTLY La...
Backpropagation Simplified in JUST 2 MINUTES! --Neural Networks
• The Concept of Backpro...
www.youtube.com/@analyticsCam...
Key terms and concepts in the video:
00:00 Intro
00:31 Why Mamba?
00:52 State Space Models
01:14 Selectivity
01:25 Two stages of Selective SSM
01:48 Parameters
02:01 First stage: Projecting the Input
02:08 Discretization
02:25 Linear Time Invariance (LTI)
02:50 Dynamic data
03:14 B Parameter
03:19 C Parameter
03:39 Selection Mechanism
03:49 Hidden State update
03:58 Delta Parameter resets itself
04:30 Input Selection
04:41 Collocation
05:04 Each state update
05:09 Predicting the next word
05:23 Hardware-aware algorithm for Selective SSM
05:27 GPU with High Bandwidth Memory
05:34 Mamba’s overall architecture (H3 + Multi-layer Perceptron)
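The discretization, selection, and hidden state update steps named in the chapters above can be sketched in a few lines of plain Python. This is only a minimal illustration under stated assumptions, not the video's or the paper's implementation: it uses a single input channel, a diagonal `A`, scalar weights `w_delta`, `w_B`, and `w_C` in place of learned projection layers, and a simplified Euler step for `B`. The function and parameter names are hypothetical.

```python
import math

def softplus(z):
    # Numerically stable log(1 + exp(z)); keeps the step size delta positive.
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def selective_ssm(xs, A, w_B, w_C, w_delta, b_delta=0.0):
    """Run a one-channel selective SSM over xs, one input at a time.

    Assumptions (not from the video): A is diagonal with negative entries,
    and delta, B, C come from simple scalar weights instead of learned layers.
    """
    h = [0.0] * len(A)  # hidden state, one entry per state dimension
    ys = []
    for x in xs:
        # Selection mechanism: delta, B and C depend on the current input.
        delta = softplus(w_delta * x + b_delta)
        B = [w * x for w in w_B]
        C = [w * x for w in w_C]
        # Discretization: exponential (zero-order hold) for A, Euler step for B.
        A_bar = [math.exp(delta * a) for a in A]
        B_bar = [delta * b for b in B]
        # Hidden state update: previous hidden state plus the current input.
        h = [a_bar * h_i + b_bar * x
             for a_bar, h_i, b_bar in zip(A_bar, h, B_bar)]
        # Output: read the hidden state through C.
        ys.append(sum(c * h_i for c, h_i in zip(C, h)))
    return ys

ys = selective_ssm([1.0, 0.0, 2.0], A=[-1.0, -2.0],
                   w_B=[0.5, 0.5], w_C=[1.0, 1.0], w_delta=1.0)
print(ys)
```

Because `delta` changes with each input, the discretized `A_bar` and `B_bar` differ from step to step, which is what distinguishes this loop from a linear time-invariant SSM. A large `delta` lets the current input dominate the state, while a small `delta` mostly preserves the previous state.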
Stick around for more videos on LLMs, Natural Language Processing (NLP), Generative AI, and fun coding and machine learning projects, and follow Analytics Camp on Twitter (X): / analyticscamp

Comments: 12
@zagoguic 6 months ago
Great video! Keep making them!
@analyticsCamp 5 months ago
Thanks! Will do!
@optiondrone5468 6 months ago
Thanks for this video, keep up the good work.
@analyticsCamp 6 months ago
Thanks for watching!
@doublesami 2 months ago
Very informative. Looking forward to the in-depth video on Vision Mamba or VMamba.
@analyticsCamp 2 months ago
Thanks for watching and for your suggestion. Stay tuned :)
@kvlnnguyieb9522 4 months ago
A great video. For the next video, maybe you can explain the details of the selection mechanism in code.
@analyticsCamp 4 months ago
Great suggestion! Thanks for watching :)
@ln2deep 6 months ago
It's a bit unclear to me how the Mamba architecture works recurrently when looking at the architecture at 5:30. What is the input here: the whole sequence or individual tokens? Surely it would have to be the whole sequence for Mamba to build a representation recurrently. But then it seems strange to have a skip connection on the whole sequence. I think I've missed something.
@analyticsCamp 6 months ago
Hi, thanks for your comment. I mentioned that delta discretizes the input (the word sequence) into tokens, ..., and that, at every step of the hidden state update, the model takes into account the previous hidden state and the 'current input word'. I will try to post an update on this, maybe reviewing the entire article if I can. Please do let me know if you are interested in any particular topic for a video.
@nidalidais9999 5 months ago
I liked your style and your funny personality
@analyticsCamp 5 months ago
Thanks for watching, I love your comment too :)