Long Context Language Models and their Biological Applications with Eric Nguyen - 690

  Рет қаралды 405

The TWIML AI Podcast with Sam Charrington

The TWIML AI Podcast with Sam Charrington

Күн бұрын

Today, we're joined by Eric Nguyen, PhD student at Stanford University. In our conversation, we explore his research on long context foundation models and their application to biology particularly Hyena - hazyresearch.stanford.edu/blo..., and its evolution into Hyena DNA - hazyresearch.stanford.edu/blo... and Evo - arcinstitute.org/news/blog/evo models. We discuss Hyena, a convolutional-based language model developed to tackle the challenges posed by long context lengths in language modeling. We dig into the limitations of transformers in dealing with longer sequences, the motivation for using convolutional models over transformers, its model training and architecture, the role of FFT in computational optimizations, and model explainability in long-sequence convolutions. We also talked about Hyena DNA, a genomic foundation model pre-trained on 1 million tokens, designed to capture long-range dependencies in DNA sequences. Finally, Eric introduces Evo, a 7 billion parameter hybrid model integrating attention layers with Hyena DNA's convolutional framework. We cover generating and designing DNA with language models, hallucinations in DNA models, evaluation benchmarks, the trade-offs between state-of-the-art models, zero-shot versus a few-shot performance, and the exciting potential in areas like CRISPR-Cas gene editing.
🎧 / 🎥 Listen or watch the full episode on our page: twimlai.com/go/690.
Note: Please note that this episode has audio and video sync issues. We apologize for any inconvenience this may cause and appreciate your understanding. Thank you for watching!
🔔 Subscribe to our channel for more great content just like this: kzfaq.info?sub_confi...
🗣️ CONNECT WITH US!
===============================
Subscribe to the TWIML AI Podcast: twimlai.com/podcast/twimlai/
Follow us on Twitter: / twimlai
Follow us on LinkedIn: / twimlai
Join our Slack Community: twimlai.com/community/
Subscribe to our newsletter: twimlai.com/newsletter/
Want to get in touch? Send us a message: twimlai.com/contact/
📖 CHAPTERS
===============================
00:00 - Introduction
01:14 - Motivation for Hyena architecture
02:39 - Limitations of transformer architectures with longer sequences
05:06 - Role of Fast Fourier Transform (FFT) in Hyena
07:54 - Explainability in long-sequence convolutions
09:07 - Hyena model
14:45 - Hyena DNA
19:10 - Hyena DNA model training
21:11 - Evo
24:32 - Designing DNA with language models
25:52 - Transformer-based approaches to DNA
28:21 - Hallucination in DNA models
33:41 - Evo gene editing tools
35:30 - Evo evaluation benchmarks
38:21 - Evo vs state-of-the-art models
40:38 - Zero-shot vs a few-shot performance
42:06 - Future directions
🔗 LINKS & RESOURCES
===============================
Hyena Hierarchy: Towards Larger Convolutional Language Models - hazyresearch.stanford.edu/blo...
HyenaDNA: learning from DNA with 1 Million token context - hazyresearch.stanford.edu/blo...
Evo: DNA foundation modeling from molecular to genome scale - arcinstitute.org/news/blog/evo
📸 Camera: amzn.to/3TQ3zsg
🎙️Microphone: amzn.to/3t5zXeV
🚦Lights: amzn.to/3TQlX49
🎛️ Audio Interface: amzn.to/3TVFAIq
🎚️ Stream Deck: amzn.to/3zzm7F5

Пікірлер
Fatih shares video language model providing real-time feedback and coaching on activities #CVPR2024
0:50
ВОДА В СОЛО
00:20
⚡️КАН АНДРЕЙ⚡️
Рет қаралды 30 МЛН
Как бесплатно замутить iphone 15 pro max
00:59
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 8 МЛН
What is LoRA? Low-Rank Adaptation for finetuning LLMs EXPLAINED
8:22
AI Coffee Break with Letitia
Рет қаралды 38 М.
Large Language Models (LLMs) - Everything You NEED To Know
25:20
Matthew Berman
Рет қаралды 70 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 806 М.
LoRA explained (and a bit about precision and quantization)
17:07
What are Transformers (Machine Learning Model)?
5:50
IBM Technology
Рет қаралды 379 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Introduction to large language models
15:46
Google Cloud Tech
Рет қаралды 696 М.
I wish every AI Engineer could watch this.
33:49
1littlecoder
Рет қаралды 75 М.
In-Context Learning: EXTREME vs Fine-Tuning, RAG
21:42
code_your_own_AI
Рет қаралды 3,8 М.
Опасность фирменной зарядки Apple
0:57
SuperCrastan
Рет қаралды 10 МЛН
#samsung #retrophone #nostalgia #x100
0:14
mobijunk
Рет қаралды 11 МЛН
$1 vs $100,000 Slow Motion Camera!
0:44
Hafu Go
Рет қаралды 28 МЛН
Xiaomi SU-7 Max 2024 - Самый быстрый мобильник
32:11
Клубный сервис
Рет қаралды 521 М.