The Power Behind GPT: An Introduction to the Transformer Model

4,223 views

David Shen

1 year ago

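This talk tries to introduce the Transformer model from the most accessible angle possible. It involves no mathematical formulas and assumes no background in neural networks.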

Comments: 9
@DavidShen-ph6iq, 1 year ago
All materials in this video were obtained from the internet. If anything infringes your rights, please let me know. Thank you. 😄
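@DavidShen-ph6iq, 1 year ago
Below are links to the materials covered in the Transformer learning path.
1. What Are Transformer Models and How Do They Work?, txt.cohere.com/what-are-transformer-models/ (Chinese version: blog.csdn.net/shenyang2/article/details/131199513?spm=1001.2014.3001.5502). An introductory article for beginners that involves no mathematical formulas. Most of this video's content comes from it.
2. The Illustrated Transformer, jalammar.github.io/illustrated-transformer/?ref=txt.cohere.com (Chinese version: blog.csdn.net/yujianmin1990/article/details/85221271). The best-known introductory article on the Transformer model, covering its principles, architecture, and mathematical formulas. The author is Jay Alammar.
3. Transformer通俗笔记:从Word2Vec、Seq2Seq逐步理解到GPT、BERT (Plain-language notes on the Transformer: working up from Word2Vec and Seq2Seq to GPT and BERT), blog.csdn.net/v_JULY_v/article/details/127411638. An introductory article by 周磊 (July) that presents the model and the body of knowledge it depends on more systematically.
4. 从零实现Transformer、ChatGLM-6B、LangChain+LLM的本地知识库问答 (Implementing the Transformer from scratch, ChatGLM-6B, and local knowledge-base Q&A with LangChain+LLM), blog.csdn.net/v_JULY_v/article/details/130090649. An introductory article by 周磊 (July) that deepens understanding of the model from the perspective of code implementation. It also includes code analysis of ChatGLM and LangChain.
5. Transformer论文逐段精读 (A paragraph-by-paragraph close reading of the Transformer paper), kzfaq.info/get/bejne/pOChn6l6yKm3h4U.html. Mu Li's excellent video walkthrough of the Transformer paper.
6. Attention is All You Need, arxiv.org/abs/1706.03762. The original paper on the Transformer model.
7. 世界的参数倒影:为何GPT通过Next Token Prediction可以产生智能 (A parametric reflection of the world: why GPT can produce intelligence through next-token prediction), zhuanlan.zhihu.com/p/632795115. Some deeper reflections and conclusions on LLMs by 张俊林 (Zhang Junlin).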
@davidhangzhou5113, 9 months ago
Thank you very much, teacher.
@user-nu9kh7sy5j, 6 months ago
Really well explained.
@davidhangzhou5113, 9 months ago
The explanation is quite clear.
@DavidShen-ph6iq, 7 months ago
Thank you! :)
@wadergu, 8 months ago
Quite easy to understand. One more question, teacher: what role does the input play in the decoder?
@DavidShen-ph6iq, 7 months ago
Sorry, I didn't understand what you mean by "input" and "decode".
@jackielee8089, 6 months ago
Well explained, but the audio recording quality is too poor, which makes it uncomfortable to listen to.