NEW CriticGPT by OpenAI: RLHF + FSBS

  Рет қаралды 2,552

code_your_own_AI

code_your_own_AI

15 күн бұрын

OpenAI developed an optimized RLHF plus Force Sampling Beam Search (FSBS) algorithm to improve the quality of our LLMs.
I have a deep dive why OpenAI felt the need to develop this technique and what is the status quo of our current LLM optimizations methodologies.
All rights w/ authors:
cdn.openai.com/llm-critics-he...
LLM Critics Help Catch LLM Bugs
Finding GPT-4’s mistakes with GPT-4
openai.com/index/finding-gpt4...
#aiagents
#airesearch
#openai

Пікірлер: 7
@WilliamThomas2040
@WilliamThomas2040 14 күн бұрын
Grasshopper here for class and leaving first comment. Thanks for the great videos!
@code4AI
@code4AI 13 күн бұрын
Thanks for watching!
@akg8111
@akg8111 13 күн бұрын
News agencies are the last thing I'd trust to deliver trustworthy data.
@dinoscheidt
@dinoscheidt 13 күн бұрын
Yes. If your buddy in the bar didn’t say it, it ain’t true. A great option is also astrology 🔮 I always get the facts I want
@TheRealUsername
@TheRealUsername 13 күн бұрын
They're being open again ? I didn't expect this...
@TheZEN2011
@TheZEN2011 12 күн бұрын
There AI is a bit of a copycat. Not much of a thinker. It's not really a copy as it's rewritten. But yeah they have some problems and more compute isn't going to fix it. I think they need to improve their neural nets architecture. And a bunch of other things.
@islandfireballkill
@islandfireballkill 13 күн бұрын
This is basically AI amplification. You exponentially refine the AI with AI. Now that they are doing the X^N trick where X is intelligence, the core question will be if the X is > 1 or < 1.
Adversarial Questions Test Multimodal MED AI sys
21:08
code_your_own_AI
Рет қаралды 1,3 М.
БОЛЬШОЙ ПЕТУШОК #shorts
00:21
Паша Осадчий
Рет қаралды 10 МЛН
HOW DID HE WIN? 😱
00:33
Topper Guild
Рет қаралды 45 МЛН
Survival skills: A great idea with duct tape #survival #lifehacks #camping
00:27
How ChatGPT is Trained
13:43
Ari Seff
Рет қаралды 518 М.
Decoding AI's Blind Spots: Solving Causal Reasoning
13:39
code_your_own_AI
Рет қаралды 2 М.
Proximal Policy Optimization | ChatGPT uses this
13:26
CodeEmporium
Рет қаралды 13 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 653 М.
5 Easy Ways to help LLMs to Reason
50:37
code_your_own_AI
Рет қаралды 3,9 М.
Masterclass on AI by Microsoft
20:50
code_your_own_AI
Рет қаралды 1,6 М.
Reinforcement Learning:  ChatGPT and RLHF
6:31
Graphics in 5 Minutes
Рет қаралды 9 М.
GraphRAG: LLM-Derived Knowledge Graphs for RAG
15:40
Alex Chao
Рет қаралды 92 М.
What Is an AI Anyway? | Mustafa Suleyman | TED
22:02
TED
Рет қаралды 1,2 МЛН
Easy Art with AR Drawing App - Step by step for Beginners
0:27
Melli Art School
Рет қаралды 14 МЛН
Здесь упор в процессор
18:02
Рома, Просто Рома
Рет қаралды 342 М.
Samsung Galaxy 🔥 #shorts  #trending #youtubeshorts  #shortvideo ujjawal4u
0:10
Ujjawal4u. 120k Views . 4 hours ago
Рет қаралды 7 МЛН
WATERPROOF RATED IP-69🌧️#oppo #oppof27pro#oppoindia
0:10
Fivestar Mobile
Рет қаралды 19 МЛН
$1 vs $100,000 Slow Motion Camera!
0:44
Hafu Go
Рет қаралды 23 МЛН