No video

RL1: Introduction to Reinforcement Learning: Chapter 1A Sutton & Barto TextBook

  Рет қаралды 109

Hubel Labs

Hubel Labs

Күн бұрын

This is a series of companion videos to Sutton & Barto's textbook on reinforcement learning used by some of the best universities as standard course text - including Stanford, UCL, Carnegie Mellon
Download Book for free: incompleteideas...
Buy on Amazon: www.amazon.com...
Github Code Repository: github.com/Sha...
This first video covers the first few sections of chapter one.
00:00:00 Video intro
00:00:35 Why follow Sutton & Barto's Reinforcement Learning Textbook
00:00:50 Where to download the book for free
00:01:30 Reinforcement Learning in Humans and Animals (David Silver's UCL course slide)
00:02:00 Motivations for learning reinforcement learning and importance for real life problems
00:02:30 Personalisation for marketing and online
00:02:43 Control systems in commercial climate control
00:02:55 ChatGPT & Reinforcement Learning with Human Feedback (RLHF)
00:03:10 Google Deepmind AlphaGo Zero for superhuman capability
00:04:05 RL as a type of problem and as a set of tools
00:04:20 Supervised Learning vs. Unsupervised Learning vs. Reinforcement Learning
00:06:17 Reinforcement Learning vs. Artificial Neural Networks
00:07:00 Key characteristics of reinforcement learning problems
00:07:10 Example: Pavlova vs. Mochi - Nemesis
00:07:22 Mr. Stick: Rewards and Action set
00:07:55 Pavlova's goal - as many treats as possible
00:08:15 Pavlova's environmental state
00:08:40 Stochasticity of environment
00:09:29 Pavlova's policy
00:10:00 Trial and error search for rewards
00:10:52 4 key characteristics of RL problem: goal, state, actions and sequence
00:11:30 Key components of an RL solution: Policy, Reward Signal, Value Function, Model

Пікірлер: 1
@AnoniChocolateMoose
@AnoniChocolateMoose 3 ай бұрын
Thankyou
MIT Introduction to Deep Learning | 6.S191
1:09:58
Alexander Amini
Рет қаралды 474 М.
Why Is He Unhappy…?
00:26
Alan Chikin Chow
Рет қаралды 102 МЛН
A teacher captured the cutest moment at the nursery #shorts
00:33
Fabiosa Stories
Рет қаралды 62 МЛН
👨‍🔧📐
00:43
Kan Andrey
Рет қаралды 10 МЛН
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 913 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
These Illusions Fool Almost Everyone
24:55
Veritasium
Рет қаралды 2,1 МЛН
What are AI Agents?
12:29
IBM Technology
Рет қаралды 156 М.
How I'd Learn AI (If I Had to Start Over)
15:04
Thu Vu data analytics
Рет қаралды 775 М.