No video

RL3: Python Code for Reinforcement Learning for Tic Tac Toe: Chapter 1C Sutton & Barto Text Book

  Рет қаралды 91

Hubel Labs

Hubel Labs

Күн бұрын

This is a series of companion videos to Sutton & Barto's textbook on reinforcement learning used by some of the best universities as standard course text - including Stanford, UCL, Carnegie Mellon
Download Book for free: incompleteideas...
Buy on Amazon: www.amazon.com...
Github Code Repository: github.com/Sha...
This second video covers the first few sections of chapter one.
00:00:00 Video intro
00:00:35 Overview of key code components - Loop through entire state space
00:03:56 Overview of key code components - State Class - hash, numpy array, next state method
00:06:55 Overview of key code components - Player Class - step size and epsilon-greedy configuration, backup algorithm, value table, choosing actions
00:14:45 Overview of key code components - Judger Class
00:15:30 Training function - save value table to permanent storage - binary file
00:16:54 Compete function - epsilon = 0 - greedy play, load policy
00:17:44 Python code demo and detailed explanation

Пікірлер
Get structured Json Output from OpenAI GPT API Consistently!
14:56
My Cheetos🍕PIZZA #cooking #shorts
00:43
BANKII
Рет қаралды 24 МЛН
How to connect to ChatGPT using C# [A new course on Udemy]
1:44
all about C# (CSharp)
Рет қаралды 23
Programming's Greatest Mistakes • Mark Rendle • GOTO 2023
51:24
GOTO Conferences
Рет қаралды 89 М.
These Illusions Fool Almost Everyone
24:55
Veritasium
Рет қаралды 2,1 МЛН
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 65 М.
Google Gemini vs OpenAI chatGPT 4 bake-off - SURPRISING results!
34:51
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 913 М.