46,967 views
In this brief tutorial you'll learn the fundamentals of deep reinforcement learning and the basic concepts behind actor critic methods. We'll cover the Markov decision process, the agent's policy, reward discounting and why it's necessary, and the actor critic algorithm. We'll then implement an actor critic agent in TensorFlow 2 to solve the CartPole environment from the OpenAI Gym.
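As a quick illustration of reward discounting, here is a minimal sketch in plain Python. It is not taken from the video's code; the function name discounted_returns and the gamma values are assumptions chosen for the example. With a discount factor below 1, rewards further in the future count for less, which keeps the total return finite even when an episode runs for a very long time.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every step t.

    `rewards` is the list of rewards from one episode; `gamma` is the
    discount factor (hypothetical default, not a value from the video).
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed for the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# Three steps with reward 1 each, as in CartPole, with a deliberately small gamma.
print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))  # [1.75, 1.5, 1.0]
```

The backward loop avoids recomputing the sum for every time step, so the whole episode is processed in a single pass.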
Actor critic methods form the basis for more advanced algorithms such as deep deterministic policy gradients, soft actor critic, and twin delayed deep deterministic policy gradients, among others.
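The one-step update at the heart of these methods can be sketched without any deep learning library. The snippet below shows one common formulation, assuming the critic outputs a state value and the actor outputs the log probability of the action taken; the function name actor_critic_losses and its arguments are hypothetical and may differ from the code in the video.

```python
import math


def actor_critic_losses(log_prob, value, next_value, reward, done, gamma=0.99):
    """Compute the TD error and the two losses for a single transition.

    log_prob   : log probability of the action the actor took
    value      : critic's estimate V(s) for the current state
    next_value : critic's estimate V(s') for the next state
    done       : True if the episode ended, so V(s') is ignored
    """
    # Bootstrapped target: no future value once the episode has terminated.
    target = reward + gamma * next_value * (0.0 if done else 1.0)
    delta = target - value  # temporal-difference error
    # The actor makes actions with positive delta more likely. In an
    # autodiff framework delta is usually treated as a constant in this term.
    actor_loss = -log_prob * delta
    # The critic is pushed toward the bootstrapped target.
    critic_loss = delta ** 2
    return delta, actor_loss, critic_loss


delta, actor_loss, critic_loss = actor_critic_losses(
    log_prob=math.log(0.5), value=1.0, next_value=2.0,
    reward=1.0, done=False, gamma=0.5,
)
print(delta, critic_loss)  # 1.0 1.0
```

In a TensorFlow 2 implementation the same quantities would typically be computed inside a gradient tape, with both losses summed and backpropagated through a shared network.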
You can find the code for this video here:
github.com/philtabor/KZfaq-...
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
www.neuralnet.ai/courses
Or pick up my Udemy courses here:
Deep Q Learning:
www.udemy.com/course/deep-q-l...
Actor Critic Methods:
www.udemy.com/course/actor-cr...
Curiosity Driven Deep Reinforcement Learning:
www.udemy.com/course/curiosit...
Natural Language Processing from First Principles:
www.udemy.com/course/natural-...
Reinforcement Learning Fundamentals:
www.manning.com/livevideo/rei...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: bit.ly/3fXHy8W
Grokking Deep Learning: bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: bit.ly/2VNAXql
Come hang out on Discord here:
/ discord
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Website: www.neuralnet.ai
Github: github.com/philtabor
Twitter: / mlwithphil