The Reinforcement Learning Workshop

Learn how to apply cutting-edge reinforcement learning algorithms to a wide range of control problems

Product type: Paperback
Published in: Aug 2020
Publisher: Packt
ISBN-13: 9781800200456
Length: 822 pages
Edition: 1st Edition
Authors (9):
Dr. Alexandra Galina Petre
Anand N.S.
Quan Nguyen
Anthony So
Mayur Kulkarni
Aritra Sen
Alessandro Palmas
Emanuele Ghelfi
Saikat Basak
Table of Contents (14 chapters)

Preface
1. Introduction to Reinforcement Learning
2. Markov Decision Processes and Bellman Equations
3. Deep Learning in Practice with TensorFlow 2
4. Getting Started with OpenAI and TensorFlow for Reinforcement Learning
5. Dynamic Programming
6. Monte Carlo Methods
7. Temporal Difference Learning
8. The Multi-Armed Bandit Problem
9. What Is Deep Q-Learning?
10. Playing an Atari Game with Deep Recurrent Q-Networks
11. Policy-Based Methods for Reinforcement Learning
12. Evolutionary Strategies for RL
Appendix

Introduction

In the previous chapter, we introduced the OpenAI Gym environment and learned how to implement custom environments for a given application. We also covered the basics of TensorFlow 2, how to implement a policy using the TensorFlow 2 framework, and how to visualize learning using TensorBoard. In this chapter, we will first see how Dynamic Programming (DP) works in general, from a computer science perspective. Then, we'll go over how and why it is used in RL. Next, we will dive deep into the classic DP algorithms (policy evaluation, policy iteration, and value iteration) and compare them. Lastly, we will implement these algorithms on the classic coin-change problem.

DP is one of the most fundamental topics in computer science. Furthermore, RL algorithms such as value iteration and policy iteration, as we will see, rely on the same basic principle: avoid repeated computations to save time, which is what DP is all about. The philosophy...
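
The idea of avoiding repeated computations can be illustrated on the coin-change problem mentioned earlier. The following is a minimal sketch and not the chapter's own implementation: the function names `min_coins_naive` and `min_coins_dp`, the call counter, and the coin denominations are assumptions chosen for illustration. It compares plain recursion with a memoized version that solves each sub-amount at most once.

```python
from functools import lru_cache


def min_coins_naive(amount, coins, counter):
    """Plain recursion: the same sub-amounts are re-solved many times.

    `counter` is a one-element list used only to count function calls.
    Returns float('inf') if the amount cannot be formed.
    """
    counter[0] += 1
    if amount == 0:
        return 0
    best = float("inf")
    for coin in coins:
        if coin <= amount:
            best = min(best, 1 + min_coins_naive(amount - coin, coins, counter))
    return best


def min_coins_dp(amount, coins):
    """Memoized recursion: each distinct sub-amount is solved at most once.

    Returns (minimum number of coins, number of sub-problems actually solved).
    """
    coins = tuple(coins)

    @lru_cache(maxsize=None)
    def solve(remaining):
        if remaining == 0:
            return 0
        best = float("inf")
        for coin in coins:
            if coin <= remaining:
                best = min(best, 1 + solve(remaining - coin))
        return best

    result = solve(amount)
    # Cache misses correspond to sub-amounts that had to be computed.
    return result, solve.cache_info().misses


if __name__ == "__main__":
    coins = (1, 2, 5)  # hypothetical denominations
    amount = 11

    calls = [0]
    naive_result = min_coins_naive(amount, coins, calls)
    dp_result, solved = min_coins_dp(amount, coins)

    print("naive:", naive_result, "calls:", calls[0])
    print("dp:   ", dp_result, "sub-problems solved:", solved)
```

Both functions return the same answer, but the memoized version solves no more than `amount + 1` sub-problems, while the call count of the naive version grows quickly with the amount. The same trade of memory for time underlies the value-table updates used in policy evaluation, policy iteration, and value iteration.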
