Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Machine Learning for Developers Uplift your regular applications with the power of statistics, analytics, and machine learning

Product type Paperback

Published in Oct 2017

Publisher Packt

ISBN-13 9781786469878

Length 270 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Authors (2):

Md Mahmudul Hasan

Rodolfo Bonnin

View More author details

Table of Contents (10) Chapters

Preface

1. Introduction - Machine Learning and Statistical Science

2. The Learning Process FREE CHAPTER

3. Clustering

4. Linear and Logistic Regression

5. Neural Networks

6. Convolutional Neural Networks

7. Recurrent Neural Networks

8. Recent Models and Developments

9. Software Installation and Configuration

Basic RL techniques: Q-learning

One of the most well-known reinforcement learning techniques, and the one we will be implementing in our example, is Q-learning.

Q-learning can be used to find an optimal action for any given state in a finite Markov decision process. Q-learning tries to maximize the value of the Q-function that represents the maximum discounted future reward when we perform action a in state s.

Once we know the Q-function, the optimal action a in state s is the one with the highest Q-value. We can then define a policy π(s), that gives us the optimal action in any state, expressed as follows:

We can define the Q-function for a transition point (s_t, a_t, r_t, s_t+1) in terms of the Q-function at the next point (s_t+1, a_t+1, r_t+1, s_t+2), similar to what we did with the total discounted future reward. This equation is known as the Bellman equation for Q-learning...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Hasan

See other products by Hasan

Bonnin

Rodolfo Bonnin is a systems engineer and Ph.D. student at Universidad Tecnolgica Nacional, Argentina. He has also pursued parallel programming and image understanding postgraduate courses at Universitt Stuttgart, Germany. He has been doing research on high-performance computing since 2005 and began studying and implementing convolutional neural networks in 2008, writing a CPU- and GPU-supporting neural network feedforward stage. More recently he's been working in the field of fraud pattern detection with Neural Networks and is currently working on signal classification using machine learning techniques. He is also the author of Building Machine Learning Projects with Tensorflow and Machine Learning for Developers by Packt Publishing.

See other products by Bonnin