Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Hands-On Neural Networks with Keras Design and create neural networks using deep learning and artificial intelligence principles

Product type Paperback

Published in Mar 2019

Publisher Packt

ISBN-13 9781789536089

Length 462 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Artificial Intelligence

Author (1):

Niloy Purkait

View More author details

Table of Contents (16) Chapters

Preface

1. Section 1: Fundamentals of Neural Networks FREE CHAPTER

2. Overview of Neural Networks

3. A Deeper Dive into Neural Networks

4. Signal Processing - Data Analysis with Neural Networks

5. Section 2: Advanced Neural Network Architectures

6. Convolutional Neural Networks

7. Recurrent Neural Networks

8. Long Short-Term Memory Networks

9. Reinforcement Learning with Deep Q-Networks

10. Section 3: Hybrid Model Architecture

11. Autoencoders

12. Generative Networks

13. Section 4: Road Ahead

14. Contemplating Present and Future Developments

15. Other Books You May Enjoy

Leave a review - let other readers know what you think

Double Q-learning

Another augmentation to the standard Q-learning model we just built is the idea of Double Q-learning, which was introduced by Hado van Hasselt (2010, and 2015). The intuition behind this is quite simple. Recall that, so far, we were estimating our target values for each state-action pair using the Bellman equation and checking how far off the mark our predictions are at a given state, like so:

However, a problem arises from estimating the maximum expected future reward in this manner. As you may have noticed earlier, the max operator in the target equation (y_t) uses the same Q-values to evaluate a given action as the ones that are used to predict a given action for a sampled state. This introduces a propensity for overestimation of Q-values, eventually even spiraling out of control. To compensate for such possibilities, Van Hasselt et al. (2016) implemented...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Purkait

Niloy Purkait is a technology and strategy consultant by profession. He currently resides in the Netherlands, where he offers his consulting services to local and international companies alike. He specializes in integrated solutions involving artificial intelligence, and takes pride in navigating his clients through dynamic and disruptive business environments. He has a masters in Strategic Management from Tilburg University, and a full specialization in data science from Michigan University. He has advanced industry grade certifications from IBM, in subjects like signal processing, cloud computing, machine and deep learning. He is also perusing advanced academic degrees in several related fields, and is a self-proclaimed lifelong learner.

See other products by Purkait