- DQN computes the Q value directly whereas Dueling DQN breaks down the Q value computation into value function and advantage function.
- Refer section Replay memory.
- When we use the same network for predicting target value and predicted value there will lot of divergence so we use separate target network.
- Refer section Replay memory.
- Refer section Dueling network.
- Dueling DQN breaks down the Q value computation into value function and advantage function whereas double DQN uses two Q function to avoid overestimation.
- Refer section Dueling network.




















































