- A3C is the Asynchronous Advantage Actor Critic network which uses several agents to learn parallel.
- Three A's are Asynchronous, Advantage, Actor Critic.
- A3C requires less computation power and training time than DQN.
- All agents (workers) works in copies of the environment and then global network aggregate their experience.
- Entropy is used to ensure enough exploration.
- Refer section How A3C works.




















































