- What is reinforcement learning (RL)?
- What is the end goal of an agent?
- What are the main differences between supervised learning and RL?
- What are the benefits of combining deep learning and RL?
- Where does the term "reinforcement" come from?
- What is the difference between a policy and a value function?
- Can the model of an environment be learned through interacting with it?