Deep-Reinforcement-Learning

Implemented Deep Q-Network (DQN) and Double DQN on the game of Breakout using the OpenAI Gym.
Double DQN uses a target network to calculate the target Q-value (next state maximum) whereas the vanilla DQN uses the same network for computing Q values for both the current and next state.
All calculations were run on Google Colab. Due to limitations in availability of computational resources, the model could be trained only for 500 episodes.