Python & TensorFlow implementation of the PG Reinforce for deep reinforcement learning.
Dependencies:
- TensorFlow
- OpenAI Gym
- numpy
Run the example by cloning the repository and typing python main.py
At the beginning, the pole is going to fall easily.
Given enough training time (400~500 episodes), the cart will be able to keep it upright.
MIT