Reinforce Policy in TensorFlow

Python & TensorFlow implementation of the PG Reinforce for deep reinforcement learning.

Dependencies:

Run the example by cloning the repository and typing python main.py

Training results

At the beginning, the pole is going to fall easily.

Given enough training time (400~500 episodes), the cart will be able to keep it upright.

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
main.py		main.py
rl_pg_reinforce.py		rl_pg_reinforce.py