In this rosject, we show you how to build a complete training system for a cartpole simulated in Gazebo and controlled with ROS, using the OpenAI framework.
You can get the full ROS training code, documentation in the form of a Jupyter notebook, the openai_ros package, and the cartpole Gazebo simulation here:
http://www.rosject.io/l/300c9b15-8153-4e8b-aac9-5a675814eb55/