Deep Learning Course Notes (14): Deep Reinforcement Learning --- Proximal Policy Optimization (PPO)
2018-07-17 16:54:51
Reference: https://blog.openai.com/openai-baselines-ppo/
Code: https://github.com/openai/baselines/tree/master/baselines/ppo2
Paper: https://arxiv.org/pdf/1707.06347.pdf
Video Tutorials: https://www.youtube.com/watch?v=OAKAZhFmYoI&t=1s
Original post: https://www.cnblogs.com/wangxiaocvpr/p/9324316.html