contact, friction, etc. are unknown
CS294-112 深度强化学习 秋季学期(伯克利)NO.21 Guest lecture: Aviv Tamar (Combining Reinforcement Learning and Planning)
原文:https://www.cnblogs.com/ecoflex/p/9114106.html