Model-free: high variance. Model-based: high bias.
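A toy sketch of that trade-off, not taken from the lecture. It assumes a made-up chain whose reward at every step is drawn from N(1, 1), and a hypothetical learned model that wrongly predicts a mean reward of 0.8. Sampled returns stand in for the model-free estimate and the model's predicted return stands in for the model-based one. All names and numbers here are illustrative assumptions.

```python
import random
import statistics

GAMMA = 0.9
HORIZON = 20


def sampled_return(rng):
    """Model-free style: one discounted return sampled from the toy environment."""
    return sum((GAMMA ** t) * rng.gauss(1.0, 1.0) for t in range(HORIZON))


def predicted_return(mean_reward):
    """Model-based style: discounted return under a model with a fixed mean reward."""
    return sum((GAMMA ** t) * mean_reward for t in range(HORIZON))


rng = random.Random(0)

# Exact value of the toy chain, since the true expected reward is 1 at every step.
true_value = predicted_return(1.0)

# Model-free: each sampled return is unbiased but noisy.
samples = [sampled_return(rng) for _ in range(2000)]
mc_bias = statistics.mean(samples) - true_value
mc_variance = statistics.variance(samples)

# Model-based: the (assumed) misspecified model gives a noise-free but shifted answer.
mb_bias = predicted_return(0.8) - true_value

print(f"model-free : bias {mc_bias:+.3f}, per-sample variance {mc_variance:.3f}")
print(f"model-based: bias {mb_bias:+.3f}, variance 0 (model held fixed)")
```

The model-based variance is zero here only because the model is held fixed. A model fitted from data would carry its own estimation noise.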
Within 1 hour of human demonstration for each task,
VR!!!
Deep RL Bootcamp TAs Research Overview
Original post: https://www.cnblogs.com/ecoflex/p/8990885.html