Reposted from: http://blog.csdn.net/songrotek/article/details/51382759
Blog address: http://blog.csdn.net/songrotek/article/category/5419801
Reinforcement Learning: A Review of Classic Algorithms, Part 3: TD Methods
Original post: http://www.cnblogs.com/1995hxt/p/6812372.html