Learning Notes: Morvan - Reinforcement Learning, Part 1: Q-learning
原文:http://www.cnblogs.com/casperwin/p/6305351.html