python-Dynamic-Programming-PROSAGA-码农传奇

项目作者： antonio-f

项目描述：

  Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

高级语言： Jupyter Notebook

项目主页：

项目地址: git://github.com/antonio-f/Dynamic-Programming.git

创建时间： 2019-04-03T20:44:36Z
项目社区：https://github.com/antonio-f/Dynamic-Programming
开源协议：
下载