项目作者: antonio-f

项目描述 :
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
高级语言: Jupyter Notebook
项目地址: git://github.com/antonio-f/Dynamic-Programming.git
创建时间: 2019-04-03T20:44:36Z
项目社区:https://github.com/antonio-f/Dynamic-Programming

开源协议:

下载