项目作者: EyaRhouma

项目描述 :
Solving some gym env using policy based methods: hill climbing and Cross Entropy Method
高级语言: Jupyter Notebook
项目地址: git://github.com/EyaRhouma/Policy_based_method.git
创建时间: 2019-09-15T15:26:02Z
项目社区:https://github.com/EyaRhouma/Policy_based_method

开源协议:

下载


Value based method

In this repository we’re trying to solve 2 openAI Gym’s env using two policy based methods: Hill climbing and Cross-Entropy method

Hill Climbing

Hill_Climbing.ipynb is an implementation of hill climbing with adaptive noise scaling for OpenAI Gym’s Cartpole environment.

Result

Trained Agent

Cross-Entropy Method

CEM.ipynb is an implementation of the cross-entropy method for OpenAI Gym’s MountainCarContinuous environment.

Result

Trained Agent

Additionals

For more well explained methods for policy based method here’s a good blog:

http://kvfrans.com/simple-algoritms-for-solving-cartpole/

—> corresponding github: https://github.com/kvfrans/openai-cartpole