PPO with Hindsight Experience Replay (HER)
OpenAI’s March 2018 Requests for Research highlighted combining HER with other recent advances in RL as an open research direction. The goal of HER Variations is to explore these combinations; this one pairs HER with PPO.
Requires OpenAI Baselines, which can be installed from https://github.com/openai/baselines
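
For readers new to HER, the core idea is hindsight goal relabeling: a failed episode is replayed as if the goal it actually reached had been the intended goal, so sparse rewards become informative. The snippet below is a minimal, standalone sketch of the "final" relabeling strategy only. It does not use Baselines and is not this repository's implementation; the names `sparse_reward`, `relabel_with_final_goal`, the episode dictionary layout, and the tolerance `tol` are all hypothetical. HER was introduced for off-policy learners, so how relabeled transitions are fed into PPO's on-policy update is deliberately left out.

```python
import math


def sparse_reward(achieved_goal, desired_goal, tol=0.05):
    """Hypothetical sparse reward: 0.0 within tol of the goal, else -1.0."""
    return 0.0 if math.dist(achieved_goal, desired_goal) <= tol else -1.0


def relabel_with_final_goal(episode, reward_fn=sparse_reward):
    """Relabel every step with the goal achieved at the end of the episode.

    `episode` is assumed to be a list of dicts with the keys 'obs',
    'action', 'achieved_goal' (state reached after the action) and
    'desired_goal'. The input is not modified; copies are returned.
    """
    hindsight_goal = episode[-1]["achieved_goal"]
    relabeled = []
    for step in episode:
        new_step = dict(step)  # shallow copy so the original stays intact
        new_step["desired_goal"] = hindsight_goal
        new_step["reward"] = reward_fn(step["achieved_goal"], hindsight_goal)
        relabeled.append(new_step)
    return relabeled


# Toy 1-D episode that never reaches the original goal at 1.0.
episode = [
    {"obs": (0.0,), "action": 1, "achieved_goal": (0.1,), "desired_goal": (1.0,)},
    {"obs": (0.1,), "action": 1, "achieved_goal": (0.2,), "desired_goal": (1.0,)},
    {"obs": (0.2,), "action": 1, "achieved_goal": (0.3,), "desired_goal": (1.0,)},
]

relabeled = relabel_with_final_goal(episode)
rewards = [step["reward"] for step in relabeled]
print(rewards)
```

Under the original goal every step would receive the failure reward; after relabeling, the last step counts as a success for the substituted goal, which is the extra learning signal HER provides.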