"Learning to learn by gradient descent by gradient descent "by PyTorch -- a simple re-implementation.