Learning to sort numbers using a LSTM with a modified attention mechanism (Pointer Networks by Vinyals et al.).