Proximal Policy Optimization

OpenAI is implementing a new reinforcement learning algorithm, Proximal Policy Optimization, which will allow them to train AI policies. What are your thoughts about this? Let us know in the comments!
Read the full article here.