Deep Policy Gradient Algorithms: A Closer Look

4 209
15.4
Опубликовано 16 мая 2019, 22:57
Deep reinforcement learning methods are behind some of the most publicized recent results in machine learning. In spite of these successes, however, deep RL methods face a number of systemic issues: brittleness to small changes in hyperparameters, high reward variance across runs, and sensitivity to seemingly small algorithmic changes.

In this talk, we take a closer look at the potential root of these issues. Specifically, we study how the policy gradient primitives underlying popular deep RL algorithms reflect the principles informing their development.

See more at microsoft.com/en-us/research/v...
автотехномузыкадетское