Deep Policy Gradient Algorithms: A Closer Look

4 193
15.4
Следующее
Популярные
Опубликовано 16 мая 2019, 22:57
Deep reinforcement learning methods are behind some of the most publicized recent results in machine learning. In spite of these successes, however, deep RL methods face a number of systemic issues: brittleness to small changes in hyperparameters, high reward variance across runs, and sensitivity to seemingly small algorithmic changes.

In this talk, we take a closer look at the potential root of these issues. Specifically, we study how the policy gradient primitives underlying popular deep RL algorithms reflect the principles informing their development.

See more at microsoft.com/en-us/research/v...
Свежие видео
6 дней – 2 549 6370:18
Android x Wicked: Wickedly Open to Elphaba
9 дней – 46 82111:55
Armor for Cowboys?!
14 дней – 1 262 27510:55
Why is EVERYONE buying this headset?
Случайные видео
243 дня – 115 2331:03
Doogee T30 Series | Doogee T30 Max
22.03.20 – 274 5200:22
MIUI: Switch Between Apps
автотехномузыкадетское