Microsoft Research330 тыс
Опубликовано 20 сентября 2016, 16:46
Although reinforcement learning (RL) has had many successes, significant amounts of time and/or data can be required to reach acceptable performance. If agents or robots are to be deployed in real world environments, it is critical that our algorithms take advantage of existing human knowledge. This talk will discuss a selection of our recent work that improves RL by leveraging 1) demonstrations and 2) reward feedback from imperfect users.
Свежие видео
Случайные видео