Research talk: Towards efficient generalization in continual RL using episodic memory

Microsoft Research

Published January 25, 2022, 1:40

Speaker: Mandana Samiei, PhD Student, McGill University and Mila (Quebec AI Institute)

Reinforcement learning (RL) is a powerful, brain-inspired framework for training artificial agents to make sequential decisions. In this talk, the researchers consider two scenarios in which RL can be challenging: the first is when non-stationarity plays an important role in the environment, and the second is when the data and compute available to the agent are limited. They then discuss mitigation principles inspired by the brain’s capacity for episodic memory, that is, the subjective memory of specific previous events. However, the classical implementation of episodic memory in RL is computationally inefficient at storing and retrieving information, and simple episodic memories do not generalize well to novel tasks. Although episodic memory has recently improved the speed of learning in RL, efficient generalization remains an open area for future exploration. The researchers propose that a more realistic view of episodic memory is one that incorporates predictive schemata into an external inference algorithm, which could theoretically help with generalization in RL.
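To make the cost of storing and retrieving concrete, here is a minimal sketch of a simple nearest-neighbour episodic memory. It is not taken from the talk: the class name `EpisodicMemory`, its methods, the Euclidean distance, and the default value of 0.0 for unseen actions are all assumptions chosen for illustration. Every experience is stored as a separate entry, and every lookup scans the whole store, which is roughly the kind of inefficiency the abstract refers to.

```python
import math


class EpisodicMemory:
    """Hypothetical nearest-neighbour episodic memory (illustrative only)."""

    def __init__(self):
        # One entry per stored experience: (state, action, return).
        self.entries = []

    def write(self, state, action, episodic_return):
        # Storage grows by one entry for every experience written.
        self.entries.append((tuple(state), action, float(episodic_return)))

    def estimate(self, state, action, k=1):
        # Retrieval is a linear scan over every stored entry.
        matches = [
            (math.dist(state, s), g)
            for s, a, g in self.entries
            if a == action
        ]
        if not matches:
            return 0.0  # assumed default for actions never seen before
        nearest = sorted(matches)[:k]
        # Average the returns of the k closest stored states.
        return sum(g for _, g in nearest) / len(nearest)

    def act(self, state, actions, k=1):
        # Greedy choice over the retrieved value estimates.
        return max(actions, key=lambda a: self.estimate(state, a, k))


mem = EpisodicMemory()
mem.write((0.0, 0.0), "left", 1.0)
mem.write((0.0, 0.0), "right", 5.0)
mem.write((10.0, 10.0), "left", 9.0)
choice_near_origin = mem.act((0.1, 0.0), ["left", "right"])
choice_far_away = mem.act((9.0, 9.0), ["left", "right"])
```

In this sketch the agent can only reuse returns from states close to ones it has already visited, which also hints at why such a memory may struggle to generalize to novel tasks.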

Learn more about the 2021 Microsoft Research Summit: Aka.ms/researchsummit
