Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for VLN

1 205
22.3
Опубликовано 18 июня 2019, 18:25
Vision-Language Navigation is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments. We propose a novel Reinforced Cross-Modal Matching (RCM) approach that enforces cross-modal grounding both locally and globally via reinforcement learning (RL) and further introduce a Self-Supervised Imitation Learning (SIL) method to explore unseen environments by imitating its own past, good decisions.
Свежие видео
15 часов – 2050:22
Workout Leggings You'll Love!
21 час – 78 7539:13
iPhone 16 - 7 NEW Leaks!
22 часа – 72411:10
Repair | Surface Laptop Go 3
автотехномузыкадетское