Understanding Visual Scenes: Where are We?

Microsoft Research

Published August 17, 2016, 3:13

Human visual scene understanding is remarkable: with only a brief glance at an image, an abundance of information is available, including scene category, 3D spatial structure, and the identity of the main objects in the scene. In the last two decades, there has been much progress towards building computer systems with general visual perception ability. In the first part of the talk, I will present results of scene recognition on a new database with an exhaustive set of scene categories. With hundreds of categories available, we can for the first time test how well global features classify scenes into categories covering most of the places encountered by humans. We evaluate numerous state-of-the-art algorithms for scene recognition, establish new bounds of computer performance, and compare them with human performance. In the second part of the talk, I will show that scene understanding seems to be mature enough for real-world applications in certain domains. I will demonstrate results on semantic segmentation of street-view images into classes such as buildings and trees. I will also showcase several possible applications of scene understanding, including 3D reconstruction of building mesh models, prediction of how memorable an image is, and extrapolation of an image beyond its boundaries. This is joint work with Antonio Torralba, Long Quan, Aude Oliva, James Hays, Krista Ehinger and Phillip Isola.
