Oral Session: Deep Visual Analogy-Making


Microsoft Research


Published June 6, 2016, 22:50

In addition to identifying the content within a single image, relating images and generating related images are critical tasks for image understanding. Recently, deep convolutional networks have yielded breakthroughs in producing image labels, annotations and captions, but have only just begun to be used for producing high-quality image outputs. In this paper we develop a novel deep network trained end-to-end to perform visual analogy making, which is the task of transforming a query image according to an example pair of related images. Solving this problem requires both accurately recognizing a visual relationship and generating a transformed query image accordingly. Inspired by recent advances in language modeling, we propose to solve visual analogies by learning to map images to a neural embedding in which analogical reasoning is simple, such as by vector subtraction and addition. In experiments, our model effectively models visual analogies on several datasets: 2D shapes, animated video game sprites, and 3D car models.
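The embedding arithmetic mentioned in the abstract can be illustrated with a small sketch. It is not the model from the paper: the embeddings below are hand-made two-dimensional vectors (hypothetical features for position and rotation) rather than outputs of a trained convolutional encoder, and a nearest-neighbor lookup over a few candidates stands in for generating the output image. The function names `analogy_embedding` and `nearest` are illustrative assumptions.

```python
import numpy as np

def analogy_embedding(f_a, f_b, f_c):
    """Apply the a -> b offset to c in embedding space: f_c + (f_b - f_a)."""
    return f_c + (f_b - f_a)

def nearest(target, candidates):
    """Return the index of the candidate embedding closest to target (Euclidean)."""
    dists = np.linalg.norm(candidates - target, axis=1)
    return int(np.argmin(dists))

# Hypothetical hand-made embeddings: [x-position, rotation in degrees].
f_a = np.array([0.0, 0.0])    # example image a
f_b = np.array([0.0, 90.0])   # example image b: a rotated by 90 degrees
f_c = np.array([5.0, 0.0])    # query image c

# Candidate outputs; a trained decoder would generate an image instead.
candidates = np.array([
    [5.0, 0.0],    # query unchanged
    [5.0, 90.0],   # query rotated by 90 degrees
    [0.0, 90.0],   # copy of b
])

pred = analogy_embedding(f_a, f_b, f_c)
best = nearest(pred, candidates)
print(pred, best)
```

In this toy setting the offset between a and b encodes only the rotation, so adding it to the query selects the rotated query rather than a copy of b.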

research.microsoft.com
