Vision to Language

Published June 27, 2016, 21:58
Recent advances in computer vision, natural language processing, and related areas have led to renewed interest in artificial intelligence applications that span multiple domains. In particular, the generation of natural, human-like captions for images has attracted a sharp rise in interest. In this session, the speakers provide insight into this area. They describe several techniques that combine state-of-the-art computer vision methods with language models to produce descriptions of visual content of surprisingly high quality. They also emphasize the limitations of current approaches and the challenges that lie ahead.