Microsoft Research330 тыс
Следующее
Опубликовано 6 сентября 2016, 16:59
This presentation introduces a discriminative model for the retrieval of pictures from text queries. The core idea of this approach is to minimize a loss directly related to the retrieval performance of the model. For that purpose, we formalize the retrieval task as a ranking problem, and introduce a learning procedure optimizing a loss function related to the ranking performance. This strategy hence addresses the retrieval problem directly and does not rely on an intermediate image annotation task, which contrasts with previous research. Moreover, our learning procedure builds upon recent work on the online learning of kernel- based classifiers. This yields an efficient, scalable algorithm, which can benefit from recent kernels developed for image comparison. The experiments performed over stock photography data show the advantage of our discriminative ranking approach over state-of-the- art alternatives (e.g. our model yields 26.3, for the best alternative model evaluated). Further analysis of the results shows that our model is especially advantageous over difficult queries such as queries with few relevant pictures or multiple-word queries.