Microsoft Research334 тыс
Опубликовано 27 июня 2016, 18:37
Entities and the relations among them are central for representing our knowledge about the world. My work concentrates on discovering relations from information sources available to us, including unstructured text corpora and structured data. Previous work for relation extraction can be classified into two categories in terms of how they represent relations. The first is based on supervised learning, representing relations using pre-defined types from knowledge bases. This approach relies on human efforts to define relation types. The second generalizes the Open IE style relation extraction, representing relations as clusters of textual patterns. This assumes semantic equivalence among patterns falling into the same cluster, failing to represent the diversity and ambiguity of the patterns. I will present a new approach, Universal Schema – the union of all relations seen among surface patterns and available structured knowledge bases. This representation preserves the diversity and ambiguity of textual patterns and allows us to generalize among them. In this talk, I will explain how to perform relation extraction in universal schema. We formalize the task as matrix completion and employ matrix factorization to learn implications among relations. Experiments demonstrate that using universal schema for relation extraction provides new state-of-the-art accuracy. We also extend universal schema to entity type extraction.
Свежие видео
Случайные видео
How do I fix the error “canceling statement due to conflict with recovery” for a read replica query?