Who 'Dat? Identity resolution in large email collections

Published September 7, 2016, 17:52
Automated techniques that support the human activities of search and sense-making in large email collections are increasingly important for a broad range of uses, including historical scholarship and e-discovery incident to civil litigation. In this talk, I'll briefly describe some of the work to date on searching large email collections, and then for most of the talk I will focus on the more challenging task of supporting sense-making. Specifically, I'll describe joint work with Tamer Elsayed on automatically resolving the identity of people who are mentioned ambiguously (e.g., just by first name) in a collection of email from a failed corporation (Enron). Our results indicate that, for people who are well represented in the collection, we can use a generative model to guess the right identity about 80% of the time. I'll conclude with a few remarks on our next directions for techniques, evaluation, and additional types of collections to which similar ideas might be applied.