AWS re:Invent 2017: Tensors for Large-scale Topic Modeling and Deep Learning (MCL337)

2 930
30.5
Опубликовано 30 ноября 2017, 15:59
Tensors are higher order extensions of matrices that can incorporate multiple modalities and encode higher order relationships in data. This session will present recently developed tensor algorithms for topic modeling and deep learning with vastly improved performance over existing methods.

Topic models enable automated categorization of large document corpora, without requiring labeled data for training. They go beyond simple clustering since they allow for documents to have multiple topics. Tensor methods provide a fast and a guaranteed method for training these models. They incorporate co-occurrence statistics of triplets of words in documents. We are releasing a fast and a robust implementation that vastly outperform existing solutions while providing significantly faster training times and better topic quality. Moreover, training and inference are decoupled in our algorithm, so the user can select the relevant part based on their requirements. We will present benchmarks across multiple datasets of different sizes and AWS instance types, and provide notebook examples.
Случайные видео
96 дней – 34 67432:05
The Level1 Show August 14 2024: uBlock'd
22.12.22 – 13 30810:54
Introduction to FLEDGE
22.12.21 – 4 4943:01
Newegg Marketplace
20.05.09 – 40 4910:16
15 second search tip: Time
автотехномузыкадетское