LSH-Sampling Breaks the Computational Chicken-and-Egg Loop in Adaptive Stochastic Gradient Estimation

Published September 14, 2018, 17:04
Stochastic Gradient Descent (SGD) is the most popular algorithm for large-scale optimization. In SGD, the gradient is estimated by uniform sampling with sample size one. Several results show that weighted non-uniform sampling yields better gradient estimates and hence faster convergence. Unfortunately, maintaining this adaptive distribution costs more per iteration than computing the exact gradient itself, creating a chicken-and-egg loop that makes the faster convergence useless. In this paper, we break this barrier with the first demonstration of a sampling scheme that yields superior gradient estimates while keeping the per-iteration sampling cost similar to that of uniform sampling. Such a scheme is made possible by recent advances in the Locality Sensitive Hashing (LSH) literature. As a consequence, we improve the running time of all existing gradient descent algorithms.
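The flavor of the idea can be conveyed with a short sketch. The code below is illustrative only, not the authors' implementation: it assumes a least-squares loss (where the gradient magnitude of example i grows with |⟨θ, x_i⟩ − y_i|·‖x_i‖), uses SimHash (random hyperplane) bucketing as the LSH scheme, and omits the importance-weight correction needed to keep the gradient estimator unbiased. The names SimHashTable and lsh_sgd are hypothetical.

```python
import numpy as np

# Sketch of LSH-guided adaptive sampling for SGD on least-squares regression.
# Idea: examples whose inputs align with the current parameter vector tend to
# carry larger gradients; SimHash buckets group such examples cheaply, so
# sampling from the query bucket biases SGD toward informative examples while
# keeping per-iteration cost close to uniform sampling.
# Note: a faithful estimator would also reweight each sampled gradient by the
# inverse of its (collision) probability; that correction is omitted here.

class SimHashTable:
    def __init__(self, dim, n_bits, seed=0):
        rng = np.random.default_rng(seed)
        self.planes = rng.standard_normal((n_bits, dim))  # random hyperplanes
        self.buckets = {}

    def _key(self, v):
        # Sign pattern of the projections -> bucket key.
        return tuple((self.planes @ v > 0).astype(np.int8))

    def build(self, X):
        for i, x in enumerate(X):
            self.buckets.setdefault(self._key(x), []).append(i)

    def query(self, q, rng):
        # Random index from the query's bucket, if anything collides.
        hits = self.buckets.get(self._key(q), [])
        return int(rng.choice(hits)) if hits else None


def lsh_sgd(X, y, n_iters=5000, lr=1e-2, n_bits=8, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    table = SimHashTable(d, n_bits, seed)
    table.build(X)                      # one-time preprocessing
    theta = np.zeros(d)
    for _ in range(n_iters):
        i = table.query(theta, rng)     # biased toward large |<theta, x_i>|
        if i is None:
            i = rng.integers(n)         # empty bucket: fall back to uniform
        grad = (X[i] @ theta - y[i]) * X[i]
        theta -= lr * grad
    return theta


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.standard_normal((2000, 20))
    true_theta = rng.standard_normal(20)
    y = X @ true_theta + 0.01 * rng.standard_normal(2000)
    est = lsh_sgd(X, y)
    print("parameter error:", np.linalg.norm(est - true_theta))
```

The key property this sketch relies on is that the hash table is built once, so each iteration only pays for hashing the current parameter vector and one gradient evaluation, which is the sense in which the adaptive sampling cost stays comparable to uniform sampling.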

See more at microsoft.com/en-us/research/v...