New Directions in Robust Automatic Speech Recognition

446

24.8

Microsoft Research330 тыс

Следующее

06.09.16 – 271:03:48

More Natural Programming Through User Studies

Популярные

18 дней – 88748:11

AI for Business Transformation: Lessons from Healthcare

03.05.23 – 8124:06

Escapement: A Tool for Interactive Prototyping with Video via Sensor-Mediated Abstraction of Time

Опубликовано 6 сентября 2016, 5:37

As speech recognition technology is transferred from the laboratory to the marketplace, robustness in recognition is becoming increasingly important.┬á This talk will review and discuss several classical and contemporary approaches to robust speech recognition. ┬á The most tractable types of environmental degradation are produced by quasi-stationary additive noise and quasi-stationary linear filtering.┬á These distortions can be largely ameliorated by the classical techniques of cepstral high-pass filtering (as exemplified by cepstral mean normalization and RASTA filtering), as well as by techniques that develop statistical models of the distortion (such as codeword-dependent cepstral normalization and vector Taylor series expansion).┬á Nevertheless, these types of approaches fail to provide much useful improvement when speech is degraded by transient or non-stationary noise such as background music or speech.┬á We describe and compare the effectiveness of techniques based on missing-feature compensation, multi-band analysis, feature combination, and physiologically-motivated auditory scene analysis toward providing increased recognition accuracy in difficult acoustical environments.

Свежие видео