Real-time Single-channel Speech Enhancement with Recurrent Neural Networks

9 569
18.9
Следующее
Популярные
23.11.22 – 6 5133:18
Causal AI for Decision Making
Опубликовано 11 сентября 2019, 0:11
Single-channel speech enhancement using deep neural networks (DNNs) has shown promising progress in recent years. In this work, we explore several aspects of neural network training that impact the objective quality of enhanced speech in a real-time setting. In particular, we base all studies on a novel recurrent neural network that enhances full-band short-time speech spectra on a single-frame-in, single-frame-out basis, a framework that is adopted by most classical signal processing methods. We propose two novel learning objectives that allow separate control over expected speech distortion versus noise suppression. Moreover, we study the effect of feature normalization and sequence lengths on the objective quality of enhanced speech. Finally, we compare our method with state-of-the-art methods based on statistical signal processing and deep learning, respectively.

Slides: microsoft.com/en-us/research/u...

Learn more about the Audio and Acoustics Research Group: microsoft.com/en-us/research/g...
Свежие видео
15 дней – 255 9759:52
Solve your problems! UnifyDrive UT2
15 дней – 369 7080:29
Discover local delights with #AdvancedAI
19 дней – 47 7770:23
Take your time ⏰
автотехномузыкадетское