Dereverberation Suppression for Improved Speech Recognition and Human Perception

362
60.3
Опубликовано 6 сентября 2016, 5:43
The factors that harm the speech recognition results for un-tethered users are the ambient noise and the reverberation. While we have pretty sophisticated noise suppression algorithms, the de-reverberation is still an unsolved problem due to the difficulties in estimation and keeping track of the changes in the room response model. Sound capturing with microphone arrays provides partial de-reverberation and ambient noise reduction due to the better directivity. This improves the speech recognition results, but still WER is higher than with a close-talk microphone. This talk will present the results from the summer internship of Daniel Allred in MSR and is a follow-up to research done last summer. A full implementation of the dereverberation suppression algorithm designed last summer has been tested on actual room environments. We will present the possible improvements that can be achieved using our algorithm with proper parameter estimation. We also performed some preliminary studies into the human perception of various reverberation conditions and the use of our algorithm to alleviate those conditions. Results of comparative MOS tests will be shown, and an evaluation of what these results mean for future research in dereverberation algorithms for real-time communications channels will follow.
автотехномузыкадетское