Final intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact

112
Опубликовано 27 августа 2024, 15:41
Speaker: Benjamin Stahl
Host: Hannes Gamper

In this talk, we explore advancements in computational models for speech quality assessment. Self-supervised learning models have emerged as powerful front-ends, outperforming supervised-only models. However, their large size renders them impractical for production tasks. We discuss strategies to distill self-supervised learning-based models into more compact forms using unlabeled data, achieving significant size reduction while maintaining an advantage over supervised-only models.

See more at microsoft.com/en-us/research/v...
автотехномузыкадетское