Scattering Invariants for Audio Classification

Published June 21, 2016, 19:06
To obtain efficient feature representations for audio classification, it is desirable to have invariance to time-shift and stability to time-warping. Mel-frequency cepstral coefficients (MFCCs) satisfy these criteria, but are unsuitable for modeling large-scale temporal structure. The scattering transform extends this representation through a convolutional network of wavelet transforms and modulus operators, capturing structures at larger time scales. Additional invariance to frequency transposition with stability to frequency-warping is obtained by applying a second scattering transform along the log-frequency axis. Using these representations, we obtain state-of-the-art results on tasks such as phone segment classification and musical genre classification on the TIMIT and GTZAN datasets, respectively.
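The cascade of wavelet transforms and modulus operators described above can be illustrated with a minimal NumPy sketch. It is not the implementation used in the talk. It rests on several assumptions: the filters are simple Gaussian band-pass filters defined in the Fourier domain, convolutions are circular, and the local low-pass averaging is replaced by a global time average. The names `gabor_bank`, `wavelet_modulus`, and `scattering`, and the parameters `xi0` and `q`, are hypothetical. The second scattering transform along the log-frequency axis is not included.

```python
import numpy as np

def gabor_bank(n, num_filters, xi0=0.4, q=1.0):
    """Gaussian band-pass filters in the Fourier domain (illustrative parameters).

    Center frequencies decrease geometrically: one octave per step when q == 1.
    """
    freqs = np.fft.fftfreq(n)  # cycles per sample, in [-0.5, 0.5)
    bank = []
    for j in range(num_filters):
        xi = xi0 * 2.0 ** (-j / q)   # center frequency of filter j
        sigma = xi / 4.0             # bandwidth proportional to center frequency
        bank.append(np.exp(-0.5 * ((freqs - xi) / sigma) ** 2))
    return np.array(bank)

def wavelet_modulus(x, bank):
    """Return |x * psi_j| for every filter, using circular convolution via the FFT."""
    x_hat = np.fft.fft(x)
    return np.abs(np.fft.ifft(x_hat[None, :] * bank, axis=-1))

def scattering(x, bank):
    """First- and second-order scattering coefficients with global time averaging.

    Assumption: the averaging window covers the whole signal, so the output
    is fully invariant to circular time shifts.
    """
    u1 = wavelet_modulus(x, bank)      # shape (J, n): first-order envelopes
    s1 = u1.mean(axis=-1)              # shape (J,): first-order coefficients
    s2 = []
    for j1 in range(len(bank) - 1):
        # Envelopes vary slowly, so only lower-frequency filters are applied.
        u2 = wavelet_modulus(u1[j1], bank[j1 + 1:])
        s2.extend(u2.mean(axis=-1))
    return s1, np.array(s2)

# Usage: coefficients of a signal and of a circularly shifted copy.
rng = np.random.default_rng(0)
signal = rng.standard_normal(256)
filters = gabor_bank(signal.size, num_filters=4)
s1, s2 = scattering(signal, filters)
s1_shifted, s2_shifted = scattering(np.roll(signal, 37), filters)
print(np.max(np.abs(s1 - s1_shifted)), np.max(np.abs(s2 - s2_shifted)))
```

The first-order coefficients play a role comparable to a mel-frequency spectrum averaged over time, while the second-order coefficients retain amplitude-modulation information that the averaging would otherwise discard. Global averaging is chosen here only to make the shift invariance easy to check; a practical system would average over a finite window.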