Feature extraction with a multiscale modulation analysis for robust automatic speech recognition

F. Müller, A. Mertins

Abstract

In this work we present a new feature extraction method that is robust against the effects of varying vocal tract lengths. The principle of the method is based on invariant integration and makes use of a modulation filtering approach, similar to the recently proposed scattering transform. In particular, we show how the transform can be used to obtain features that are robust against variations of the vocal tract length. Phoneme recognition experiments show clearly increased robustness in the case of mismatched average vocal tract lengths.
Original language: English
Title of host publication: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing
Number of pages: 5
Publisher: IEEE
Publication date: 1 May 2013
Pages: 7427-7431
Article number: 6639106
ISBN (electronic): 978-1-4799-0356-6
Publication status: Published - 1 May 2013
Event: 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing - Vancouver, Canada
Duration: 26 May 2013 to 31 May 2013
Conference number: 101421
