Feature extraction with a multiscale modulation analysis for robust automatic speech recognition

F. Müller, A. Mertins

Abstract

In this work we present a new feature extraction method that is robust against the effects of varying vocal tract lengths. The principle of the method is based on invariant integration and makes use of a modulation filtering approach, similar to the recently proposed scattering transform. In particular, we show how the transform can be used to obtain features that are robust against variations of the vocal tract length. Phoneme recognition experiments show a clearly increased robustness in case of mismatching average vocal tract lengths.
Original languageEnglish
Title of host publication2013 IEEE International Conference on Acoustics, Speech and Signal Processing
Number of pages5
PublisherIEEE
Publication date01.05.2013
Pages7427-7431
Article number6639106
ISBN (Electronic)978-1-4799-0356-6
DOIs
Publication statusPublished - 01.05.2013
Event2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing - Vancouver, Canada
Duration: 26.05.201331.05.2013
Conference number: 101421

Fingerprint

Dive into the research topics of 'Feature extraction with a multiscale modulation analysis for robust automatic speech recognition'. Together they form a unique fingerprint.

Cite this