In this work we present a new feature extraction method that is robust against the effects of varying vocal tract lengths. The principle of the method is based on invariant integration and makes use of a modulation filtering approach, similar to the recently proposed scattering transform. In particular, we show how the transform can be used to obtain features that are robust against variations of the vocal tract length. Phoneme recognition experiments show a clearly increased robustness in case of mismatching average vocal tract lengths.
|Title of host publication||2013 IEEE International Conference on Acoustics, Speech and Signal Processing|
|Number of pages||5|
|Publication status||Published - 01.05.2013|
|Event||2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing - Vancouver, Canada|
Duration: 26.05.2013 → 31.05.2013
Conference number: 101421