Abstract
The spectral effects of vocal tract length (VTL) changes are one reason why the recognition rate of today's speaker-independent automatic speech recognition (ASR) systems is considerably lower than that of speaker-dependent systems. With certain types of filter banks, these effects can be described as a translation in subband-index space. In this paper, nonlinear translation-invariant transforms that were originally proposed in the field of pattern recognition are investigated for their applicability to speaker-independent ASR tasks. It is shown that combining different types of such transforms leads to features that are more robust against VTL changes than the standard Mel-frequency cepstral coefficients and that almost reach the performance of vocal tract length normalization, without any adaptation to individual speakers.
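The idea of translation invariance in subband-index space can be illustrated with a minimal sketch. It uses the cyclic autocorrelation, one well-known translation-invariant transform, which is not necessarily one of the transforms studied in the paper. The subband values are made up, and the wrap-around shift is a simplifying assumption: a real VTL change would shift the spectrum without wrap-around and would introduce boundary effects.

```python
def cyclic_autocorrelation(x):
    """Return r[k] = sum_i x[i] * x[(i + k) % N] for k = 0..N-1."""
    n = len(x)
    return [sum(x[i] * x[(i + k) % n] for i in range(n)) for k in range(n)]


def cyclic_shift(x, s):
    """Translate x by s positions along the subband index, with wrap-around."""
    n = len(x)
    return [x[(i - s) % n] for i in range(n)]


# Hypothetical subband energies for a single frame (illustrative values only).
frame = [0.1, 0.9, 2.0, 1.2, 0.4, 0.3, 0.7, 0.2]

# Assumption: a VTL change is modeled as a pure cyclic shift by two subbands.
shifted = cyclic_shift(frame, 2)

features = cyclic_autocorrelation(frame)
features_shifted = cyclic_autocorrelation(shifted)

# Largest difference between the two feature vectors; it should be zero up to
# floating-point rounding, even though the input frames differ.
max_difference = max(abs(a - b) for a, b in zip(features, features_shifted))
print(max_difference)
```

The two input frames differ, yet their feature vectors agree up to rounding, which is the property that would make such features insensitive to a shift along the subband index.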
Original language | English |
---|---|
Number of pages | 9 |
Publication status | Published - 01.06.2009 |
Event | NOLISP 2009 : Workshop on Non-Linear Speech Processing - Vic (Barcelona), Spain Duration: 25.06.2009 → 27.06.2009 http://www.wikicfp.com/cfp/servlet/event.showcfp?eventid=3313&copyownerid=2 |
Conference
Conference | NOLISP 2009 : Workshop on Non-Linear Speech Processing |
---|---|
Country/Territory | Spain |
City | Vic (Barcelona) |
Period | 25.06.09 → 27.06.09 |
Fingerprint
Explore the research topics of "Nonlinear translation-invariant transformations for speaker-independent speech recognition". Together they form a unique fingerprint.
Projects
- 1 Finished
- Invariante Merkmale für die automatische Spracherkennung (Invariant Features for Automatic Speech Recognition)
Mertins, A. (Principal Investigator (PI))
01.01.07 → 31.12.11
Project: DFG Projects › DFG Individual Grants