Projekte pro Jahr
Abstract
Speaker-normalization and -adaptation methods are essential components of state-of-the-art speech recognition systems nowadays. Recently, so-called invariant integration features were presented which are motivated by the theory of invariants. While it was shown that the integration features outperform MFCCs when used with a basic monophone recognition system, it was left open, if their benefits still can be observed when a more sophisticated recognition system with speaker-normalization and/or speaker-adaptation components is used. This work investigates the combination of the integration features with standard speaker-normalization and -adaptation methods. We show that the integration features benefit from adaptation methods and significantly outperform MFCCs in matching, as well as in mismatching training-test conditions.
Originalsprache | Englisch |
---|---|
Seiten | 2622-2625 |
Seitenumfang | 4 |
Publikationsstatus | Veröffentlicht - 01.09.2010 |
Veranstaltung | 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All - Makuhari, Japan Dauer: 26.09.2010 → 30.09.2010 Konferenznummer: 85334 |
Tagung, Konferenz, Kongress
Tagung, Konferenz, Kongress | 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All |
---|---|
Kurztitel | INTERSPEECH 2010 |
Land/Gebiet | Japan |
Ort | Makuhari |
Zeitraum | 26.09.10 → 30.09.10 |
Fingerprint
Untersuchen Sie die Forschungsthemen von „Invariant Integration Features Combined with Speaker-Adaptation Methods“. Zusammen bilden sie einen einzigartigen Fingerprint.Projekte
- 1 Abgeschlossen
-
Invariante Merkmale für die automatische Spracherkennung
01.01.07 → 31.12.11
Projekt: DFG-Projekte › DFG Einzelförderungen