Invariant Integration Features Combined with Speaker-Adaptation Methods

Florian Müller, Alfred Mertins

Abstract

Speaker-normalization and -adaptation methods are essential components of state-of-the-art speech recognition systems nowadays. Recently, so-called invariant integration features were presented which are motivated by the theory of invariants. While it was shown that the integration features outperform MFCCs when used with a basic monophone recognition system, it was left open, if their benefits still can be observed when a more sophisticated recognition system with speaker-normalization and/or speaker-adaptation components is used. This work investigates the combination of the integration features with standard speaker-normalization and -adaptation methods. We show that the integration features benefit from adaptation methods and significantly outperform MFCCs in matching, as well as in mismatching training-test conditions.
OriginalspracheEnglisch
Seiten2622-2625
Seitenumfang4
PublikationsstatusVeröffentlicht - 01.09.2010
Veranstaltung11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All - Makuhari, Japan
Dauer: 26.09.201030.09.2010
Konferenznummer: 85334

Tagung, Konferenz, Kongress

Tagung, Konferenz, Kongress11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All
Kurztitel INTERSPEECH 2010
Land/GebietJapan
OrtMakuhari
Zeitraum26.09.1030.09.10

Fingerprint

Untersuchen Sie die Forschungsthemen von „Invariant Integration Features Combined with Speaker-Adaptation Methods“. Zusammen bilden sie einen einzigartigen Fingerprint.

Zitieren