Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines

Thorsten Wesker, Bernd Meyer, Kirsten Wagener, Jörn Anemüller, Alfred Mertins, Birger Kollmeier

Abstract

This paper introduces the new OLdenburg LOgatome speech corpus (OLLO) and outlines design considerations during its creation. OLLO is distinct from previous ASR corpora as it specifically targets (1) the fair comparison between human and machine speech recognition performance, and (2) the realistic representation of intrinsic variabilities in speech that are significant for automatic speech recognition (ASR) systems. To enable an unbiased human-machine comparison, OLLO is designed for recognition of individual phonemes that are embedded in logatomes, specifically, three-phoneme sequences with no semantic information. A balanced set of target-phonemes important for human and automatic speech recognition has been chosen, drawing on pilot ASR studies and cross-fertilization from the field of human speech intelligibility testing. Several intrinsic variabilities in speech are represented in OLLO, by recording from 40 speakers from four German dialect regions, and by covering six articulation characteristics. Results from preliminary phonetic time-labeling and ASR experiments are promising and consistent with corpus variabilities.

Original language: English
Pages: 1273-1276
Number of pages: 4
Publication status: Published - 01.12.2005
Event: 9th European Conference on Speech Communication and Technology, Lisbon, Portugal
Duration: 04.09.2005 - 08.09.2005
Conference number: 67499

Conference

Conference: 9th European Conference on Speech Communication and Technology
Abbreviated title: INTERSPEECH 2005
Country/Territory: Portugal
City: Lisbon
Period: 04.09.05 - 08.09.05
