A multi-channel fusion framework for audio event detection

H. Phan, M. Maass, L. Hertel, R. Mazur, A. Mertins

Abstract

We propose in this paper a simple, yet efficient multi-channel fusion framework for joint acoustic event detection and classification. The joint problem on individual channels is posed as a regression problem to estimate event onset and offset positions. As an intermediate result, we also obtain the posterior probabilities which measure the confidence that event onsets and offsets are present at a temporal position. It facilitates the fusion problem by accumulating the posterior probabilities of different channels. The detection hypotheses are then determined based on the summed posterior probabilities. While the proposed fusion framework appears to be simple and natural, it significantly outperforms all the single-channel baseline systems on the ITC-Irst database. We also show that adding channels one by one into the fusion system yields performance improvements, and the performance of the fusion system is always better than those of the individual-channel counterparts.
Original languageEnglish
Title of host publication2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Number of pages5
PublisherIEEE
Publication date01.10.2015
Pages1-5
Article number7336889
ISBN (Print)978-1-4799-7449-8
ISBN (Electronic)978-1-4799-7450-4
DOIs
Publication statusPublished - 01.10.2015
EventIEEE Workshop on Applications of Signal Processing to Audio and Acoustics 2015 - New Paltz, United States
Duration: 15.10.201521.10.2015
Conference number: 118380

Fingerprint

Dive into the research topics of 'A multi-channel fusion framework for audio event detection'. Together they form a unique fingerprint.

Cite this