A multi-channel fusion framework for audio event detection

H. Phan, M. Maass, L. Hertel, R. Mazur, A. Mertins


We propose in this paper a simple, yet efficient multi-channel fusion framework for joint acoustic event detection and classification. The joint problem on individual channels is posed as a regression problem to estimate event onset and offset positions. As an intermediate result, we also obtain the posterior probabilities which measure the confidence that event onsets and offsets are present at a temporal position. It facilitates the fusion problem by accumulating the posterior probabilities of different channels. The detection hypotheses are then determined based on the summed posterior probabilities. While the proposed fusion framework appears to be simple and natural, it significantly outperforms all the single-channel baseline systems on the ITC-Irst database. We also show that adding channels one by one into the fusion system yields performance improvements, and the performance of the fusion system is always better than those of the individual-channel counterparts.
Original languageEnglish
Title of host publication2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Number of pages5
Publication date01.10.2015
Article number7336889
ISBN (Print)978-1-4799-7449-8
ISBN (Electronic)978-1-4799-7450-4
Publication statusPublished - 01.10.2015
EventIEEE Workshop on Applications of Signal Processing to Audio and Acoustics 2015 - New Paltz, United States
Duration: 15.10.201521.10.2015
Conference number: 118380


Dive into the research topics of 'A multi-channel fusion framework for audio event detection'. Together they form a unique fingerprint.

Cite this