Abstract
We propose in this paper a simple, yet efficient multi-channel fusion framework for joint acoustic event detection and classification. The joint problem on individual channels is posed as a regression problem to estimate event onset and offset positions. As an intermediate result, we also obtain the posterior probabilities which measure the confidence that event onsets and offsets are present at a temporal position. It facilitates the fusion problem by accumulating the posterior probabilities of different channels. The detection hypotheses are then determined based on the summed posterior probabilities. While the proposed fusion framework appears to be simple and natural, it significantly outperforms all the single-channel baseline systems on the ITC-Irst database. We also show that adding channels one by one into the fusion system yields performance improvements, and the performance of the fusion system is always better than those of the individual-channel counterparts.
Originalsprache | Englisch |
---|---|
Titel | 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) |
Seitenumfang | 5 |
Herausgeber (Verlag) | IEEE |
Erscheinungsdatum | 01.10.2015 |
Seiten | 1-5 |
Aufsatznummer | 7336889 |
ISBN (Print) | 978-1-4799-7449-8 |
ISBN (elektronisch) | 978-1-4799-7450-4 |
DOIs | |
Publikationsstatus | Veröffentlicht - 01.10.2015 |
Veranstaltung | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics 2015 - New Paltz, USA / Vereinigte Staaten Dauer: 15.10.2015 → 21.10.2015 Konferenznummer: 118380 |