Abstract
We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set partitioning in hierarchical trees (SPIHT) algorithm. Weighting factors and envelope parameters are transmitted under careful consideration of the amount of side information. For low bitrates, where the number of actually transmitted waveform coefficients is low, missing coefficients are shaped w.r.t. the spectral envelope. In our approach, the envelope information is transmitted in form of band-wise values of the l1-norm. Sets of standardized audio files as well as various audio data of contemporary music are encoded and the results are analyzed with objective measures of perceptual quality. The proposed coding scheme competes in perceptual quality with existing state-of-the-art fixed bitrate coders such as MPEG-2/4 AAC. For low bitrates, the proposed embedded coding envelope restoration (ECER) improves the perceptual audio quality notably.
Original language | English |
---|---|
Title of host publication | 2009 16th International Conference on Digital Signal Processing |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 01.07.2009 |
Pages | 1-5 |
Article number | 5201096 |
ISBN (Print) | 978-1-4244-3297-4 |
ISBN (Electronic) | 978-1-4244-3298-1 |
DOIs | |
Publication status | Published - 01.07.2009 |
Event | 16th International Conference on Digital Signal Processing - Santorini, Greece Duration: 05.07.2009 → 07.07.2009 Conference number: 78362 |