Fine-grain scalable audio coding based on envelope restoration and the SPIHT algorithm

H. Hansen, S. Strahl, A. Mertins


We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set partitioning in hierarchical trees (SPIHT) algorithm. Weighting factors and envelope parameters are transmitted under careful consideration of the amount of side information. For low bitrates, where the number of actually transmitted waveform coefficients is low, missing coefficients are shaped w.r.t. the spectral envelope. In our approach, the envelope information is transmitted in form of band-wise values of the l1-norm. Sets of standardized audio files as well as various audio data of contemporary music are encoded and the results are analyzed with objective measures of perceptual quality. The proposed coding scheme competes in perceptual quality with existing state-of-the-art fixed bitrate coders such as MPEG-2/4 AAC. For low bitrates, the proposed embedded coding envelope restoration (ECER) improves the perceptual audio quality notably.
Original languageEnglish
Title of host publication2009 16th International Conference on Digital Signal Processing
Number of pages5
Publication date01.07.2009
Article number5201096
ISBN (Print)978-1-4244-3297-4
ISBN (Electronic)978-1-4244-3298-1
Publication statusPublished - 01.07.2009
Event16th International Conference on Digital Signal Processing - Santorini, Greece
Duration: 05.07.200907.07.2009
Conference number: 78362


Dive into the research topics of 'Fine-grain scalable audio coding based on envelope restoration and the SPIHT algorithm'. Together they form a unique fingerprint.

Cite this