TY - GEN
T1 - Fine-grain scalable audio coding based on envelope restoration and the SPIHT algorithm
AU - Hansen, H.
AU - Strahl, S.
AU - Mertins, A.
N1 - Conference code: 78362
PY - 2009/7/1
Y1 - 2009/7/1
N2 - We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set partitioning in hierarchical trees (SPIHT) algorithm. Weighting factors and envelope parameters are transmitted under careful consideration of the amount of side information. For low bitrates, where the number of actually transmitted waveform coefficients is low, missing coefficients are shaped w.r.t. the spectral envelope. In our approach, the envelope information is transmitted in form of band-wise values of the l1-norm. Sets of standardized audio files as well as various audio data of contemporary music are encoded and the results are analyzed with objective measures of perceptual quality. The proposed coding scheme competes in perceptual quality with existing state-of-the-art fixed bitrate coders such as MPEG-2/4 AAC. For low bitrates, the proposed embedded coding envelope restoration (ECER) improves the perceptual audio quality notably.
AB - We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set partitioning in hierarchical trees (SPIHT) algorithm. Weighting factors and envelope parameters are transmitted under careful consideration of the amount of side information. For low bitrates, where the number of actually transmitted waveform coefficients is low, missing coefficients are shaped w.r.t. the spectral envelope. In our approach, the envelope information is transmitted in form of band-wise values of the l1-norm. Sets of standardized audio files as well as various audio data of contemporary music are encoded and the results are analyzed with objective measures of perceptual quality. The proposed coding scheme competes in perceptual quality with existing state-of-the-art fixed bitrate coders such as MPEG-2/4 AAC. For low bitrates, the proposed embedded coding envelope restoration (ECER) improves the perceptual audio quality notably.
UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-70449602251&origin=inward&txGid=8d03923c614903d09bb48cf529809bc9
U2 - 10.1109/ICDSP.2009.5201096
DO - 10.1109/ICDSP.2009.5201096
M3 - Conference contribution
SN - 978-1-4244-3297-4
T3 - International Conference on Digital Signal Processing, Proceedings
SP - 1
EP - 5
BT - 2009 16th International Conference on Digital Signal Processing
PB - IEEE
T2 - 16th International Conference on Digital Signal Processing
Y2 - 5 July 2009 through 7 July 2009
ER -