Eran8.pdf (327.59 kB)

Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC

Download (327.59 kB)
conference contribution
posted on 16.07.2010 by Xiongwen Li, Eran Edirisinghe, Helmut Bez
The use of shape-adaptive transforms is a popular approach for coding arbitrarily shaped objects in image/video coding due to their adaptability at object edges and low complexity. In this respect shape adaptive DCT (SA-DCT) and shape adaptive DWT (SA-DWT) have been proposed in previous literature. The Integer Transform (IT), a derivative of the 4x4 DCT, has been adopted in the latest H.264/AVC standard for coding image blocks in residual data (texture). The associated integer arithmetic guarantees fast and accurate coding/decoding. In this paper, we propose a novel Shape Adaptive Integer Transform (SA-IT) which can be effectively used in future for enabling arbitrary shaped object coding in H.264. Though Integer Transforms are a derivative of 4x4 DCTs, in H.264, to maintain integer arithmetic capability, the post-and pre-scaling factors of transform process are integrated into the forward and inverse quantiser stages respectively for reducing the total number of multiplications and avoiding the loss of accuracy. Thus SA-IT considerably differs from SA-DCT and calls for novel design and implementation considerations based on combining those merits of both SA-DCT and IT. We provide theoretical proofs and support them with experimental justifications.

History

School

  • Science

Department

  • Computer Science

Citation

LI, X., EDIRISINGHE, E.A.and BEZ, H.E., 2006. Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC. IN: Apostolopoulos, J.G. and Said, A. (eds.) Visual Communications and Image Processing 2006, Proc. of SPIE-IS&T Electronic Imaging, 6077, 60770C, 11pp.

Publisher

© 2006 SPIE

Version

VoR (Version of Record)

Publication date

2006

Notes

Copyright 2006 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic electronic or print reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited. This paper can also be found at: http://dx.doi.org/10.1117/12.642320

Language

en

Exports