Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC

Li, Xiongwen; Edirisinghe, Eran; Bez, Helmut

Eran8.pdf (327.59 kB)

Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC

conference contribution

posted on 2010-07-16, 10:11 authored by Xiongwen Li, Eran Edirisinghe, Helmut Bez

The use of shape-adaptive transforms is a popular approach for coding arbitrarily shaped objects in image/video coding due to their adaptability at object edges and low complexity. In this respect shape adaptive DCT (SA-DCT) and shape adaptive DWT (SA-DWT) have been proposed in previous literature. The Integer Transform (IT), a derivative of the 4x4 DCT, has been adopted in the latest H.264/AVC standard for coding image blocks in residual data (texture). The associated integer arithmetic guarantees fast and accurate coding/decoding. In this paper, we propose a novel Shape Adaptive Integer Transform (SA-IT) which can be effectively used in future for enabling arbitrary shaped object coding in H.264. Though Integer Transforms are a derivative of 4x4 DCTs, in H.264, to maintain integer arithmetic capability, the post-and pre-scaling factors of transform process are integrated into the forward and inverse quantiser stages respectively for reducing the total number of multiplications and avoiding the loss of accuracy. Thus SA-IT considerably differs from SA-DCT and calls for novel design and implementation considerations based on combining those merits of both SA-DCT and IT. We provide theoretical proofs and support them with experimental justifications.

History

School

Science

Department

Computer Science

Citation

LI, X., EDIRISINGHE, E.A.and BEZ, H.E., 2006. Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC. IN: Apostolopoulos, J.G. and Said, A. (eds.) Visual Communications and Image Processing 2006, Proc. of SPIE-IS&T Electronic Imaging, 6077, 60770C, 11pp.

Publisher

Version

VoR (Version of Record)

Publication date

2006

Notes

Copyright 2006 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic electronic or print reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited. This paper can also be found at: http://dx.doi.org/10.1117/12.642320