posted on 2010-07-16, 10:11authored byXiongwen Li, Eran Edirisinghe, Helmut Bez
The use of shape-adaptive transforms is a popular approach for coding arbitrarily shaped objects in image/video coding
due to their adaptability at object edges and low complexity. In this respect shape adaptive DCT (SA-DCT) and shape
adaptive DWT (SA-DWT) have been proposed in previous literature. The Integer Transform (IT), a derivative of the
4x4 DCT, has been adopted in the latest H.264/AVC standard for coding image blocks in residual data (texture). The
associated integer arithmetic guarantees fast and accurate coding/decoding. In this paper, we propose a novel Shape
Adaptive Integer Transform (SA-IT) which can be effectively used in future for enabling arbitrary shaped object coding
in H.264. Though Integer Transforms are a derivative of 4x4 DCTs, in H.264, to maintain integer arithmetic capability,
the post-and pre-scaling factors of transform process are integrated into the forward and inverse quantiser stages
respectively for reducing the total number of multiplications and avoiding the loss of accuracy. Thus SA-IT considerably
differs from SA-DCT and calls for novel design and implementation considerations based on combining those merits of
both SA-DCT and IT. We provide theoretical proofs and support them with experimental justifications.
History
School
Science
Department
Computer Science
Citation
LI, X., EDIRISINGHE, E.A.and BEZ, H.E., 2006. Shape adaptive integer transform for coding arbitrarily shaped objects in H.264/AVC. IN: Apostolopoulos, J.G. and Said, A. (eds.) Visual Communications and Image Processing 2006, Proc. of SPIE-IS&T Electronic Imaging, 6077, 60770C, 11pp.
Copyright 2006 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic electronic or print reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited. This paper can also be found at: http://dx.doi.org/10.1117/12.642320