Analysis by synthesis spatial audio coding

Elfitri, Ikhwana; Shi, Xiyu; Kondoz, Ahmet

SPR-2013-0015-PROOF.pdf (813.75 kB)

Analysis by synthesis spatial audio coding

journal contribution

posted on 2018-10-11, 15:44 authored by Ikhwana Elfitri, Xiyu ShiXiyu Shi, Ahmet Kondoz

This study presents a novel spatial audio coding (SAC) technique, called analysis by synthesis SAC (AbS-SAC), with a capability of minimising signal distortion introduced during the encoding processes. The reverse one-to-two (R-OTT), a module applied in the MPEG Surround to down-mix two channels as a single channel, is first configured as a closed-loop system. This closed-loop module offers a capability to reduce the quantisation errors of the spatial parameters, leading to an improved quality of the synthesised audio signals. Moreover, a sub-optimal AbS optimisation, based on the closed-loop R-OTT module, is proposed. This algorithm addresses a problem of practicality in implementing an optimal AbS optimisation while it is still capable of improving further the quality of the reconstructed audio signals. In terms of algorithm complexity, the proposed sub-optimal algorithm provides scalability. The results of objective and subjective tests are presented. It is shown that significant improvement of the objective performance, when compared to the conventional open-loop approach, is achieved. On the other hand, subjective test show that the proposed technique achieves higher subjective difference grade scores than the tested advanced audio coding multichannel.

Funding

This work was supported by the ROMEO project (grant number: 287896), which was funded by the EC FP7 ICT collaborative research programme.

History

School

Loughborough University London

Published in

IET Signal Processing

Volume

8

Issue

1

Pages

30 - 38

Citation

ELFITRI, I., SHI, X. and KONDOZ, A., 2014. Analysis by synthesis spatial audio coding. IET Signal Processing, 8 (1), pp.30-38.

Publisher

Version

AM (Accepted Manuscript)

Publisher statement

This work is made available according to the conditions of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) licence. Full details of this licence are available at: https://creativecommons.org/licenses/by-nc-nd/4.0/

Acceptance date

2013-05-09

Publication date

2014-02-01

Notes

This paper is a postprint of a paper submitted to and accepted for publication in IET Signal Processing and is subject to Institution of Engineering and Technology Copyright. The copy of record is available at the IET Digital Library.

DOI

https://doi.org/10.1049/iet-spr.2013.0015

ISSN

1751-9675

eISSN

1751-9683

Publisher version

https://doi.org/10.1049/iet-spr.2013.0015

Language

en

Administrator link

https://repository.lboro.ac.uk/account/articles/9462896

Usage metrics

Keywords

untagged

Licence

CC BY-NC-ND 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Analysis by synthesis spatial audio coding

Funding

This work was supported by the ROMEO project (grant number: 287896), which was funded by the EC FP7 ICT collaborative research programme.

History

School

Published in

Volume

Issue

Pages

Citation

Publisher

Version

Publisher statement

Acceptance date

Publication date

Notes

DOI

ISSN

eISSN

Publisher version

Language

Administrator link

Usage metrics

Categories

Keywords

Licence

Exports