Evaluation of emerging frequency domain convolutive blind source separation algorithms based on real room recordings

Naqvi, Mohsen; Zhang, Yonggang; Tsalaile, Thato K.; Sanei, Saeid; Chambers, Jonathon

Chambers 4.pdf (496.79 kB)

Evaluation of emerging frequency domain convolutive blind source separation algorithms based on real room recordings

conference contribution

posted on 2009-11-30, 13:43 authored by Mohsen Naqvi, Yonggang Zhang, Thato K. Tsalaile, Saeid Sanei, Jonathon Chambers

This paper presents a comparative study of three of the emerging frequency domain convolutive blind source separation (FDCBSS) techniques i.e. convolutive blind separation of non-stationary sources due to Parra and Spence, penalty function-based joint diagonalization approach for convolutive blind separation of nonstationary sources due to Wang et al. and a geometrically constrained multimodal approach for convolutive blind source separation due to Sanei et al. Objective evaluation is performed on the basis of signal to interference ratio (SIR), performance index (PI) and solution to the permutation problem. The results confirm that a multimodal approach is necessary to properly mitigate the permutation in BSS and ultimately to solve the cocktail party problem. In other words, it is to make BSS semiblind by exploiting prior geometrical information, and thereby providing the framework to find robust solutions for more challenging source separation with moving speakers.

History

School

Mechanical, Electrical and Manufacturing Engineering

Citation

NAQVI, S.M. ... et al, 2008. Evaluation of emerging frequency domain convolutive blind source separation algorithms based on real room recordings. IN: Proceedings of the 5th IEEE Sensor Array and Multichannel Signal Processing Workshop. SAM 2008, Darmstadt, Germany, 21-23 July 2008, pp. 345-348

Publisher

Version

VoR (Version of Record)

Publication date

2008

Notes

This is a conference paper [© IEEE]. It is also available from: http://ieeexplore.ieee.org/ Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.