A geometrically constrained multimodal approach for convolutive blind source separation
Conference contribution posted on 2009-12-10, 10:08, authored by Saeid Sanei, Syed M.R. Naqvi, Jonathon Chambers, Yulia Hicks
A novel constrained multimodal approach for convolutive blind source separation (BSS) is presented. It incorporates into the separation algorithm video information about the geometrical positions of the speakers and the microphones, and about the directionality of the speakers. The separation is performed in the frequency domain, and the constraints are incorporated through a penalty function-based formulation. The separation results show a considerable improvement over traditional frequency-domain convolutive BSS systems such as that developed by Parra and Spence. Importantly, the permutation problem inherent in frequency-domain BSS is potentially solved.
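The permutation problem mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' algorithm: it only shows, under idealised assumptions (two sources, a known invertible mixing matrix in each frequency bin, no noise), that an unmixing matrix is determined only up to the ordering of its outputs, so the ordering can differ from bin to bin. All names here (`S`, `H`, `W_ideal`, `W_perm`, `swapped_bins`) are hypothetical and chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_bins, n_src, n_frames = 4, 2, 100

# Hypothetical source spectra and mixing matrices, one set per frequency bin.
# In the frequency domain a convolutive mixture becomes X[k] = H[k] @ S[k].
S = rng.standard_normal((n_bins, n_src, n_frames)) \
    + 1j * rng.standard_normal((n_bins, n_src, n_frames))
H = rng.standard_normal((n_bins, n_src, n_src)) \
    + 1j * rng.standard_normal((n_bins, n_src, n_src))
X = H @ S

# Idealised unmixing: the exact inverse of the mixing matrix in every bin.
W_ideal = np.linalg.inv(H)

# An equally valid separating solution: the same matrices, but with the
# output order swapped in the odd-numbered bins.
P = np.array([[0, 1], [1, 0]])
W_perm = W_ideal.copy()
W_perm[1::2] = P @ W_ideal[1::2]

Y_ideal = W_ideal @ X
Y_perm = W_perm @ X

# Consistent ordering recovers the sources in every bin.
aligned = np.allclose(Y_ideal, S)

# With W_perm, the odd bins return the sources in swapped order.
swapped_bins = [k for k in range(n_bins)
                if np.allclose(Y_perm[k], S[k][::-1])]
print(aligned, swapped_bins)
```

Both `W_ideal` and `W_perm` separate the sources within each bin, so a criterion based on independence alone cannot tell them apart. Extra information, such as the geometric constraints described in the abstract, is one way to select a consistent ordering across bins.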
- Mechanical, Electrical and Manufacturing Engineering
Citation: SANEI, S. ... et al., 2007. A geometrically constrained multimodal approach for convolutive blind source separation. IN: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Honolulu, HI, 15-20 April, pp. III-969 - III-972
- VoR (Version of Record)
Notes: This is a conference paper [© IEEE]. It is also available at: http://ieeexplore.ieee.org/ Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.