Underdetermined reverberant acoustic source separation using weighted full-rank nonnegative tensor models

Ahmed Al-Theme, W. L. Woo, S. S. Dlay, Bin Gao

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)
40 Downloads (Pure)

Abstract

In this paper, a fusion of K models of full-rank weighted nonnegative tensor factor two-dimensional deconvolution (K-wNTF2D) is proposed to separate the acoustic sources that have been mixed in an underdetermined reverberant environment. The model is adapted in an unsupervised manner under the hybrid framework of the generalized expectation maximization and multiplicative update algorithms. The derivation of the algorithm and the development of proposed full-rank K-wNTF2D will be shown. The algorithm also encodes a set of variable sparsity parameters derived from Gibbs distribution into the K-wNTF2D model. This optimizes each sub-model in K-wNTF2D with the required sparsity to model the time-varying variances of the sources in the spectrogram. In addition, an initialization method is proposed to initialize the parameters in the K-wNTF2D. Experimental results on the underdetermined reverberant mixing environment have shown that the proposed algorithm is effective at separating the mixture with an average signal-to-distortion ratio of 3 dB.

Original languageEnglish
Pages (from-to)3411-3426
Number of pages16
JournalJournal of the Acoustical Society of America
Volume138
Issue number6
Early online date2 Dec 2015
DOIs
Publication statusPublished - Dec 2015

Fingerprint

Dive into the research topics of 'Underdetermined reverberant acoustic source separation using weighted full-rank nonnegative tensor models'. Together they form a unique fingerprint.

Cite this