Abstract
A novel approach to solve the single-channel source separation (SCSS) problem is presented. Most existing supervised SCSS methods resort exclusively to the independence waveform criteria as exemplified by training the prior information before the separation process. This poses a significant limiting factor to the applicability of these methods to real problem. Our proposed method does not require training knowledge for separating the mixture and it is based on decomposing the mixture into a series of oscillatory components termed as the intrinsic mode functions (IMFs). We show, in this paper, that the IMFs have several desirable properties unique to SCSS problem and how these properties can be advantaged to relax the constraints posed by the problem. In addition, we have derived a novel sparse non-negative matrix factorization to estimate the spectral bases and temporal codes of the sources. The proposed algorithm is a more complete and efficient approach to matrix factorization where a generalized criterion for variable sparseness is imposed onto the solution. Experimental testing has been conducted to show that the proposed method gives superior performance over other existing approaches.
Original language | English |
---|---|
Article number | 5570953 |
Pages (from-to) | 961-976 |
Number of pages | 16 |
Journal | IEEE Transactions on Audio, Speech and Language Processing |
Volume | 19 |
Issue number | 4 |
DOIs | |
Publication status | Published - 13 Sept 2010 |
Keywords
- Audio processing
- blind source separation (BSS)
- empirical mode decomposition (EMD)
- non-negative matrix factorization (NMF)
- single-channel source separation (SCSS)
- sparse features