Multi-Dimensional I-Vector Closed Set Speaker Identification based on an Extreme Learning Machine with and without Fusion Technologies

Musab T. S. Al-Kaltakchi, Wai L. Woo, Satnam S. Dlay, Jonathon A. Chambers

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

In this article, I-vector Speaker Identification (SID) is exploited as a compact, low dimension, fixed length and modern state of the art system. The main structures for this study consist of four combinations of features which depend on Power Normalization Cepstral Coefficient (PNCC) and Mel Frequency Cepstral Coefficient (MFCC) features, with two different compensation approaches which have been previously proposed. The main system is modelled by I-vectors with low dimensions, and we also propose fusion strategies with different higher I-vector dimensions to improve the recognition rate. In addition, cumulative, concatenated, and interleaved fusion techniques are investigated to improve the conventional late fusion presented in our previous work. Moreover, the proposed system employs an Extreme Learning Machine (ELM) for classification purpose, which is efficient, less complex and less time consuming compared with traditional neural network based approaches. The system is evaluated on the TIMIT database for clean and AWGN environments and achieved a recognition rate of 96.67% and 80.83% respectively. The system shows improvements compared with the Gaussian Mixture Model-Universal Background Model (GMM-UBM) in our previously proposed scheme, with an improvement of 1.76% in clean speech and 2.1% for 30dB AWGN and with the highest improvement at 10dB with 43.81%.
Original languageEnglish
Title of host publication2017 Intelligent Systems Conference (IntelliSys)
PublisherIEEE
Pages1141-1146
Number of pages6
ISBN (Electronic)9781509064359
ISBN (Print)9781509064366
DOIs
Publication statusPublished - 26 Mar 2018
Event2017 Intelligent Systems Conference, IntelliSys 2017 - London, United Kingdom
Duration: 7 Sept 20178 Sept 2017

Conference

Conference2017 Intelligent Systems Conference, IntelliSys 2017
Country/TerritoryUnited Kingdom
CityLondon
Period7/09/178/09/17

Keywords

  • Multi-dimensional I-vectors
  • extreme learning machine
  • fusion technologies

Fingerprint

Dive into the research topics of 'Multi-Dimensional I-Vector Closed Set Speaker Identification based on an Extreme Learning Machine with and without Fusion Technologies'. Together they form a unique fingerprint.

Cite this