An adaptive ensemble classifier for mining concept drifting data streams

Dewan Farid, Li Zhang, Alamgir Hossain, Chowdhury Rahman, Rebecca Strachan, Graham Sexton, Keshav Dahal

Research output: Contribution to journalArticlepeer-review

143 Citations (Scopus)

Abstract

It is challenging to use traditional data mining techniques to deal with real-time data stream classifications. Existing mining classifiers need to be updated frequently to adapt to the changes in data streams. To address this issue, in this paper we propose an adaptive ensemble approach for classification and novel class detection in concept drifting data streams. The proposed approach uses traditional mining classifiers and updates the ensemble model automatically so that it represents the most recent concepts in data streams. For novel class detection we consider the idea that data points belonging to the same class should be closer to each other and should be far apart from the data points belonging to other classes. If a data point is well separated from the existing data clusters, it is identified as a novel class instance. We tested the performance of this proposed stream classification model against that of existing mining algorithms using real benchmark datasets from UCI (University of California, Irvine) machine learning repository. The experimental results prove that our approach shows great flexibility and robustness in novel class detection in concept drifting and outperforms traditional classification models in challenging real-life data stream applications.
Original languageEnglish
Pages (from-to)5895-5906
JournalExpert Systems with Applications
Volume40
Issue number15
DOIs
Publication statusPublished - 2013

Keywords

  • Adaptive ensembles
  • concept drift
  • clustering
  • data streams
  • decision trees
  • novel classes

Fingerprint

Dive into the research topics of 'An adaptive ensemble classifier for mining concept drifting data streams'. Together they form a unique fingerprint.

Cite this