On probabilistic models for uncertain sequential pattern mining

Muhammad Muzammal*, Rajeev Raman

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Citations (Scopus)

Abstract

We study uncertainty models in sequential pattern mining. We consider situations where there is uncertainty either about a source or an event. We show that both these types of uncertainties could be modelled using probabilistic databases, and give possible-worlds semantics for both. We then describe "interestingness" criteria based on two notions of frequentness (previously studied for frequent itemset mining) namely expected support [C. Aggarwal et al. KDD'09;Chui et al., PAKDD'07,'08] and probabilistic frequentness [Bernecker et al., KDD'09]. We study the interestingness criteria from a complexity-theoretic perspective, and show that in case of source-level uncertainty, evaluating probabilistic frequentness is #P-complete, and thus no polynomial time algorithms are likely to exist, but evaluate the interestingness predicate in polynomial time in the remaining cases.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 6th International Conference, ADMA 2010, Proceedings
Pages60-72
Number of pages13
EditionPART 1
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event6th International Conference on Advanced Data Mining and Applications, ADMA 2010 - Chongqing, China
Duration: 19 Nov 201021 Nov 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6440 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Conference on Advanced Data Mining and Applications, ADMA 2010
Country/TerritoryChina
CityChongqing
Period19/11/1021/11/10

Keywords

  • Mining Uncertain Data
  • Novel Algorithms for Mining
  • Probabilistic Databases
  • Sequential Pattern Mining
  • Theoretical Foundations of Data Mining

Cite this