TY - GEN
T1 - On probabilistic models for uncertain sequential pattern mining
AU - Muzammal, Muhammad
AU - Raman, Rajeev
PY - 2010
Y1 - 2010
N2 - We study uncertainty models in sequential pattern mining. We consider situations where there is uncertainty either about a source or an event. We show that both these types of uncertainties could be modelled using probabilistic databases, and give possible-worlds semantics for both. We then describe "interestingness" criteria based on two notions of frequentness (previously studied for frequent itemset mining) namely expected support [C. Aggarwal et al. KDD'09;Chui et al., PAKDD'07,'08] and probabilistic frequentness [Bernecker et al., KDD'09]. We study the interestingness criteria from a complexity-theoretic perspective, and show that in case of source-level uncertainty, evaluating probabilistic frequentness is #P-complete, and thus no polynomial time algorithms are likely to exist, but evaluate the interestingness predicate in polynomial time in the remaining cases.
AB - We study uncertainty models in sequential pattern mining. We consider situations where there is uncertainty either about a source or an event. We show that both these types of uncertainties could be modelled using probabilistic databases, and give possible-worlds semantics for both. We then describe "interestingness" criteria based on two notions of frequentness (previously studied for frequent itemset mining) namely expected support [C. Aggarwal et al. KDD'09;Chui et al., PAKDD'07,'08] and probabilistic frequentness [Bernecker et al., KDD'09]. We study the interestingness criteria from a complexity-theoretic perspective, and show that in case of source-level uncertainty, evaluating probabilistic frequentness is #P-complete, and thus no polynomial time algorithms are likely to exist, but evaluate the interestingness predicate in polynomial time in the remaining cases.
KW - Mining Uncertain Data
KW - Novel Algorithms for Mining
KW - Probabilistic Databases
KW - Sequential Pattern Mining
KW - Theoretical Foundations of Data Mining
UR - http://www.scopus.com/inward/record.url?scp=78650184230&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-17316-5_6
DO - 10.1007/978-3-642-17316-5_6
M3 - Conference contribution
AN - SCOPUS:78650184230
SN - 3642173152
SN - 9783642173158
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 60
EP - 72
BT - Advanced Data Mining and Applications - 6th International Conference, ADMA 2010, Proceedings
T2 - 6th International Conference on Advanced Data Mining and Applications, ADMA 2010
Y2 - 19 November 2010 through 21 November 2010
ER -