2012 USpanAnEfficientAlgorithmforMin

From GM-RKB

Jump to navigation Jump to search

(Yin et al., 2012) ⇒ Junfu Yin, Zhigang Zheng, and Longbing Cao. (2012). “USpan: An Efficient Algorithm for Mining High Utility Sequential Patterns.” In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2012). ISBN:978-1-4503-1462-6 doi:10.1145/2339530.2339636

Subject Headings: Sequential Pattern Mining.

Notes

Cited By

Quotes

Author Keywords

Data mining; high utility sequential pattern mining; sequential pattern mining

Abstract

Sequential pattern mining plays an important role in many applications, such as bioinformatics and consumer behavior analysis. However, the classic frequency-based framework often leads to many patterns being identified, most of which are not informative enough for business decision-making. In frequent pattern mining, a recent effort has been to incorporate utility into the pattern selection framework, so that high utility (frequent or infrequent) patterns are mined which address typical business concerns such as dollar value associated with each pattern. In this paper, we incorporate utility into sequential pattern mining, and a generic framework for high utility sequence mining is defined. An efficient algorithm, USpan, is presented to mine for high utility sequential patterns. In USpan, we introduce the lexicographic quantitative sequence tree to extract the complete set of high utility sequences and design concatenation mechanisms for calculating the utility of a node and its children with two effective pruning strategies. Substantial experiments on both synthetic and real datasets show that USpan efficiently identifies high utility sequences from large scale data with very low minimum utility.

References

;

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2012 USpanAnEfficientAlgorithmforMin	Longbing Cao Junfu Yin Zhigang Zheng			USpan: An Efficient Algorithm for Mining High Utility Sequential Patterns				10.1145/2339530.2339636		2012

Retrieved from "http://www.gabormelli.com/RKB/index.php?title=2012_USpanAnEfficientAlgorithmforMin&oldid=850002"

Facts

... more about "2012 USpanAnEfficientAlgorithmforMin"

Junfu Yin +, Zhigang Zheng + and Longbing Cao +

10.1145/2339530.2339636 +

Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining +

USpan: An Efficient Algorithm for Mining High Utility Sequential Patterns +

2012 +