< Back to previous page

Publication

Mining top-k quantile-based cohesive sequential patterns

Book Contribution - Book Chapter Conference Contribution

Finding patterns in long event sequences is an important data mining task. Two decades ago research focused on finding all frequent patterns, where the anti-monotonic property of support was used to design efficient algorithms. Recent research focuses on producing a smaller output containing only the most interesting patterns. To achieve this goal, we introduce a new interestingness measure by computing the proportion of the occurrences of a pattern that are cohesive. This measure is robust to outliers, and is applicable to sequential patterns. We implement an efficient algorithm based on constrained prefix-projected pattern growth and pruning based on an upper bound to uncover the set of top-k quantile-based cohesive sequential patterns. We run experiments to compare our method with existing state-of-the-art methods for sequential pattern mining and show that our algorithm is efficient and produces qualitatively interesting patterns on large event sequences.
Book: Proceedings of the 2018 SIAM International Conference on Data Mining, May 3-5, 2018, San Diego, CA, USA
Pages: 90 - 98
ISBN:978-1-61197-532-1
Publication year:2018
Keywords:Computer. Automation
Accessibility:Closed