A direct mining approach to efficient constrained graph pattern discovery

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaper; Conference article; Research; peer-review

  • Feida Zhu, Singapore Management University, Singapore
  • Zequn Zhang, University of Science and Technology of China, China
  • Qiang Qu, Denmark
Despite the wealth of research on frequent graph pattern mining, efficiently mining the complete set of frequent patterns that satisfy given constraints remains a major challenge for existing algorithms, mainly because of an inherent bottleneck in the mining paradigm. In essence, mining requests with explicitly specified constraints cannot be handled in a direct and precise way. In this paper, we propose a direct mining framework to solve the problem and illustrate our ideas in the context of a particular type of constrained frequent pattern: the "skinny" patterns, which are graph patterns with a long backbone from which short twigs branch out. These patterns, which we formally define as l-long δ-skinny patterns, can reveal insightful spatial and temporal trajectory patterns in mobile data mining, information diffusion, adoption propagation, and many other applications.

Based on the key concept of a canonical diameter, we develop SkinnyMine, an efficient algorithm that mines all the l-long δ-skinny patterns while guaranteeing both the completeness of the mining result and the unique generation of each target pattern. We also present a general direct mining framework together with two properties, reducibility and continuity, for qualified constraints. Our experiments on both synthetic and real data demonstrate the effectiveness and scalability of our approach.
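
To make the "long backbone, short twigs" idea concrete, the following is a minimal Python sketch of a membership check, not the SkinnyMine algorithm itself. It assumes a working definition that the abstract does not spell out: a connected undirected graph counts as l-long δ-skinny if a diameter path (a longest shortest path) has exactly l edges and every vertex lies within δ hops of that path. The paper's canonical diameter picks one specific diameter path; here the first one found stands in for it. The function names `bfs`, `diameter_path`, and `is_skinny` are hypothetical.

```python
from collections import deque


def bfs(adj, source):
    """Unweighted BFS from source; returns (distance map, parent map)."""
    dist = {source: 0}
    parent = {source: None}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                parent[v] = u
                queue.append(v)
    return dist, parent


def diameter_path(adj):
    """Return one longest shortest path as a list of vertices.

    Assumption: the first such path found stands in for the paper's
    canonical diameter, whose tie-breaking rule is not given here.
    """
    best = None
    for s in sorted(adj):
        dist, parent = bfs(adj, s)
        t = max(sorted(dist), key=dist.get)  # farthest vertex from s
        if best is None or dist[t] > len(best) - 1:
            path, node = [], t
            while node is not None:  # walk parents back to s
                path.append(node)
                node = parent[node]
            best = path[::-1]
    return best


def is_skinny(adj, l, delta):
    """Check the assumed l-long delta-skinny condition on a graph."""
    if not adj:
        return False
    backbone = diameter_path(adj)
    if len(backbone) - 1 != l:
        return False
    # Multi-source BFS: distance of every vertex to the backbone.
    dist = {v: 0 for v in backbone}
    queue = deque(backbone)
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    # All vertices must be reachable and within delta of the backbone.
    return len(dist) == len(adj) and max(dist.values()) <= delta


# Backbone 0-1-2-3-4 with a single one-edge twig (vertex 5) hanging off 2.
example = {0: [1], 1: [0, 2], 2: [1, 3, 5], 3: [2, 4], 4: [3], 5: [2]}
print(is_skinny(example, 4, 1))
```

The check runs one BFS per vertex, which is fine for small patterns but is only meant to illustrate the constraint; it says nothing about how the patterns are enumerated or counted for frequency.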
Original language: English
Journal: Association for Computing Machinery. Special Interest Group on Management of Data. International Conference Proceedings
Pages (from-to): 821-832
Number of pages: 12
ISSN: 0730-8078
Publication status: Published - 1 Jan 2013
Event: ACM SIGMOD International Conference on Management of Data - New York, United States
Duration: 22 Jun 2013 to 27 Jun 2013

Conference

Conference: ACM SIGMOD International Conference on Management of Data
Country: United States
City: New York
Period: 22/06/2013 to 27/06/2013

    Research areas

  • pattern mining, graph mining, skinny pattern, constraint, constrained mining, direct approach, path merge


ID: 166878466