Less is More: Non-Redundant Subspace Clustering

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

  • Ira Assent
  • Emmanuel Müller, RWTH Aachen University, Germany
  • Stephan Günnemann, RWTH Aachen University, Germany
  • Ralph Krieger, RWTH Aachen University, Germany
  • Thomas Seidl, RWTH Aachen University, Germany
Clustering is an important data mining task for grouping
similar objects. In high dimensional data, however, eects
attributed to the \curse of dimensionality", render clustering
in high dimensional data meaningless. Due to this, recent
years have seen research on subspace clustering which
searches for clusters in relevant subspace projections of high
dimensional data. As the number of possible subspace projections
is exponential in the number of dimensions, the
number of possible subspace clusters can be overwhelming.
In this position paper, we present our work on identifying
non-redundant, relevant subspace clusters which reduce the
result set to a manageable size. We discuss techniques for
evaluating, visualizing and exploring subspace clusterings,
and propose some directions for future work.
Original languageEnglish
Title of host publication1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010)
Publication year2010
ISBN (print)978-1-4503-0227-2
Publication statusPublished - 2010
Externally publishedYes

See relations at Aarhus University Citationformats

ID: 47659756