Explaining outliers by subspace separability

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

  • Barbora Micenková, Denmark
  • Raymond T. Ng, University of British Columbia, Canada
  • Xuan-Hong Dang, Denmark
  • Ira Assent
Outliers are extraordinary objects in a data collection. Depending on the domain, they may represent errors,
fraudulent activities or rare events that are subject of our interest.
Existing approaches focus on detection of outliers or degrees of outlierness (ranking), but do not provide a possible explanation
of how these objects deviate from the rest of the data. Such explanations would help user to interpret or validate the detected
outliers.

The problem addressed in this paper is as follows: given an outlier detected by an existing algorithm, we propose a method that determines possible explanations for the outlier. These explanations are expressed in the form of subspaces in which the given outlier shows separability from the inliers. In this manner, our proposed method complements existing outlier detection algorithms by providing additional information about the outliers. Our method is designed to work with any existing
outlier detection algorithm and it also includes a heuristic that
gives a substantial speedup over the baseline strategy.
Original languageEnglish
Title of host publicationProceedings, IEEE 13th International Conference on Data Mining (ICDM 2013)
EditorsHui Xiong, George Karypis, Bhavani Thuraisingham, Diane Cook, Xindong Wu
Number of pages10
PublisherIEEE Press
Publication year2013
Pages518 - 527
ISBN (print) 978-0-7685-5108-1
Publication statusPublished - 2013
EventIEEE ICDM - Dallas, Texas, United States
Duration: 7 Dec 201310 Dec 2013

Conference

ConferenceIEEE ICDM
LandUnited States
ByDallas, Texas
Periode07/12/201310/12/2013
SeriesIEEE International Conference on Data Mining (ICDM'13)

See relations at Aarhus University Citationformats

ID: 70751795