Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaper › Conference article › Research › peer-review
Final published version
Spectral clustering is one of the most advantageous clustering approaches. However, standard Spectral Clustering is sensitive to noisy input data and has a high runtime complexity. Tackling one of these problems often exacerbates the other. As real-world datasets are often large and compromised by noise, we need to improve both robustness and runtime at once. Thus, we propose Spectral Clustering-Accelerated and Robust (SCAR), an accelerated, robustified spectral clustering method. In an iterative approach, we achieve robustness by separating the data into two latent components: cleansed and noisy data. We accelerate the eigendecomposition – the most time-consuming step – based on the Nyström method. We compare SCAR to related recent state-of-the-art algorithms in extensive experiments. SCAR surpasses its competitors in terms of speed and clustering quality on highly noisy data.
Original language | English |
---|---|
Journal | Proceedings of the VLDB Endowment |
Volume | 15 |
Issue | 11 |
Pages (from-to) | 3031-3044 |
Number of pages | 14 |
ISSN | 2150-8097 |
DOIs | |
Publication status | Published - Jul 2022 |
Event | 48th International Conference on Very Large Data Bases, VLDB 2022 - Sydney, Australia Duration: 5 Sept 2022 → 9 Sept 2022 |
Conference | 48th International Conference on Very Large Data Bases, VLDB 2022 |
---|---|
Country | Australia |
City | Sydney |
Period | 05/09/2022 → 09/09/2022 |
See relations at Aarhus University Citationformats
ID: 284490116