Abstract
The importance of similarity search has become prominent in the fast-evolving vector databases, which apply content embedding techniques on complex data to produce and manage large collections of high-dimensional vectors. Processing of such data is only possible by using a similarity function for storage, structure, and retrieval. However, if multiple users access the collection, their views on similarity can differ as similarity, in general, is subjective and context-dependent. In this article, we elaborate on the problem of a similarity search engine implementation, where users use a common index but search with personalised views of similarity, implemented by a possibly different similarity model. Specifically, we define a foundational theoretical framework and conduct experiments on real-life data to confirm the viability of such an approach. The experiments also indicate future research directions needed to propose and implement an effective and efficient personalised similarity search engine.
Original language | English |
---|---|
Title of host publication | Similarity Search and Applications - 17th International Conference, SISAP 2024, Proceedings |
Editors | Edgar Chávez, Benjamin Kimia, Jakub Lokoč, Marco Patella, Jan Sedmidubsky |
Number of pages | 14 |
Place of publication | Cham |
Publisher | Springer |
Publication date | 2025 |
Pages | 126-139 |
ISBN (Print) | 978-3-031-75822-5 |
ISBN (Electronic) | 978-3-031-75823-2 |
DOIs | |
Publication status | Published - 2025 |
Event | 17th International Conference, SISAP 2024 - Providence, United States Duration: 4 Jun 2024 → 6 Jun 2024 Conference number: 17 https://www.sisap.org/2024/ |
Conference
Conference | 17th International Conference, SISAP 2024 |
---|---|
Number | 17 |
Country/Territory | United States |
City | Providence |
Period | 04/06/2024 → 06/06/2024 |
Internet address |
Series | Lecture Notes in Computer Science |
---|---|
Volume | 15268 |
ISSN | 0302-9743 |
Keywords
- Personalized similarity
- Similarity search
- Vector databases