Overview of the SISAP 2024 Indexing Challenge

Eric S. Tellez*, Martin Aumüller, Vladimir Mic

*Corresponding author for this work

Research output: Contribution to book/anthology/report/proceeding; Article in proceedings; Research; peer-review

Abstract

The SISAP 2024 Indexing Challenge invited replicable and competitive approximate similarity search solutions for datasets of up to 100 million real-valued vectors. Participants are evaluated on the search performance of their implementations under quality constraints. Using a subset of the deep features of a neural network model provided by the LAION-5B dataset, the challenge posed three tasks, each with its unique focus:

  • Task 1, Unrestricted indexing: Conduct a classical approximate nearest neighbors search, ensuring an average recall of at least 0.8 for 30-NN queries.
  • Task 2, Memory-constrained indexing with reranking: Conduct nearest neighbors search in a low-memory setting where the dataset collection is only accessible on disk, ensuring the same quality as in Task 1.
  • Task 3, Memory-constrained indexing without reranking: Conduct nearest neighbor search in a setting where the dataset cannot be accessed at search stage, ensuring an average recall of at least 0.4 for 30-NN queries.

The present paper describes the details of the challenge, the evaluation system that was developed with it, and gives an overview of the submitted solutions.
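The quality constraints in the task descriptions (an average recall of at least 0.8 or 0.4 for 30-NN queries) can be made concrete with a small sketch. The code below is an illustration only, not the challenge's evaluation system: the function names `recall_at_k`, `average_recall`, and `meets_quality_constraint` are hypothetical, and it assumes recall is measured per query as the fraction of the true k nearest neighbors found among the k returned identifiers, then averaged over all queries.

```python
from typing import Sequence


def recall_at_k(retrieved: Sequence[int], ground_truth: Sequence[int], k: int = 30) -> float:
    """Fraction of the true k nearest neighbors present in the top-k retrieved ids.

    Assumption: both sequences hold object identifiers ordered by increasing
    distance, and the ground truth contains at least k entries.
    """
    truth = set(ground_truth[:k])
    found = set(retrieved[:k])
    return len(truth & found) / k


def average_recall(
    results: Sequence[Sequence[int]],
    gold: Sequence[Sequence[int]],
    k: int = 30,
) -> float:
    """Mean recall@k over all queries."""
    if len(results) != len(gold) or len(results) == 0:
        raise ValueError("results and gold must be non-empty and of equal length")
    return sum(recall_at_k(r, g, k) for r, g in zip(results, gold)) / len(results)


def meets_quality_constraint(
    results: Sequence[Sequence[int]],
    gold: Sequence[Sequence[int]],
    min_recall: float,
    k: int = 30,
) -> bool:
    """True if the average recall@k reaches the required threshold
    (0.8 for Tasks 1 and 2, 0.4 for Task 3, per the abstract)."""
    return average_recall(results, gold, k) >= min_recall
```

Under these assumptions, a submission for Task 1 or Task 2 would be checked with `meets_quality_constraint(results, gold, 0.8)`, and one for Task 3 with a threshold of 0.4.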

Original language: English
Title of host publication: Similarity Search and Applications - 17th International Conference, SISAP 2024, Proceedings
Editors: Edgar Chávez, Benjamin Kimia, Jakub Lokoč, Marco Patella, Jan Sedmidubsky
Number of pages: 11
Place of publication: Cham
Publisher: Springer
Publication date: 2025
Pages: 255–265
ISBN (Print): 978-3-031-75822-5
ISBN (Electronic): 978-3-031-75823-2
DOIs
Publication status: Published - 2025
Series: Lecture Notes in Computer Science
Volume: 15268
ISSN: 0302-9743

Keywords

  • Approximate nearest neighbor search
  • Experimental comparison of search methods
  • Indexing and searching pipelines
