Fast Filtering for Similarity Search Using Conjunctive Enumeration of Sketches in Order of Hamming Distance

Naoya Higuchi*, Yasunobu Imamura, Vladimir Mic, Takeshi Shinohara, Kouichi Hirata, Tetsuji Kuboyama

*Corresponding author af dette arbejde

Publikation: Bidrag til bog/antologi/rapport/proceedingKonferencebidrag i proceedingsForskningpeer review

Abstract

Sketches are compact bit-string representations of points, often employed for speeding up searches through the effects of dimensionality reduction and data compression. In this paper, we propose a novel sketch enumeration method and demonstrate its ability to realize fast filtering for approximate nearest neighbor search in metric spaces. Whereas the Hamming distance between the query’s sketch and sketches of points to be searched has been used for sketch prioritization traditionally, recent research has introduced asymmetric distances, enabling higher recall rates with fewer candidates. Additionally, sketch enumeration methods that speed up the filtering such that high-priority solution candidates are selected based on the priority of the sketch to the given query without the need for direct sketch comparisons have been proposed. Our primary goal in this paper is to further accelerate sketch enumeration through parallel processing. While Hamming distance-based enumeration can be parallelized relatively easily, achieving high recall rates requires a large number of candidates, and speeding up the filtering alone is insufficient for overall similarity search acceleration. Therefore, we introduce the conjunctive enumeration method, which concatenates two Hamming distance-based enumerations to approximate asymmetric distance-based enumeration. Then, we validate the effectiveness of the proposed method through experiments using large-scale public datasets. Our approach offers a significant acceleration effect, thereby enhancing the efficiency of similarity search operations.

OriginalsprogEngelsk
TitelProceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM
Redaktører Modesto Castrillon-Santana, Maria De Marsico, Ana Fred
Antal sider12
Vol/bind1
ForlagSCITEPRESS Digital Library
Publikationsdato2024
Sider499-510
ISBN (Trykt)9789897586842
DOI
StatusUdgivet - 2024
BegivenhedThe International Conference on Pattern Recognition Applications and Methods - Rome, Italien
Varighed: 24 feb. 202426 feb. 2024
https://icpram.scitevents.org/

Konference

KonferenceThe International Conference on Pattern Recognition Applications and Methods
Land/OmrådeItalien
ByRome
Periode24/02/202426/02/2024
Internetadresse

Fingeraftryk

Dyk ned i forskningsemnerne om 'Fast Filtering for Similarity Search Using Conjunctive Enumeration of Sketches in Order of Hamming Distance'. Sammen danner de et unikt fingeraftryk.

Citationsformater