2011 FastLocalitySensitiveHashing

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

Locality-sensitive hashing (LSH) is a basic primitive in several large-scale data processing applications, including nearest-neighbor search, de-duplication, clustering, etc. In this paper we propose a new and simple method to speed up the widely-used Euclidean realization of LSH. At the heart of our method is a fast way to estimate the Euclidean distance between two d-dimensional vectors; this is achieved by the use of randomized Hadamard transforms in a non-linear setting. This decreases the running time of a k, L -parameterized LSH from [math]\displaystyle{ O (dkL) }[/math]to [math]\displaystyle{ O(d\log\;d + kL) }[/math]. Our experiments show that using the new LSH in nearest-neighbor applications can improve their running times by significant amounts. To the best of our knowledge, this is the first running time improvement to LSH that is both provable and practical.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2011 FastLocalitySensitiveHashingRavi Kumar
Anirban Dasgupta
Tamas Sarlos
Fast Locality-sensitive Hashing10.1145/2020408.20205782011