Loughborough University
Browse

SLoSH: set locality sensitive hashing via Sliced-Wasserstein embeddings

conference contribution
posted on 2023-12-14, 15:21 authored by Yuzhe Lu, Xinran Liu, Andrea SoltoggioAndrea Soltoggio, Soheil Kolouri

Learning from set-structured data is an essential problem with many applications in machine learning and computer vision. This paper focuses on a non-parametric, data-independent, and efficient learning algorithm from setstructured data using optimal transport and approximate nearest neighbor (ANN) solutions, particularly localitysensitive hashing. We consider the problem of set retrieval from an input set query. This retrieval problem requires 1) an efficient mechanism to calculate the distances/dissimilarities between sets and 2) an appropriate data structure for a fast nearest-neighbor search. To that end, we propose to use Sliced-Wasserstein embedding as a computationally efficient “set-2-vector” operator that enables downstream ANN with theoretical guarantees. The set elements are treated as samples from an unknown underlying distribution, and the Sliced-Wasserstein distance is used to compare sets. We demonstrate the effectiveness of our algorithm, denoted as Set Locality Sensitive Hashing (SLoSH), on various set retrieval datasets and compare our proposed embedding with standard set embedding approaches, including Generalized Mean (GeM) embedding/pooling, Featurewise Sort Pooling (FSPool), Covariance Pooling, and Wasserstein embedding and show consistent improvement in retrieval results, both in terms of accuracy and computational efficiency.

History

School

  • Science

Department

  • Computer Science

Published in

2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Source

2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Publisher

IEEE

Version

  • AM (Accepted Manuscript)

Acceptance date

2024-10-24

Language

  • en

Location

Waikoloa, Hawaii

Event dates

4th January 2024 - 8th January 2024

Depositor

Dr Andrea Soltoggio. Deposit date: 13 December 2023

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC