Loughborough University
Browse

Wasserstein task embedding for measuring task similarities

Download (1.86 MB)
journal contribution
posted on 2025-04-07, 11:44 authored by Xinran Liu, Yikun Bai, Yuzhe Lu, Andrea SoltoggioAndrea Soltoggio, Soheil Kolouri

Measuring similarities between different tasks is critical in a broad spectrum of machine learning problems, including transfer, multi-task, continual, and meta-learning. Most current approaches to measuring task similarities are architecture-dependent: (1) relying on pre-trained models, or (2) training networks on tasks and using forward transfer as a proxy for task similarity. In this paper, we leverage the optimal transport theory and define a novel task embedding for supervised classification that is model-agnostic, training-free, and capable of handling (partially) disjoint label sets. In short, given a dataset with ground-truth labels, we perform a label embedding through multi-dimensional scaling and concatenate dataset samples with their corresponding label embeddings. Then, we define the distance between two datasets as the 2-Wasserstein distance between their updated samples. Lastly, we leverage the 2-Wasserstein embedding framework to embed tasks into a vector space in which the Euclidean distance between the embedded points approximates the proposed 2-Wasserstein distance between tasks. We show that the proposed embedding leads to a significantly faster comparison of tasks compared to related approaches like the Optimal Transport Dataset Distance (OTDD). Furthermore, we demonstrate the effectiveness of our embedding through various numerical experiments and show statistically significant correlations between our proposed distance and the forward and backward transfer among tasks on a wide variety of image recognition datasets.


History

School

  • Science

Published in

Neural Networks

Volume

181

Issue

January 2025

Publisher

Elsevier Ltd

Version

  • VoR (Version of Record)

Rights holder

©The Author(s)

Publisher statement

This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)

Acceptance date

2024-10-07

Publication date

2024-10-15

Copyright date

2024

ISSN

0893-6080

eISSN

1879-2782

Language

  • en

Depositor

Dr Andrea Soltoggio. Deposit date: 29 October 2024

Article number

106796

Usage metrics

    Loughborough Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC