Loughborough University
Browse
- No file added yet -

Globally informative thompson sampling for structured bandit problems with application to crowdtranscoding

Download (338.2 kB)
conference contribution
posted on 2021-05-04, 12:42 authored by Xingchi Liu, Mahsa DerakhshaniMahsa Derakhshani, Ziming Zhu, Sangarapillai LambotharanSangarapillai Lambotharan
Multi-armed bandit is a widely-studied model for sequential decision-making problems. The most studied model in the literature is stochastic bandits wherein the reward of each arm follows an independent distribution. However, there is a wide range of applications where the rewards of different alternatives are correlated to some extent. In this paper, a class of structured bandit problems is studied in which rewards of different arms are functions of the same unknown parameter vector. To minimize the cumulative learning regret, we propose a globally informative Thompson sampling algorithm to learn and leverage the correlation among arms, which can deal with unknown multidimensional parameter and non-monotonic reward functions. Our studies demonstrate that the proposed algorithm achieves significant improvement in the learning speed. In particular, the designed algorithm is used to solve an edge transcoder selection problem in crowdsourced live video streaming systems and shows superior performance as compared to the existing schemes.

Funding

Communications Signal Processing Based Solutions for Massive Machine-to-Machine Networks (M3NETs)

Engineering and Physical Sciences Research Council

Find out more...

Leverhulme Trust Research Fellowship scheme under Grant Derakhshani-LTRF1920\16\67

History

School

  • Mechanical, Electrical and Manufacturing Engineering

Published in

3rd International Conference on Artificial Intelligence in Information and Communication (ICAIIC)

Pages

210-215

Source

3rd International Conference on Artificial Intelligence in Information and Communication (ICAIIC)

Publisher

IEEE

Version

  • AM (Accepted Manuscript)

Rights holder

© IEEE

Publisher statement

Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Acceptance date

2021-03-13

Publication date

2021-04-29

Copyright date

2021

ISBN

9781728176383

Language

  • en

Location

Korea/Virtual

Event dates

13th April 2021 - 16th April 2021

Depositor

Dr Mahsa Derakhshani. Deposit date: 31 March 2021

Usage metrics

    Loughborough Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC