📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Devising robust Multi-Armed Bandit algorithms in the presence of non-stationarities and long-range dependencies

Lead Research Organisation: Lancaster University
Department Name: Mathematics and Statistics

Abstract

The Multi-Armed Bandit (MAB) problem is one of the most central instances of sequential decision making under uncertainty, which plays a key role in online learning and optimization. MABs arise in a variety of modern real-world applications, such as online advertisement, Internet routing, and sequential portfolio selection, only to name a few. In this problem, a forecaster aims to maximize the expected sum of the rewards actively collected from unknown processes. MABs are typically studied under the assumption that the rewards are i.i.d.. However, this assumption does not necessarily hold in many practical situations. The objective of this project is to analyze the possibilities and limitations of more challenging, yet more realistic (restless) MAB settings, where the reward distributions may exhibit long-range dependencies and may possess potential non-stationarities. As part of the project, novel MAB strategies with good performance guarantees will be sought, and applications to real-world problems will be explored.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/V520214/1 30/09/2020 30/05/2027
2437073 Studentship EP/V520214/1 30/09/2020 29/09/2024 Ali Arabzadeh