Doorkeeper

Talk by Xuedong Shang (Université de Lille)

Thu, 17 Dec 2020 17:00 - 18:00 JST
Online Link visible to participants
Register

Registration is closed

Get invited to future events

Free admission

Description

Title: Bayesian Best-Arm Identification and Beyond.

Abstract:
We consider the following sequential decision-making problem: A learner sequentially chooses options among a given set to evaluate and receives noisy feedback about their qualities in order to find the best one. Imagine that each evaluation is costly, hence a natural goal is to confidently identify the best option with as little evaluations as possible. Such scenarios, often known as (fixed-confidence) best-arm identification (BAI) in a stochastic multi-armed bandit game, serve as an fundamental abstraction for many real-world applications like A/B/C testing, clinical trials, hyper-parameter optimization for machine learning/deep learning algorithms, etc. In this presentation, I will talk about some new insights on BAI from a Bayesian perspective. In particular, we theoretically justify the use of Bayesian-flavoured algorithms for fixed-confidence BAI and propose a variant which disposes of the computational burden of existing Bayesian BAI algorithms. Then I would like to highlight the use of Bayesian BAI algorithms in guiding the design of a dynamic hyper-parameter optimization algorithm. As for future perspectives, I would like to include some discussions about how Bayesian algorithms would perform in a more general case where side information is taken into account (a.k.a. BAI for linear bandits).

About this community

RIKEN AIP Public

RIKEN AIP Public

Public events of RIKEN Center for Advanced Intelligence Project (AIP)

Join community