Thompson Sampling via Local Uncertainty

This repo corresponds to the Thompson Sampling via Local Uncertainty paper, published in ICML 2020. We propose a new probabilistic modeling framework for Thompson sampling, where local latent variable uncertainty is used to sample the mean reward. We implemented our algorithms LU-Gauss/SIVI based on the Benchmark proposed by Carlos Riquelme.

Please, use the following when citing the code or the paper:

@article{zhendong2020luts, 
title={Thompson Sampling via Local Uncertainty},
author={Wang, Zhendong and Zhou, Mingyuan},
journal={International Conference on Machine Learning, ICML.}, 
year={2020}
}

Dependencies

This code is based on Python 2.7, with the main dependencies being TensorFlow==1.14 and other dependencies stated in the Benchmark

Dataset

You can download the required dataset from the benckmark.

Training

You can run comparison of all algorithms from run_bandit.py file, with one argument for a specific dataset. It will save the output results in one .npz file.

    python run_bandit.py -dataset='mushroom'

Acknowledgement

We thank greatly Riquelme et al. for making their code public.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
bandits		bandits
README.md		README.md
run_bandit.py		run_bandit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Thompson Sampling via Local Uncertainty

Dependencies

Dataset

Training

Acknowledgement

About

Releases

Packages

Languages

Zhendong-Wang/Thompson-Sampling-via-Local-Uncertainty

Folders and files

Latest commit

History

Repository files navigation

Thompson Sampling via Local Uncertainty

Dependencies

Dataset

Training

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages