thompson

1.1.0
10.05k

The multi-armed bandit by Thompson Sampling, UCB-Upper confidence Bound, and randomized sampling.