thompson

1.1.0
10.37k

The multi-armed bandit by Thompson Sampling, UCB-Upper confidence Bound, and randomized sampling.