thompson

1.1.0
9.42k

The multi-armed bandit by Thompson Sampling, UCB-Upper confidence Bound, and randomized sampling.