thompson

1.1.0
9.78k

The multi-armed bandit by Thompson Sampling, UCB-Upper confidence Bound, and randomized sampling.