btb.selection.best module
class btb.selection.best.BestKReward(choices, k=2)
Bases: btb.selection.ucb1.UCB1
Best K reward selector
Computes the average reward for a choice from its past scores, using only the k highest scores. In the implementation, the remaining scores are replaced with ``nan`` values rather than removed, so that they still count toward the number of arm pulls.
- Parameters
k (int) – number of best scores to consider
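The masking step described above can be illustrated with a minimal, standalone sketch. It is not the library's code: the helper names `mask_all_but_best_k` and `best_k_average` are hypothetical, and the choice to average with `np.nanmean` while counting pulls as the full length of the score list is an assumption based on the description.

```python
import numpy as np


def mask_all_but_best_k(scores, k=2):
    """Return a float copy of ``scores`` with all but the k highest set to NaN.

    Hypothetical helper for illustration; not part of btb.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    masked = np.array(scores, dtype=float)  # copy, so the input is untouched
    if masked.size > k:
        # argsort is ascending, so everything before the last k entries
        # indexes the scores that are NOT among the k best.
        drop = np.argsort(masked)[:-k]
        masked[drop] = np.nan
    return masked


def best_k_average(scores, k=2):
    """Return (average of the k best scores, number of arm pulls).

    Assumption: NaN entries are ignored by the average but still
    counted as pulls, as the description above suggests.
    """
    masked = mask_all_but_best_k(scores, k)
    pulls = masked.size
    average = float(np.nanmean(masked)) if pulls else 0.0
    return average, pulls


if __name__ == "__main__":
    average, pulls = best_k_average([0.1, 0.9, 0.5, 0.7], k=2)
    print(average, pulls)
```

With four past scores and k=2, only the two highest scores contribute to the average, while the pull count remains four, so a confidence term based on the number of pulls would still see every trial.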
class btb.selection.best.BestKVelocity(choices, k=2)
Bases: btb.selection.best.BestKReward
Best K velocity selector