搜索结果: 1-1 共查到“统计学 Stochastic Bandits”相关记录1条 . 查询时间(0.046 秒)
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Stochastic Bandits Beyond KL-UCB
2011/3/21
This paper presents a finite-time analysis of the KL-UCB algorithm, an online, horizon-free index policy for stochastic bandit problems. We prove two distinct results: first, for arbitrary bounded rew...