搜索结果: 1-4 共查到“统计学 Bandit”相关记录4条 . 查询时间(0.156 秒)
On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization
Bandit Derivative-Free Stochastic Convex Optimization
2012/11/23
The problem of stochastic convex optimization with bandit feedback (in the learning community) or without knowledge of gradients (in the optimization community) has received much attention in recent y...
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
Deterministic Sequencing Exploration Exploitation Multi-Armed Bandit Problems
2011/7/7
In the Multi-Armed Bandit (MAB) problem, there are a given set of arms with unknown reward distributions. At each time, a player selects one arm to play, aiming to maximize the total expected reward o...
Stochastic Bandit Based on Empirical Moments
multiarmed bandit Stochastic Bandit Empirical asymptotic Burnetas Katehakis
2011/6/17
In the multiarmed bandit problem a gambler chooses an arm
of a slot machine to pull considering a tradeoff between exploration and
exploitation. We study the stochastic bandit problem where each arm...
Multi-armed bandit problem with precedence relations
Markov chains multi-armed bandits Kullback–Leibler number likelihood ratio optimal stopping scheduling
2010/4/27
Consider a multi-phase project management problem where the
decision maker needs to deal with two issues: (a) how to allocate resources to
projects within each phase, and (b) when to enter the next ...