方略 (Fanglue) Subject Navigation

Search results: 1-4 of 4 records found for "Statistics Bandit". Query time: 0.156 seconds.

On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization
Keywords: Bandit Derivative-Free Stochastic Convex Optimization
Date: 2012/11/23

The problem of stochastic convex optimization with bandit feedback (in the learning community) or without knowledge of gradients (in the optimization community) has received much attention in recent y...

Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
Keywords: Deterministic Sequencing Exploration Exploitation Multi-Armed Bandit Problems
Date: 2011/7/7

In the Multi-Armed Bandit (MAB) problem, there is a given set of arms with unknown reward distributions. At each time, a player selects one arm to play, aiming to maximize the total expected reward o...

Stochastic Bandit Based on Empirical Moments
Keywords: multiarmed bandit Stochastic Bandit Empirical asymptotic Burnetas Katehakis
Date: 2011/6/17

In the multiarmed bandit problem a gambler chooses an arm of a slot machine to pull considering a tradeoff between exploration and exploitation. We study the stochastic bandit problem where each arm...

Multi-armed bandit problem with precedence relations
Keywords: Markov chains multi-armed bandits Kullback–Leibler number likelihood ratio optimal stopping scheduling
Date: 2010/4/27

Consider a multi-phase project management problem where the decision maker needs to deal with two issues: (a) how to allocate resources to projects within each phase, and (b) when to enter the next ...
