搜索结果: 1-2 共查到“数学 multi-armed bandits”相关记录2条 . 查询时间(0.105 秒)
Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards
Network Optimization Unknown Variables
2010/11/24
In the classic multi-armed bandits problem, the goal is to have a policy for dynamically operating arms that each yield stochastic rewards with unknown means. The key metric of interest is regret, de...
PAC-Bayesian aggregation and multi-armed bandits
PAC-Bayesian aggregation multi-armed bandits
2010/11/22
This habilitation thesis presents several contributions to (1) the PAC-Bayesian analysis of statistical learning, (2) the three aggregation problems: given d functions, how to predict as well as (i) ...