Zzong's Notes

MAB

17건의 항목

2026년 7월 20일
Exploration and Exploitation trade-off
- MAB
- reinforcement_learning
2026년 7월 20일
epsilon-greedy algorithm
2026년 7월 20일
A Contextual-Bandit Approach to Personalized News Article Recommendation
2026년 7월 20일
Burst-induced Multi-Armed Bandit for Learning Recommendation
2026년 7월 20일
Recommender systems using LinUCB - A contextual multi-armed bandit approach
2026년 6월 14일
Exploring compact reinforcement-learning representations with linear regression
2026년 6월 14일
EXP3
- reinforcement_learning
- MAB
2026년 6월 14일
Reinforcement Learning
- reinforcement_learning
- MAB
2026년 6월 14일
UCB
- MAB
- reinforcement_learning
2026년 6월 14일
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits
2026년 6월 14일
Deep Bayesian Bandits Showdown - An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
2026년 6월 14일
Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
2026년 6월 14일
Mortal Multi Armed Bandit (2008)
- recoteam
- MAB
2026년 6월 14일
Multi-Armed Bandit
- MAB
- reinforcement_learning
2026년 6월 14일
Chernoff bounds
- MAB
2026년 6월 14일
method of moments
- MAB
- reinforcement_learning
2026년 6월 14일
추천시스템에서 Unbiased Offline Evaluation