Improved Regret Bounds of Bilinear Bandits using
Action Space Analysis
Kyoungseok Jang 1 Kwang-Sung Jun 2 Se-Young Yun 3 Wanmo Kang 1
Abstract arrange couples based on their experiences to get better rat-
ings and rewards. Balancing exploration and exploitation is
We consider the bilinear bandit problem where the core framework of the bandit approach, and researchers
the learner chooses a ...


雷达卡




京公网安备 11010802022788号







