Stage-wise Conservative Linear Bandits
Ahmadreza Moradipari, Christos Thrampoulidis, Mahnoosh Alizadeh
Department of Electrical and Computer Enginnering
University of California, Santa Barbara
ahmadreza_moradipari@ucsb.edu
Abstract
We study stage-wise conservative linear stochastic bandits: an instance of bandit
optimization, which accounts for (unknown) “safety constraints" that appear in
applications such as onli ...


雷达卡


京公网安备 11010802022788号







