Thompson Sampling for Multinomial Logit
Contextual Bandits
Min-hwan Oh Garud Iyengar
Columbia University Columbia University
New York, NY New York, NY
m.oh@columbia.edu garud@ieor.columbia.edu
Abstract
We consider a dynamic assortment selection problem where the goal is to offer
a sequence of assortments that maximizes the expected cumulative reve ...


雷达卡



京公网安备 11010802022788号







