Deciding What to Learn: A Rate-Distortion Approach
Dilip Arumugam 1 Benjamin Van Roy 1
Abstract
Agents that learn to select optimal actions represent a prominent focus of the sequential decision ...

... of information that, while insufficient to fully identify the environment, suffices to guide effective decisions. Then, the agent can prioritize gathering of information about this









