Now showing items 1-1 of 1

    • Liu, Xi (2020-04-23)
      Bandit learning has been widely applied to handle the exploration-exploitation dilemma in sequential decision problems. To solve the dilemma, a large number of bandit algorithms have been proposed. While many of these ...