Bandit Algorithms / Tor Lattimore, Csaba Szepesvári.

Decision-making in the face of uncertainty is a significant challenge in machine learning, and the multi-armed bandit model is a commonly used framework to address it. This comprehensive and rigorous introduction to the multi-armed bandit problem examines all the major settings, including stochastic...

Full description

Bibliographic Details
Main Authors: Lattimore, Tor, 1987- (Author), Szepesvári, Csaba, (Author)
Format: Book
Published: Cambridge : Cambridge University Press, 2020.
Online Access:CONNECT