Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
Anantharam, V., Varaiya, P., Walrand, J.Volume:
32
Language:
english
Journal:
IEEE Transactions on Automatic Control
DOI:
10.1109/tac.1987.1104485
Date:
November, 1987
File:
PDF, 422 KB
english, 1987