![](/img/cover-not-exists.png)
Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion
Herschkorn, Stephen J., Peköz, Erol, Ross, Sheldon M.Volume:
10
Language:
english
Journal:
Probability in the Engineering and Informational Sciences
DOI:
10.1017/S0269964800004149
Date:
January, 1996
File:
PDF, 545 KB
english, 1996