![](/img/cover-not-exists.png)
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Bertsekas, Dimitri P., Yu, HuizhenVolume:
37
Language:
english
Journal:
Mathematics of Operations Research
DOI:
10.1287/moor.1110.0532
Date:
February, 2012
File:
PDF, 1.84 MB
english, 2012