![](/img/cover-not-exists.png)
On Average Versus Discounted Reward Temporal-Difference Learning
John N. Tsitsiklis, Benjamin Van RoyVolume:
49
Language:
english
Pages:
13
DOI:
10.1023/a:1017980312899
Date:
November, 2002
File:
PDF, 87 KB
english, 2002