![](/img/cover-not-exists.png)
Instance-dependent ââ-bounds for policy evaluation in tabular reinforcement learning
Pananjady, Ashwin, Wainwright, Martin J.Year:
2020
Journal:
IEEE Transactions on Information Theory
DOI:
10.1109/TIT.2020.3027316
File:
PDF, 832 KB
2020