![](/img/cover-not-exists.png)
A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis
Abhijit GosaviVolume:
55
Language:
english
Pages:
25
DOI:
10.1023/b:mach.0000019802.64038.6c
Date:
April, 2004
File:
PDF, 238 KB
english, 2004