Learning a dynamic policy by using policy gradient: application to biped walking
Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-Aki Sato, Kenji DoyaVolume:
38
Year:
2007
Language:
english
Pages:
14
DOI:
10.1002/scj.20441
File:
PDF, 1.35 MB
english, 2007