A robust policy bootstrapping algorithm for multi-objective reinforcement learning in non-stationary environments
Abdelfattah, Sherif, Kasmarik, Kathryn, Hu, JiankunLanguage:
english
Journal:
Adaptive Behavior
DOI:
10.1177/1059712319869313
Date:
August, 2019
File:
PDF, 1.68 MB
english, 2019