Original language | English |
---|---|
Pages (from-to) | 164-174 |
Journal | PRICAI2008, Proceedings Lecture Notes in Computer Science |
Volume | 5351 |
Publication status | Published - 2008 Dec 19 |
Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies
Seiji Ishihara, Seiji Ishihara;Harukazu Igarashi
Research output: Contribution to journal › Article › peer-review
1
Citation
(Scopus)