Behavior learning based on a policy gradient method: Separation of environmental dynamics and state-values in policies

Ishihara Seiji, Igarashi Harukazu

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Fingerprint

Dive into the research topics of 'Behavior learning based on a policy gradient method: Separation of environmental dynamics and state-values in policies'. Together they form a unique fingerprint.

Engineering & Materials Science