Policy gradient reinforcement learning with separated knowledge: Environmental dynamics and action-values in policies

Seiji Ishihara, Harukazu Igarashi

Research output: Contribution to journal › Article › peer-review
