Policy Gradient Reinforcement Learning with Environmental Dynamics and Action-Values in Policies

Seiji Ishihara, Harukazu Igarashi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
