Applying the Policy Gradient Method to Behavior Learning in Multiagent Systems: The Pursuit Problem

Seiji Ishihara, Harukazu Igarashi

Research output: Contribution to journalArticle

6 Citations (Scopus)
Original languageEnglish
Pages (from-to)101-109
JournalSystems and Computers in Japan
Volume37
Publication statusPublished - 2006 Jun 12

Cite this