TY - GEN
T1 - Efficient experience reuse in non-Markovian environments
AU - Dung, Le Tien
AU - Komeda, Takashi
AU - Takagi, Motoki
PY - 2008
Y1 - 2008
N2 - Learning time is always a critical issue in Reinforcement Learning, especially when Recurrent Neural Networks are used to predict Q values in non-Markovian environments. Experience reuse has received much attention due to its ability to reduce learning time. In this paper, we propose a new method to efficiently reuse experience. Our method generates new episodes from recorded episodes using an action-pair merger. Recorded episodes and new episodes are replayed after each learning epoch. We compare our method with standard online learning and with learning using experience replay on a vision-based robot problem. The results show the potential of this approach.
AB - Learning time is always a critical issue in Reinforcement Learning, especially when Recurrent Neural Networks are used to predict Q values in non-Markovian environments. Experience reuse has received much attention due to its ability to reduce learning time. In this paper, we propose a new method to efficiently reuse experience. Our method generates new episodes from recorded episodes using an action-pair merger. Recorded episodes and new episodes are replayed after each learning epoch. We compare our method with standard online learning and with learning using experience replay on a vision-based robot problem. The results show the potential of this approach.
KW - Recurrent neural networks
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=56749173285&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56749173285&partnerID=8YFLogxK
U2 - 10.1109/SICE.2008.4655239
DO - 10.1109/SICE.2008.4655239
M3 - Conference contribution
AN - SCOPUS:56749173285
SN - 9784907764296
T3 - Proceedings of the SICE Annual Conference
SP - 3327
EP - 3332
BT - Proceedings of SICE Annual Conference 2008 - International Conference on Instrumentation, Control and Information Technology
T2 - SICE Annual Conference 2008 - International Conference on Instrumentation, Control and Information Technology
Y2 - 20 August 2008 through 22 August 2008
ER -