Unified algorithm to improve reinforcement learning in dynamic environments: An instance-based approach

Richardson Ribeiro, Fábio Favarim, Marco A.C. Barbosa, André Pinz Borges, Osmar Betazzi Dordal, Alessandro L. Koerich, Fabrício Enembreck

ICEIS 2012 – Proceedings of the 14th International Conference on Enterprise Information Systems 2012

Abstract: This paper presents an approach for speeding up the convergence of adaptive intelligent agents that use reinforcement learning algorithms. Speeding up an agent's learning is a complex task, since an inadequate updating technique may delay the learning process or even induce an unexpected acceleration that causes the agent to converge to an unsatisfactory policy. We have developed a technique for estimating policies that combines instance-based learning and reinforcement learning algorithms in Markovian environments. Experimental results in dynamic environments of different dimensions show that the proposed technique speeds up the convergence of the agents while achieving optimal action policies, avoiding problems of classical reinforcement learning approaches.