Two Steps Reinforcement Learning

publication date

  • February 2008

start page

  • 213

end page

  • 245

issue

  • 2

volume

  • 23

International Standard Serial Number (ISSN)

  • 0884-8173

Electronic International Standard Serial Number (EISSN)

  • 1098-111X

abstract

  • When applying reinforcement learning in domains with very large or continuous state spaces, the experience obtained by the learning agent through interaction with the environment must be generalized. Generalization methods are usually based on approximating the value functions used to compute the action policy, and this is tackled in two different ways: on the one hand, by approximating the value functions with a supervised learning method; on the other hand, by discretizing the environment so that a tabular representation of the value functions can be used. In this work, we propose an algorithm that combines both approaches to exploit the benefits of both mechanisms, allowing higher performance. The approach is based on two learning phases. In the first, a learner is used as a supervised function approximator, but with a machine learning technique that also outputs a state-space discretization of the environment, as nearest prototype classifiers or decision trees do. In the second learning phase, the space discretization computed in the first phase is used to obtain a tabular representation of the value function computed previously, allowing that value function approximation to be tuned. Experiments in different domains show that executing both learning phases improves the results obtained by executing only the first one, taking into account both the resources used and the performance of the learned behavior. [Special issue on: Modeling Decisions for Artificial Intelligence]
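
As a rough illustration of the two-phase scheme described in the abstract, the sketch below pairs a supervised decision-tree value approximator (phase one), whose leaves double as a state-space discretization, with tabular Q-learning over those leaves (phase two). It is not the authors' implementation: the Gym-like environment interface, the Monte Carlo value targets, and all hyperparameters are illustrative assumptions.

    # Minimal sketch of the two-phase idea, under assumed interfaces:
    # env.reset() -> state (array-like), env.step(a) -> (next_state, reward, done),
    # and a small discrete action set. Names and hyperparameters are illustrative.
    import numpy as np
    from collections import defaultdict
    from sklearn.tree import DecisionTreeRegressor

    def phase_one(env, actions, episodes=200, gamma=0.95, max_regions=64):
        """Phase 1: fit a supervised value approximator whose tree leaves
        also yield a discretization of the continuous state space."""
        states, returns = [], []
        for _ in range(episodes):
            trajectory, s, done = [], env.reset(), False
            while not done:
                a = np.random.choice(actions)          # exploratory behavior
                s2, r, done = env.step(a)
                trajectory.append((s, r))
                s = s2
            g = 0.0
            for s, r in reversed(trajectory):          # Monte Carlo returns as targets
                g = r + gamma * g
                states.append(s)
                returns.append(g)
        tree = DecisionTreeRegressor(max_leaf_nodes=max_regions)
        tree.fit(np.array(states), np.array(returns))
        # Each tree leaf is treated as one discrete "state" for phase 2.
        return lambda s: tree.apply(np.asarray(s).reshape(1, -1))[0]

    def phase_two(env, actions, discretize, episodes=500,
                  alpha=0.1, gamma=0.95, eps=0.1):
        """Phase 2: tabular Q-learning over the regions found in phase 1,
        tuning the value function within the fixed discretization."""
        Q = defaultdict(float)
        for _ in range(episodes):
            s, done = env.reset(), False
            while not done:
                sd = discretize(s)
                if np.random.rand() < eps:             # epsilon-greedy action choice
                    a = np.random.choice(actions)
                else:
                    a = max(actions, key=lambda a_: Q[(sd, a_)])
                s2, r, done = env.step(a)
                target = r + gamma * max(Q[(discretize(s2), a_)] for a_ in actions)
                Q[(sd, a)] += alpha * (target - Q[(sd, a)])
                s = s2
        return Q

In this reading, phase one buys generalization (a compact, supervised model of the value function plus a data-driven partition of the state space), while phase two keeps the benefits of a tabular representation by refining the values within that fixed partition.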