Activate Activate Activate
contact  
Hello. Sign in to personalize your visit. New user? Register now.  

In
By author
Evolutionary Computation

Quarterly (Spring, Summer, Fall, Winter)
141 pp. per issue
7 x 10
Founded: 1993
ISSN 1063-6560

E-ISSN 1530-9304
2008 ISI Impact Factor: 3.000

Evolutionary Computation

Spring 2006, Vol. 14, No. 1, Pages 1-19
Posted Online April 10, 2006.
(doi:10.1162/evco.2006.14.1.1)
© 2006 Massachusetts Institute of Technology
A Study of Structural and Parametric Learning in XCS

Tim Kovacs

University of Bristol, Bristol BS8 1UB, England, U.K.

Manfred Kerber

University of Birmingham, Birmingham B15 2TT, England, U.K.

PDF (366.314 KB) PDF Plus (370.364 KB)

Abstract

The performance of a learning classifier system is due to its two main components. First, it evolves new structures by generating new rules in a genetic process; second, it adjusts parameters of existing rules, for example rule prediction and accuracy, in an evaluation step, which is not only important for applying the rules, but also for the genetic process. The two components interleave and in the case of XCS drive the pop-ulation toward a minimal, fit, non-overlapping population. In this work we attempt to gain new insights as to the relative contributions of the two components. We find that the genetic component has an additional role when using the train/test approach which is not present in online learning. We compare XCS to a system in which the rule set is restricted to the initial random population (XCS-NGA, that is, XCS No Genetic Algorithm). For small Boolean functions we can give XCS-NGA all possible rules of a particular condition length. In online learning, XCS-NGA can, given sufficiently many rules, achieve a surprisingly high classification accuracy, comparable to that of XCS. In a train/test approach, however, XCS generalises better than XCS-NGA and there seem to be limitations of XCS-NGA which cannot be overcome simply by increasing the population size. This illustrates that the requirements of a function approximator tend to differ between reinforcement learning (which is typically online) and concept learning (which is typically train/test).

Technology Partner - Atypon Systems, Inc.
  CrossRef member COUNTER member