Activate Activate Activate
contact  
Hello. Sign in to personalize your visit. New user? Register now.  

In
By author

Monthly
288 pp. per issue, 6 x 9,
illustrated
Founded: 1989
ISSN 0899-7667
E-ISSN 1530-888X
2008 ISI Impact Factor: 2.378

Neural Computation

March 2007, Vol. 19, No. 3, Pages 757-779
Posted Online February 13, 2007.
(doi:10.1162/neco.2007.19.3.757)
© 2007 Massachusetts Institute of Technology
Training Recurrent Networks by Evolino

Jürgen Schmidhuber

IDSIA, 6928 Manno (Lugano), Switzerland, and TU Munich, 85748 Garching, München, Germany,

Daan Wierstra

Matteo Gagliolo

Faustino Gomez

IDSIA, 6928 Manno (Lugano), Switzerland,

PDF (328.812 KB) PDF Plus (342.061 KB)

In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases, we present a novel method: EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent support vector machines. We show that Evolino-based LSTM can solve tasks that Echo State nets (Jaeger, 2004a) cannot and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.

Technology Partner - Atypon Systems, Inc.
  CrossRef member COUNTER member