Innovating Works

AlgoRNN

Funded
Recurrent Neural Networks and Related Machines That Learn Algorithms
Recurrent neural networks (RNNs) are general parallel-sequential computers. Some learn their programs or weights. Our supervised Long Short-Term Memory (LSTM) RNNs were the first to win pattern recognition contests, and recently enabled the best known results in speech and handwriting recognition, machine translation, etc. They are now available to billions of users through the world's most valuable public companies, including Google and Apple. Nevertheless, in many real-world tasks RNNs do not yet live up to their full potential. Although universal in theory, in practice they fail to learn important types of algorithms. This ERC project will go far beyond today's best RNNs through novel RNN-like systems that address some of the biggest open RNN problems and hottest RNN research topics: (1) How can RNNs learn to control (through internal spotlights of attention) separate large short-memory structures such as sub-networks with fast weights, to improve performance on many natural short-term memory-intensive tasks which are currently hard to learn by RNNs, such as answering detailed questions on recently observed videos? (2) How can such RNN-like systems metalearn entire learning algorithms that outperform the original learning algorithms? (3) How to achieve efficient transfer learning from one RNN-learned set of problem-solving programs to new RNN programs solving new tasks? In other words, how can one RNN-like system actively learn to exploit algorithmic information contained in the programs running on another? We will test our systems on existing benchmarks, and create new, more challenging multi-task benchmarks. This will be supported by a rather cheap, GPU-based mini-brain for implementing large RNNs.
End date: 2024-03-31
Institution: USI
Budget: €3M
Project duration: 81 months
Start date: 2017-06-12
End date: 2024-03-31

Funding line: awarded

The H2020 body announced the award of the project on 2024-03-31.
Target funding line: The project was funded through the following grant:
ERC-2016-ADG: ERC Advanced Grant
Closed 8 years ago
Budget: The total project budget is €3M.
Project lead
UNIVERSITA DELLA SVIZZERA ITALIANA: No description or corporate purpose has been specified for this organization.
Technology profile: TRL 4-5