A Modern Self-Referential Weight Matrix That Learns to Modify Itself

source link: https://arxiv.org/abs/2202.05780
[Submitted on 11 Feb 2022]


The weight matrix (WM) of a neural network (NN) is its program. The programs of many traditional NNs are learned through gradient descent in some error function, then remain fixed. The WM of a self-referential NN, however, can keep rapidly modifying all of itself during runtime. In principle, such NNs can meta-learn to learn, and meta-meta-learn to meta-learn to learn, and so on, in the sense of recursive self-improvement. While NN architectures potentially capable of implementing such behavior have been proposed since the '90s, there have been few if any practical studies. Here we revisit such NNs, building upon recent successes of fast weight programmers and closely related linear Transformers. We propose a scalable self-referential WM (SRWM) that uses outer products and the delta update rule to modify itself. We evaluate our SRWM in supervised few-shot learning and in multi-task reinforcement learning with procedurally generated game environments. Our experiments demonstrate both practical applicability and competitive performance of the proposed SRWM. Our code is public.
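
The idea of a weight matrix that modifies itself through outer products and the delta rule can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation. The function names `delta_update` and `srwm_step`, the softmax feature map, the sigmoid learning rate, and the row layout of `W` (output rows, then key rows, then query rows, then one learning-rate row) are all assumptions made for illustration.

```python
import math

def softmax(xs):
    # Numerically stable softmax, used here as an assumed feature map.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(W, x):
    # Matrix-vector product for a matrix stored as a list of rows.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def delta_update(W, key, target, beta):
    # Delta rule: W + beta * (target - W key) outer key.
    # The correction is a rank-one outer product, so it touches every row of W.
    current = matvec(W, key)
    return [
        [w + beta * (t - c) * k for w, k in zip(row, key)]
        for row, t, c in zip(W, target, current)
    ]

def srwm_step(W, x, d_out):
    # One self-referential step (assumed layout: W has d_out + 2*d_in + 1 rows).
    d_in = len(x)
    out = matvec(W, softmax(x))
    y = out[:d_out]                           # ordinary network output
    k = out[d_out:d_out + d_in]               # key that W generates for itself
    q = out[d_out + d_in:d_out + 2 * d_in]    # query that W generates for itself
    beta = 1.0 / (1.0 + math.exp(-out[-1]))   # self-generated learning rate
    key = softmax(k)
    target = matvec(W, softmax(q))            # value W should now return for key
    return y, delta_update(W, key, target, beta)
```

With a learning rate of 1 and a one-hot key, `delta_update` makes the matrix return exactly the target for that key, which is the sense in which the delta rule overwrites an old association instead of only adding to it. In `srwm_step`, the key, the target, and the learning rate all come from `W` itself, so every row of the matrix, including the rows that produce the update, can change at each step.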
