从神经网络可观察的情况下识别学习规则

论文标题

从神经网络可观察的情况下识别学习规则

Identifying Learning Rules From Neural Network Observables

论文作者

Nayebi, Aran, Srivastava, Sanjana, Ganguli, Surya, Yamins, Daniel L. K.

论文摘要

大脑在学习过程中修饰其突触优势，以更好地适应其环境。但是，控制学习的基本可塑性规则是未知的。已经提出了许多建议，包括HEBBIAN机制，明确的错误反向传播和各种替代方案。关于确定任何给定的学习规则是否在实际生物系统中可以操作的任何特定的实验测量是一个开放的问题。在这项工作中，我们针对这个问题采用了“虚拟实验”方法。通过人工神经网络模拟理想化的神经科学实验，我们生成了在各种神经网络体系结构，损失函数，学习规则超标者和参数初始化中测量的大规模学习轨迹数据集。然后，我们采用一种歧视方法，训练线性和简单的非线性分类器，以基于这些可观察到的功能从功能中识别学习规则。我们表明，只能基于权重，激活或瞬时层次活动的总体统计数据来分开不同类别的学习规则，并且这些结果推广到有限的访问轨迹和固定体系结构和学习课程。我们确定每个可观察到的统计数据与规则识别最相关的统计数据，发现跨培训的网络活动的统计数据比从突触强度获得的统计数据对单元不足和测量噪声更强大。我们的结果表明，从几百个单位的次突触活动的电生理记录中获得的激活模式，通常在学习过程中以更广泛的间隔进行测量，可以为识别学习规则提供良好的基础。

The brain modifies its synaptic strengths during learning in order to better adapt to its environment. However, the underlying plasticity rules that govern learning are unknown. Many proposals have been suggested, including Hebbian mechanisms, explicit error backpropagation, and a variety of alternatives. It is an open question as to what specific experimental measurements would need to be made to determine whether any given learning rule is operative in a real biological system. In this work, we take a "virtual experimental" approach to this problem. Simulating idealized neuroscience experiments with artificial neural networks, we generate a large-scale dataset of learning trajectories of aggregate statistics measured in a variety of neural network architectures, loss functions, learning rule hyperparameters, and parameter initializations. We then take a discriminative approach, training linear and simple non-linear classifiers to identify learning rules from features based on these observables. We show that different classes of learning rules can be separated solely on the basis of aggregate statistics of the weights, activations, or instantaneous layer-wise activity changes, and that these results generalize to limited access to the trajectory and held-out architectures and learning curricula. We identify the statistics of each observable that are most relevant for rule identification, finding that statistics from network activities across training are more robust to unit undersampling and measurement noise than those obtained from the synaptic strengths. Our results suggest that activation patterns, available from electrophysiological recordings of post-synaptic activities on the order of several hundred units, frequently measured at wider intervals over the course of learning, may provide a good basis on which to identify learning rules.

下载PDF全文

下载文献需遵守相关版权规定

论文标题