超越尖峰网络：树突扩增和输入隔离的计算优势

论文标题

超越尖峰网络：树突扩增和输入隔离的计算优势

Beyond spiking networks: the computational advantages of dendritic amplification and input segregation

论文作者

Capone, Cristiano, Lupo, Cosimo, Muratore, Paolo, Paolucci, Pier Stanislao

论文摘要

大脑可以有效地学习各种任务，激励人们寻找以生物学启发的学习规则来改善当前的人工智能技术。大多数生物学模型都是由点神经元组成的，无法实现机器学习中的最新性能。最近的工作提出，树突输入的分离（神经元接收感觉信息和隔离隔室中的高阶反馈），高频峰值的产生将支持生物神经元中的错误反向传播。但是，这些方法需要对神经元的良好时空结构进行传播错误，这在生物网络中不可能可行。为了放松这一假设，我们建议爆发和树突输入分离为基于生物学上的基于目标的学习提供了自然支持，这不需要错误传播。我们提出了一个由三个分离隔室组成的金字塔神经元模型。基底和顶室之间的巧合机制允许产生高频尖峰。该体系结构允许基于由教学信号触发的目标爆发活动与复发连接引起的目标爆发活动之间的比较，从而提供了爆发依赖的学习规则，从而为基于目标的学习提供了支持。我们表明，该框架可用于有效解决时空任务，例如商店和3D轨迹的回忆。最后，我们建议这种神经元结构自然允许策划``层次模仿学习''，从而使挑战长途决策任务的分解能够分解为更简单的子任务。这可以在两级网络中实现，在该网络中，高网络充当``经理''，并为低网络````worker''''产生上下文信号。

The brain can efficiently learn a wide range of tasks, motivating the search for biologically inspired learning rules for improving current artificial intelligence technology. Most biological models are composed of point neurons, and cannot achieve the state-of-the-art performances in machine learning. Recent works have proposed that segregation of dendritic input (neurons receive sensory information and higher-order feedback in segregated compartments) and generation of high-frequency bursts of spikes would support error backpropagation in biological neurons. However, these approaches require propagating errors with a fine spatio-temporal structure to the neurons, which is unlikely to be feasible in a biological network. To relax this assumption, we suggest that bursts and dendritic input segregation provide a natural support for biologically plausible target-based learning, which does not require error propagation. We propose a pyramidal neuron model composed of three separated compartments. A coincidence mechanism between the basal and the apical compartments allows for generating high-frequency bursts of spikes. This architecture allows for a burst-dependent learning rule, based on the comparison between the target bursting activity triggered by the teaching signal and the one caused by the recurrent connections, providing the support for target-based learning. We show that this framework can be used to efficiently solve spatio-temporal tasks, such as the store and recall of 3D trajectories. Finally, we suggest that this neuronal architecture naturally allows for orchestrating ``hierarchical imitation learning'', enabling the decomposition of challenging long-horizon decision-making tasks into simpler subtasks. This can be implemented in a two-level network, where the high-network acts as a ``manager'' and produces the contextual signal for the low-network, the ``worker''.

下载PDF全文

下载文献需遵守相关版权规定

论文标题