论文标题

复杂值神经网络的通用近似定理

The universal approximation theorem for complex-valued neural networks

论文作者

Voigtlaender, Felix

论文摘要

我们将神经网络的经典通用近似定理推广到复杂值的神经网络。确切地说,我们考虑具有复杂激活功能的FeedForward网络$σ:\ Mathbb {C} \ to \ Mathbb {C {C} $,其中每个神经元执行操作$ \ MATHBB {C}^n \ to \ to \ to \ Mathbb {c} \ mathbb {c}^n $和a bias $ b \ in \ mathbb {c} $,并且使用$σ$ plactied componentwise。我们完全表征了这些激活功能$σ$,相关的复杂网络具有通用近似属性,这意味着它们可以在任何紧凑的子集上均匀地近似任何连续函数,$ \ mathbb {c}^d $任意良好。 Unlike the classical case of real networks, the set of "good activation functions" which give rise to networks with the universal approximation property differs significantly depending on whether one considers deep networks or shallow networks: For deep networks with at least two hidden layers, the universal approximation property holds as long as $σ$ is neither a polynomial, a holomorphic function, or an antiholomorphic function.另一方面,浅网络是通用的,并且仅当$σ$的实际部分或虚构部分不是多谐波函数时。

We generalize the classical universal approximation theorem for neural networks to the case of complex-valued neural networks. Precisely, we consider feedforward networks with a complex activation function $σ: \mathbb{C} \to \mathbb{C}$ in which each neuron performs the operation $\mathbb{C}^N \to \mathbb{C}, z \mapsto σ(b + w^T z)$ with weights $w \in \mathbb{C}^N$ and a bias $b \in \mathbb{C}$, and with $σ$ applied componentwise. We completely characterize those activation functions $σ$ for which the associated complex networks have the universal approximation property, meaning that they can uniformly approximate any continuous function on any compact subset of $\mathbb{C}^d$ arbitrarily well. Unlike the classical case of real networks, the set of "good activation functions" which give rise to networks with the universal approximation property differs significantly depending on whether one considers deep networks or shallow networks: For deep networks with at least two hidden layers, the universal approximation property holds as long as $σ$ is neither a polynomial, a holomorphic function, or an antiholomorphic function. Shallow networks, on the other hand, are universal if and only if the real part or the imaginary part of $σ$ is not a polyharmonic function.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源