MUXCONV：卷积神经网络中的信息多路复用

论文标题

MUXCONV：卷积神经网络中的信息多路复用

MUXConv: Information Multiplexing in Convolutional Neural Networks

论文作者

Lu, Zhichao, Deb, Kalyanmoy, Boddeti, Vishnu Naresh

论文摘要

近年来，卷积神经网络已经看到了计算效率的显着提高。一个关键的驱动力是通过组合$ 1 \ times 1 $和深度可分开的卷积来代替标准卷积层来交易模型表达和效率的想法。但是，效率的价格是跨网络和渠道的信息流的次优流。为了克服这一限制，我们提出了MuxConv，该层旨在通过逐渐多样地将通道和网络中的空间信息逐步增加信息流，同时减轻计算复杂性。此外，为了证明MuxConv的有效性，我们将其集成到有效的多目标进化算法中，以搜索最佳模型超参数，同时优化准确性，紧凑性和计算效率。在ImageNet上，所得模型被称为MuxNet，匹配性能（75.3％的TOP-1准确性）和MobileNetV3的多重ADD操作（2.18亿），而在所有三个标准中都超过1.6 $ \ times $紧凑型。 Muxnet在转移学习和适应对象检测时的性能也很好。在ChestX-Ray 14基准测试中，其精度与最先进的精度相当，而$ 3.3 \ times $ $紧凑，$ 14 \ times $ a更效率。同样，与MobilenEtV2相比，对Pascal VOC 2007的检测更准确，快28％，紧凑率高6％。代码可从https://github.com/human-analysis/muxconv获得

Convolutional neural networks have witnessed remarkable improvements in computational efficiency in recent years. A key driving force has been the idea of trading-off model expressivity and efficiency through a combination of $1\times 1$ and depth-wise separable convolutions in lieu of a standard convolutional layer. The price of the efficiency, however, is the sub-optimal flow of information across space and channels in the network. To overcome this limitation, we present MUXConv, a layer that is designed to increase the flow of information by progressively multiplexing channel and spatial information in the network, while mitigating computational complexity. Furthermore, to demonstrate the effectiveness of MUXConv, we integrate it within an efficient multi-objective evolutionary algorithm to search for the optimal model hyper-parameters while simultaneously optimizing accuracy, compactness, and computational efficiency. On ImageNet, the resulting models, dubbed MUXNets, match the performance (75.3% top-1 accuracy) and multiply-add operations (218M) of MobileNetV3 while being 1.6$\times$ more compact, and outperform other mobile models in all the three criteria. MUXNet also performs well under transfer learning and when adapted to object detection. On the ChestX-Ray 14 benchmark, its accuracy is comparable to the state-of-the-art while being $3.3\times$ more compact and $14\times$ more efficient. Similarly, detection on PASCAL VOC 2007 is 1.2% more accurate, 28% faster and 6% more compact compared to MobileNetV2. Code is available from https://github.com/human-analysis/MUXConv

下载PDF全文

下载文献需遵守相关版权规定

论文标题