融合MobileNetV3特征的结构化剪枝方法

doi:10.16183/j.cnki.jsjtu.2022.077

[1]

KRIZHEVSKY

A

, SUTSKEVER

I

, HINTON

G E

.

ImageNet classification with deep convolutional neural networks

[J]. Communications of the ACM, 2017, 60(6): 84-90.

DOI:10.1145/3065386 URL [本文引用: 1]

We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0%, respectively, which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully connected layers we employed a recently developed regularization method called \"dropout\" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.

[2]

李洋洋, 史历程, 万卫兵, 等.

基于卷积神经网络的三维物体检测方法

[J]. 上海交通大学学报, 2018, 52(1): 7-12.

[本文引用: 1]

LI

Yangyang

, SHI

Licheng

, WAN

Weibing

, et al.

A convolutional neural network-based method for 3D object detection

[J]. Journal of Shanghai Jiao Tong University, 2018, 52(1): 7-12.

[本文引用: 1]

[3]

KANG

J

, TARIQ

S

, OH

H

, et al.

A survey of deep learning-based object detection methods and datasets for overhead imagery

[J]. IEEE Access, 2022, 10: 20118-20134.

DOI:10.1109/ACCESS.2022.3149052 URL [本文引用: 1]

[4]

张峻宁, 苏群星, 王成, 等.

一种改进变换网络的域自适应语义分割网络

[J]. 上海交通大学学报, 2021, 55(9): 1158-1168.

[本文引用: 1]

ZHANG

Junning

, SU

Qunxing

, WANG

Cheng

, et al.

A domain adaptive semantic segmentation network based on improved transformation network

[J]. Journal of Shanghai Jiao Tong University, 2021, 55(9): 1158-1168.

[本文引用: 1]

[5]

LI

X

, YANG

Y B

, ZHAO

Q J

, et al.

Spatial pyramid based graph reasoning for semantic segmentation[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 2020: 8947-8956.

[本文引用: 1]

[6]

高晗, 田育龙, 许封元, 等.

深度学习模型压缩与加速综述

[J]. 软件学报, 2021, 32(1): 68-92.

[本文引用: 1]

GAO

Han

, TIAN

Yulong

, XU

Fengyuan

, et al.

Survey of deep learning model compression and acceleration

[J]. Journal of Software, 2021, 32(1): 68-92.

[本文引用: 1]

[7]

耿丽丽, 牛保宁.

深度神经网络模型压缩综述

[J]. 计算机科学与探索, 2020, 14(9): 1441-1455.

DOI:10.3778/j.issn.1673-9418.2003056 [本文引用: 1]

近年来，随着深度学习的飞速发展，深度神经网络受到了越来越多的关注，在许多应用领域取得了显著效果。通常，在较高的计算量下，深度神经网络的学习能力随着网络层深度的增加而不断提高，因此深度神经网络在大型数据集上的表现非常卓越。然而，由于其计算量大、存储成本高、模型复杂等特性，使得深度学习无法有效地应用于轻量级移动便携设备。因此，压缩、优化深度学习模型成为目前研究的热点。当前主要的模型压缩方法有模型裁剪、轻量级网络设计、知识蒸馏、量化、体系结构搜索等。对以上方法的性能、优缺点和最新研究成果进行了分析总结，并对未来研究方向进行了展望。

GENG

Lili

, NIU

Baoning

.

Survey of deep neural networks model compression

[J]. Journal of Frontiers of Computer Science & Technology, 2020, 14(9): 1441-1455.

[本文引用: 1]

[8]

WU

J

, WANG

Y

, WU

Z

, et al.

Deep k-means: Retraining and parameter sharing with harder cluster assignments for compressing deep convolutions[C]//Proceedings of the 35th International Conference on Machine Learning. Stockholm, Sweden: PMLR, 2018: 5363-5372.

[本文引用: 1]

[9]

AGGARWAL

V

, WANG

W L

, ERIKSSON

B

, et al.

Wide compression: Tensor ring nets[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018: 9329-9338.

[本文引用: 1]

[10]

CHEN

H T

, GUO

T Y

, XU

C

, et al.

Learning student networks in the wild[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, USA: IEEE, 2021: 6424-6433.

[本文引用: 1]

[11]

HOWARD

A G

, ZHU

M

, CHEN

B

, et al.

MobileNets: Efficient convolutional neural networks for mobile vision applications

[EB/OL]. (2017-04-17) [2022-03-18]. https://arxiv.org/abs/1704.04861.

URL [本文引用: 3]

[12]

SANDLER

M

, HOWARD

A

, ZHU

M L

, et al.

MobileNetV2: Inverted residuals and linear bottlenecks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018: 4510-4520.

[本文引用: 3]

[13]

HOWARD

A

, SANDLER

M

, CHEN

B

, et al.

Searching for MobileNetV3[C]//2019 IEEE/CVF International Conference on Computer Vision. Seoul, Korea: IEEE, 2019: 1314-1324.

[本文引用: 5]

[14]

CHOLLET

F

.

Xception: Deep learning with depthwise separable convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017: 1800-1807.

[本文引用: 2]

[15]

KIM

E

, AHN

C

, OH

S

.

NestedNet: Learning nested sparse structures in Deep Neural Networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018: 8669-8678.

[本文引用: 2]

[16]

LI

Y S

, CHEN

Y P

, DAI

X Y

, et al.

MicroNet: Improving image recognition with extremely low FLOPs[C]//2021 IEEE/CVF International Conference on Computer Vision. Montreal, Canada: IEEE, 2021: 458-467.

[本文引用: 2]

[17]

YANN

L C

, DENKER

J S

, SOLLA

S A

.

1990. Optimal brain damage

[J]. Neural Information Proceeding Systems. 1989, 2(279): 598-605.

[本文引用: 2]

[18]

HASSIBI

B

, STORK

D G

, WOLFF

G J

.

Optimal Brain Surgeon and general network pruning[C]//IEEE International Conference on Neural Networks. San Francisco, USA: IEEE, 1993: 293-299.

[本文引用: 2]

[19]

HAN

S

, POOL

J

, TRAN

J

, et al.

Learning both weights and connections for efficient neural networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal, Canada: MIT Press, 2015: 1135-1143.

[本文引用: 2]

[20]

CHEN

W L

, WILSON

J T

, TYREE

S

, et al.

Compressing neural networks with the hashing trick

[EB/OL]. (2015-04-19)[2022-03-18]. https://arxiv.org/abs/1504.04788.

URL [本文引用: 2]

[21]

LI

H

, ASIM

K

, IGOR

D

, et al.

Pruning filters for efficient convNets

[EB/OL]. (2017-05-10) [2022-03-18], https://arxiv.org/abs/1608.08710.

URL [本文引用: 2]

[22]

CHEN

Y H

, EMER

J

, SZE

V

.

Eyeriss: A spatial architecture for energy-efficient dataflow for convolutional neural networks[C]//2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture. Seoul, Korea: IEEE, 2016: 367-379.

[本文引用: 2]

[23]

LIU

Z

, LI

J

, SHEN

Z

, et al.

Learning efficient convolutional networks through network slimming[C]//2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 2755-2763.

[本文引用: 3]

[24]

韦越, 陈世超, 朱凤华, 等.

基于稀疏正则化的卷积神经网络模型剪枝方法

[J]. 计算机工程, 2021, 47(10): 61-66.

DOI:10.19678/j.issn.1000-3428.0059375 [本文引用: 3]

现有卷积神经网络模型剪枝方法仅依靠自身参数信息难以准确评估参数重要性，容易造成参数误剪且影响网络模型整体性能。提出一种改进的卷积神经网络模型剪枝方法，通过对卷积神经网络模型进行稀疏正则化训练，得到参数较稀疏的深度卷积神经网络模型，并结合卷积层和BN层的稀疏性进行结构化剪枝去除冗余滤波器。在CIFAR-10、CIFAR-100和SVHN数据集上的实验结果表明，该方法能有效压缩网络模型规模并降低计算复杂度，尤其在SVHN数据集上，压缩后的VGG-16网络模型在参数量和浮点运算量分别减少97.3%和91.2%的情况下，图像分类准确率仅损失了0.57个百分点。

WEI

Yue

, CHEN

Shichao

, ZHU

Fenghua

, et al.

Pruning method for convolutional neural network models based on sparse regularization

[J]. Computer Engineering, 2021, 47(10): 61-66.

DOI:10.19678/j.issn.1000-3428.0059375 [本文引用: 3]

The existing pruning algorithms for Convolutional Neural Network(CNN) models exhibit a low accuracy in evaluating the importance of parameters by relying on their own parameter information, which would easily lead to mispruning and affect the performance of model.To address the problem, an improved pruning method for CNN models is proposed.By training the model with sparse regularization, a deep convolutional neural network model with sparse parameters is obtained.Structural pruning is performed by combining the sparsity of the convolution layer and the BN layer to remove redundant filters.Experimental results on CIFAR-10, CIFAR-100 and SVHN datasets show that the proposed pruning method can effectively compress the network model scale and reduce the computational complexity.Especially on the SVHN dataset, the compressed VGG-16 network model reduces the amount of parameters and FLOPs by 97.3% and 91.2%, respectively, and the accuracy of image classification only loses 0.57 percentage points.

[25]

卢海伟, 夏海峰, 袁晓彤.

基于滤波器注意力机制与特征缩放系数的动态网络剪枝

[J]. 小型微型计算机系统, 2019, 40(9): 1832-1838.

[本文引用: 2]

结构化剪枝是模型压缩的一种有效方式,裁减掉网络中不重要的滤波器,减小网络的计算量和存储量.然而,仅仅基于滤波器自身的参数信息是无法准确判断该滤波器是否冗余.针对以上问题,提出一种利用卷积层和BN层双层参数信息的动态网络剪枝方法,该方法利用滤波器注意力机制以及BN(Batch Normalization)层缩放系数选择冗余滤波器,并对其进行裁剪.该方法具有三个优势:1)端到端的训练剪枝:训练和剪枝同时进行,训练速度更快.2)更大的优化空间:训练过程中动态调整被裁剪的滤波器,搜索最优的剪枝策略.3)更准确的滤波器选择:运用多重参数信息精确选取冗余的滤波器,提高了网络的泛化性能.实验分别在标准CIFAR-10数据集和CIFAR-100数据集上进行,尤其在CIFAR-10数据集上的实验结果表明,压缩后的ResNet56和ResNet110的浮点运算率减少40%多,但精度比基本网络高.

LU

Haiwei

, XIA

Haifeng

, YUAN

Xiaotong

.

Dynamic network pruning via filter attention mechanism and feature scaling factor

[J]. Journal of Chinese Computer Systems, 2019, 40(9): 1832-1838.

[本文引用: 2]

Structured pruning is an effective way of model compression,which reduces the unimportant filters in the network and reduces the amount of computation and storage of the network..However,it is impossible to accurately determine the filter based on the parameter information of the filter itself.A dynamic pruning method is proposed,which uses the attention mechanism of the filter and the BN layer scaling factor to select a redundant filter and crop it.The method has three advantages:1.End-to-end training pruning:training and pruning are performed at the same time and the training speed is faster.2.Larger optimization space:The training network dynamically adjusts the cropped filter to search for the optimal pruning strategy.3.More accurate filter selection:Multiple parameter information selects redundant filters to ensure the performance of the network.The experiments were carried out on CIFAR-10 and CIFAR-100 respectively.The experimental results on the CIFAR-10 dataset showed that the floating point operations of the compressed ResNet56 and ResNet110 were reduced by more than 40%,but the accuracy was improved.

[26]

LIU

C T

, LIN

T W

, WU

Y H

, et al.

Computation-performance optimization of convolutional neural networks with redundant filter removal

[J]. IEEE Transactions on Circuits & Systems, 2019, 66(5): 1908-1921.

[本文引用: 5]

[27]

IOFFE

S

, SZEGEDY

C

.

Batch normalization: Accelerating deep network training by reducing internal covariate shift[C]//Proceedings of the 32nd International Conference on International Conference on Machine Learning. Lille, France: JMLR: W&CP, 2015: 448-456.

[本文引用: 2]

[28]

KULKARNI

U

, MEENA

S M

, GURLAHOSUR

S V

, et al.

Quantization friendly MobileNet (QF-MobileNet) architecture for vision based applications on embedded platforms

[J]. Neural Networks: The Official Journal of the International Neural Network Society, 2021, 136: 28-39.

DOI:10.1016/j.neunet.2020.12.022 URL [本文引用: 1]

[29]

叶会娟, 刘向阳.

基于稀疏卷积核的卷积神经网络研究及其应用

[J]. 信息技术, 2017, 41(10): 5-9.

[本文引用: 1]

YE

Huijuan

, LIU

Xiangyang

.

Research and application of convolutional neural network based on sparse convolution kernel

[J]. Information Technology, 2017, 41(10): 5-9.

[本文引用: 1]

[30]

WU

S L

, ZHANG

F R

, CHEN

H D

, et al.

Semantic understanding based on multi-feature kernel sparse representation and decision rules for mangrove growth

[J]. Information Processing & Management, 2022, 59(2): 102813.

DOI:10.1016/j.ipm.2021.102813 URL [本文引用: 1]

[31]

MERINO

P

.

A difference-of-convex functions approach for sparse PDE optimal control problems with nonconvex costs

[J]. Computational Optimization & Applications, 2019, 74(1): 225-258.

[本文引用: 1]

[32]

GAO

X R

, BAI

Y Q

, LI

Q

.

A sparse optimization problem with hybrid L2-Lp regularization for application of magnetic resonance brain images

[J]. Journal of Combinatorial Optimization, 2021, 42(4): 760-784.

DOI:10.1007/s10878-019-00479-x [本文引用: 1]

ImageNet classification with deep convolutional neural networks

1

2017