Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications

소식
arrow_forward_ios
세미나

세미나

Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications

이름:

김용덕

직함: 박사

소속:

삼성전자 SW

주최:

날짜: 2016. 12. 13. 오전 10:00 - 오전 10:00

위치: 302-308

요약

Compression is required for the deployments of deep convolutional neural network (CNN) on mobile devices. To deploy deep CNNs on mobile devices, we present a simple and effective scheme to compress the entire CNN, which we call one-shot whole network compression. The proposed scheme consists of three steps: (1) rank selection with variational Bayesian matrix factorization, (2) Tucker decomposition on kernel tensor, and (3) fine-tuning to recover accumulated loss of accuracy, and each step can be easily implemented using publicly available tools. We demonstrate the effectiveness of the proposed scheme by testing the performance of various compressed CNNs on the smartphone. Significant reductions in model size, runtime, and energy consumption are obtained, at the cost of small loss in accuracy.

연사 소개

expand_less

New Quality Metrics for Graph Visualisation

expand_more

Simulation-Enhanced Visual Computing for Real World Applications

세미나

Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications

소식

Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications