Abstract
Running complex neural networks for image classification and object detection requires large amounts of storage, GPU and CPU computation, and energy. Running such networks in the cloud is rarely a problem, since servers offer ample storage and GPU capacity. Running them on smartphones is more challenging, as smartphones still have limited storage, computation power, and energy. Deep compression techniques, which prune parameters and quantize weights, can make these networks fit on smartphones, but they have drawbacks: accuracy drops for large CNNs, and l1 and l2 regularization require more training iterations. The Google Brain team developed a CNN for smartphones called MNasNet, which uses neural architecture search based on reinforcement learning with a factorized hierarchical search space, and considers both accuracy and inference latency when designing the network. Its key contribution was finding a mobile CNN model that achieves both high accuracy and high speed. However, training MNasNet from scratch to extract the complex features needed for accurate predictions on smartphones is difficult. We therefore use knowledge distillation to improve the performance of the MNasNet model on mobile devices. We first train a complex, large network that extracts important features and produces strong predictions; we call this the teacher model. With the help of the teacher model, we then train a smaller MNasNet that replicates the teacher's results. Between the teacher and the student we also use a teacher assistant, a medium-sized model that mediates between the large and small models. Our experiments use the CIFAR-10 dataset.
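The teacher-student training described above can be sketched as a distillation loss in the style of Hinton et al., where the student is trained against the teacher's temperature-softened outputs as well as the hard labels. This is a minimal illustrative sketch in plain Python; the function names, temperature, and weighting values are assumptions for illustration, not the exact configuration used in this work.

```python
import math

def softmax(logits, T=1.0):
    # Temperature-scaled softmax: higher T yields a softer distribution
    exps = [math.exp(z / T) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, true_label, T=4.0, alpha=0.7):
    # Soft targets: the teacher's outputs softened by temperature T
    p_teacher = softmax(teacher_logits, T)
    p_student = softmax(student_logits, T)
    # Cross-entropy between teacher and student soft distributions
    soft_loss = -sum(pt * math.log(ps) for pt, ps in zip(p_teacher, p_student))
    # Standard cross-entropy of the student against the hard label
    hard_probs = softmax(student_logits)
    hard_loss = -math.log(hard_probs[true_label])
    # Blend the two terms; the T^2 factor keeps gradient scales comparable
    return alpha * (T ** 2) * soft_loss + (1 - alpha) * hard_loss
```

In a teacher-assistant setup, the same loss is applied twice: first the assistant is distilled from the teacher, then the small student (here, MNasNet) is distilled from the assistant.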