CAE-Net: Enhanced Converting Autoencoder Based Framework for Low-Latency Energy-Efficient DNN with SLO-Constraints

Hasanul Mahmud; Peng Kang; Palden Lama; Kevin Desai; Sushil K. Prasad

doi:10.1109/Cloud-Summit61220.2024.00028

Conference proceeding

2024 IEEE Cloud Summit, pp.128-134

06/27/2024

DOI: https://doi.org/10.1109/Cloud-Summit61220.2024.00028

Handle: https://hdl.handle.net/20.500.12741/rep:12409

Keywords: Accuracy; Converting Autoencoder; DNN compression; Edge devices; Energy consumption; Energy efficiency; Image edge detection; Knowledge transfer; Low latency; Low latency communication; Training

Abstract

As deep neural networks (DNNs) continue to be deployed on resource-limited edge devices for interactive applications with low-latency requirements, there is a growing need to reduce inference time and energy consumption while maintaining acceptable prediction accuracy. In response, we introduce CAE-Net, a novel framework for designing and training lightweight, energy-efficient DNNs for image classification on edge devices. The proposed framework consists of two parts: (1) a new Enhanced Converting Autoencoder that employs entropy-based intraclass clustering to learn the key image features by transforming hard images into easy representative images, and (2) a composite lightweight CAE-Net classifier that employs the pre-trained encoder of the Converting Autoencoder followed by a few classification layers from a baseline DNN trained using knowledge transfer. Experimental results on two popular image-classification datasets, MNIST and CIFAR-10, demonstrate that CAE-Net, unlike many state-of-the-art models, can satisfy inference latency targets of 10-20 ms on Raspberry Pi and 5-10 ms on Nvidia Jetson Nano. Against the competing models that meet these service-level objective (SLO) targets, CAE-Net achieves over 4-fold energy reduction and inference latency speedup on the CIFAR-10 dataset relative to AlexNet, its pruned/distilled variants, and other DNNs on Raspberry Pi, and about 6-fold on Jetson Nano, while maintaining similar or higher accuracy.
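The abstract does not say how the entropy-based intraclass clustering is computed, so the following is only a rough sketch under stated assumptions, not the authors' method. It assumes that "entropy" means the Shannon entropy of a baseline classifier's softmax output, and that the lowest-entropy image within each class serves as that class's easy representative, to which every image of the class is mapped as an autoencoder target. The function names `prediction_entropy` and `pick_easy_representatives` and the sample probabilities are hypothetical.

```python
import math


def prediction_entropy(probs):
    """Shannon entropy (in nats) of one softmax probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def pick_easy_representatives(probs_per_image, labels):
    """For each class label, return the index of the image whose
    prediction has the lowest entropy (assumed here to be the 'easy'
    representative of that class)."""
    best = {}  # label -> (entropy, image index)
    for idx, (probs, label) in enumerate(zip(probs_per_image, labels)):
        h = prediction_entropy(probs)
        if label not in best or h < best[label][0]:
            best[label] = (h, idx)
    return {label: idx for label, (_, idx) in best.items()}


# Hypothetical softmax outputs for four images in a two-class problem.
probs_per_image = [
    [0.98, 0.02],  # class 0, confident prediction (low entropy)
    [0.55, 0.45],  # class 0, uncertain prediction (high entropy)
    [0.10, 0.90],  # class 1, fairly confident
    [0.40, 0.60],  # class 1, uncertain
]
labels = [0, 0, 1, 1]

representatives = pick_easy_representatives(probs_per_image, labels)

# Each image is paired with the index of its class representative; in this
# sketch, that representative image would be the autoencoder's target.
target_index = [representatives[label] for label in labels]
```

With such pairs, an autoencoder could in principle be trained to reconstruct the representative image from any image of the same class; the training procedure itself is outside what the abstract describes.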

Details

Title: CAE-Net: Enhanced Converting Autoencoder Based Framework for Low-Latency Energy-Efficient DNN with SLO-Constraints
Creators: Hasanul Mahmud - The University of Texas at San Antonio; Peng Kang - The University of Texas at San Antonio; Palden Lama - The University of Texas at San Antonio; Kevin Desai - The University of Texas at San Antonio; Sushil K. Prasad - The University of Texas at San Antonio
Academic Unit: Computer Science Department; California State University, Sacramento
Publisher: IEEE; LOS ALAMITOS
Publication Details: 06/27/2024
Identifiers: 99258167662001671; https://hdl.handle.net/20.500.12741/rep:12409; https://doi.org/10.1109/Cloud-Summit61220.2024.00028
Language: English
Number of pages: 7