TY - GEN
T1 - Deep Reorganization
T2 - 61st ACM/IEEE Design Automation Conference, DAC 2024
AU - Mendis, Hashan Roshantha
AU - Kang, Chih Kai
AU - Lin, Chun Han
AU - Chen, Ming Syan
AU - Hsiu, Pi Cheng
N1 - Publisher Copyright:
© 2024 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
PY - 2024/11/7
Y1 - 2024/11/7
N2 - Designing intelligent, tiny devices with limited memory is immensely challenging, exacerbated by the additional memory requirement of residual connections in deep neural networks. In contrast to existing approaches that eliminate residuals to reduce peak memory usage at the cost of significant accuracy degradation, this paper presents DERO, which reorganizes residual connections by leveraging insights into the types and interdependencies of operations across residual connections. Evaluations were conducted across diverse model architectures designed for common computer vision applications. DERO consistently achieves peak memory usage comparable to plain-style models without residuals, while closely matching the accuracy of the original models with residuals.
AB - Designing intelligent, tiny devices with limited memory is immensely challenging, exacerbated by the additional memory requirement of residual connections in deep neural networks. In contrast to existing approaches that eliminate residuals to reduce peak memory usage at the cost of significant accuracy degradation, this paper presents DERO, which reorganizes residual connections by leveraging insights into the types and interdependencies of operations across residual connections. Evaluations were conducted across diverse model architectures designed for common computer vision applications. DERO consistently achieves peak memory usage comparable to plain-style models without residuals, while closely matching the accuracy of the original models with residuals.
KW - TinyML
KW - deep neural networks
KW - peak memory
KW - residual connections
UR - https://www.scopus.com/pages/publications/85211091150
UR - https://www.scopus.com/pages/publications/85211091150#tab=citedBy
U2 - 10.1145/3649329.3656219
DO - 10.1145/3649329.3656219
M3 - Conference contribution
AN - SCOPUS:85211091150
T3 - Proceedings - Design Automation Conference
BT - Proceedings of the 61st ACM/IEEE Design Automation Conference, DAC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 June 2024 through 27 June 2024
ER -