Abstract
This investigation presents a digital logic accelerator (DLA) design of a neural network hardware that utilizes output reuse. The DLA is used in the detection mechanism of underwater objects that was deployed in an underwater vehicle. A modified YoloV3-tiny network was also implemented to detect more than 20 underwater objects. The proposed DLA uses processing units that have parallel architectures of output windows, and output channels. Moreover, a new Inter-Controller is designed to control the direct memory access (DMA) together with a new Reshape module to improve the performance and power efficiency. A detailed description of the design as well as the measurements on silicon are presented. The chip is realized using a typical 180-nm CMOS process. It showed a performance result of 40.96 GOPS and the power consumption is 196.8 mW. The DLA was tested to demonstrate 19.88 frames per second and 40.96 GOPS.
Original language | English |
---|---|
Pages (from-to) | 4860-4871 |
Number of pages | 12 |
Journal | IEEE Transactions on Circuits and Systems I: Regular Papers |
Volume | 69 |
Issue number | 12 |
DOIs | |
Publication status | Published - 2022 Dec 1 |
Keywords
- Deep learning
- deep neural networks (DNN)
- energy efficient
- hardware accelerators
- reshape module
ASJC Scopus subject areas
- Hardware and Architecture
- Electrical and Electronic Engineering