Deep Learning Acceleration using Digital-Based Processing In-Memory

Mohsen Imani, Saransh Gupta, Yeseong Kim, Tajana Rosing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Processing In-Memory (PIM) has shown a great potential to accelerate inference tasks of Convolutional Neural Network (CNN). However, existing PIM architectures do not support high precision computation, e.g., in floating point precision, which is essential for training accurate CNN models. In addition, most of the existing PIM approaches require analog/mixed-signal circuits, which do not scale, exploiting insufficiently reliable multi-bit Non-Volatile Memory (NVM). In this paper, we propose FloatPIM, a fully-digital scalable PIM architecture that accelerates CNN in both training and testing phases. FloatPIM natively supports floating-point representation, thus enabling accurate CNN training. FloatPIM also enables fast communication between neighboring memory blocks to reduce internal data movement of the PIM architecture. We break the CNN computation into computing and data transfer modes. In computing mode, all blocks are processing a part of CNN training/testing in parallel, while in data transfer mode Float-PIM enables fast and row-parallel communication between the neighbor blocks. Our evaluation shows that FloatPIM training is on average 303.2 and 48.6 (4.3x and 15.8x) faster and more energy efficient as compared to GTX 1080 GPU (PipeLayer [1] PIM accelerator).

Original languageEnglish
Title of host publicationProceedings - 33rd IEEE International System on Chip Conference, SOCC 2020
EditorsGang Qu, Jinjun Xiong, Danella Zhao, Venki Muthukumar, Md Farhadur Reza, Ramalingam Sridhar
PublisherIEEE Computer Society
Pages123-128
Number of pages6
ISBN (Electronic)9781728187457
DOIs
StatePublished - 8 Sep 2020
Event33rd IEEE International System on Chip Conference, SOCC 2020 - Virtual, Las Vegas, United States
Duration: 8 Sep 202011 Sep 2020

Publication series

NameInternational System on Chip Conference
Volume2020-September
ISSN (Print)2164-1676
ISSN (Electronic)2164-1706

Conference

Conference33rd IEEE International System on Chip Conference, SOCC 2020
Country/TerritoryUnited States
CityVirtual, Las Vegas
Period8/09/2011/09/20

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

Fingerprint

Dive into the research topics of 'Deep Learning Acceleration using Digital-Based Processing In-Memory'. Together they form a unique fingerprint.

Cite this