Achieving Accurate In-Memory Neural Network Inference with Highly Overlapping Nonvolatile Memory State Distributions

Matthew J. Marinella, T. Patrick Xiao, Ben Feinberg, Christopher H. Bennett, Vineet Agrawal, Helmut Puchner, Sapan Agarwal
IEEE Electron Devices Technology and Manufacturing Conference (EDTM), 2022

Abstract

Analog in-memory computing is a method to improve the efficiency of deep neural network inference by orders of magnitude, by utilizing analog properties of a nonvolatile memory. This places new requirements on the memory device, which physically represent neural net weights as analog states. By carefully considering the algorithm implications when mapping weights to physical states, it is possible to achieve precision very close to that of a digital accelerator using a 40nm embedded SONOS.

BibTeX

@inproceedings{marinella2022achieving,
  author    = {Matthew J. Marinella and T. Patrick Xiao and Ben Feinberg and Christopher H. Bennett and Vineet Agrawal and Helmut Puchner and Sapan Agarwal},
  title     = {{Achieving Accurate In-Memory Neural Network Inference with Highly Overlapping Nonvolatile Memory State Distributions}},
  booktitle = {IEEE Electron Devices Technology and Manufacturing Conference (EDTM)},
  year      = {2022},
  month     = {mar},
  address   = {Penang, Malaysia},
  doi       = {10.1109/EDTM53872.2022.9797919}
}

← All papers