Predictive First-Principles Simulations for Co-Designing Next-Generation Energy-Efficient AI Systems

Denis Mamaluy, Md Rahatul Islam Udoy, Juan P. Mendez, Ben Feinberg, Wei Pan, Ahmedullah Aziz
arXiv preprint, 2026

Abstract

In modern generative-AI workloads, matrix-vector/matrix-matrix multiplications (MatMul) dominate the compute and energy cost. Achieving dramatic reductions in energy per token therefore requires novel, specialized hardware that is co-designed across materials, devices, interconnects, circuits, and architectures rather than optimized at any single layer in isolation. In this Perspectives article, we argue that predictive (first-principles, fitting-parameter-free) device and interconnect simulations can close the loop between nanoscale physics and workload-level metrics, enabling the identification of device/interconnect operating regimes that plausibly support orders-of-magnitude improvements in the energy efficiency of AI accelerators.

BibTeX

@misc{mamaluy2026predictive,
  author    = {Denis Mamaluy and Md Rahatul Islam Udoy and Juan P. Mendez and Ben Feinberg and Wei Pan and Ahmedullah Aziz},
  title     = {{Predictive First-Principles Simulations for Co-Designing Next-Generation Energy-Efficient AI Systems}},
  howpublished = {arXiv preprint arXiv:2603.08995},
  year      = {2026},
  doi       = {10.48550/arXiv.2603.08995}
}