Predictive First-Principles Simulations for Co-Designing Next-Generation Energy-Efficient AI Systems
Abstract
In modern generative-AI workloads, matrix-vector/matrix-matrix multiplications (MatMul) dominate the compute and energy cost. Achieving dramatic reductions in energy per token therefore requires a novel, specialized hardware that is co-designed across materials, devices, interconnects, circuits, and architectures rather than optimized at any single layer in isolation. In this Perspectives article, we argue that predictive (first-principles, fitting-parameter-free) device and interconnect simulations can close the loop between nanoscale physics and workload-level metrics, enabling the identification of device/interconnect operating regimes that plausibly support orders-of-magnitude improvements in energy efficiency of AI accelerators.
BibTeX
@misc{mamaluy2026predictive,
author = {Denis Mamaluy and Md Rahatul Islam Udoy and Juan P. Mendez and Ben Feinberg and Wei Pan and Ahmedullah Aziz},
title = {{Predictive First-Principles Simulations for Co-Designing Next-Generation Energy-Efficient AI Systems}},
howpublished = {arXiv preprint arXiv:2603.08995},
year = {2026},
doi = {10.48550/arXiv.2603.08995}
}