VGA: Hardware Accelerator for Scalable Long Sequence Model Inference

Published in IEEE/ACM International Symposium on Microarchitecture (MICRO), 2024