# AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
**Source**: https://wsai.iitm.ac.in/preprints/aqua-attention-via-query-magnitudes-for-memory-and-compute-efficient-inference-in-llms-30/
**Parent**: https://wsai.iitm.ac.in/preprints/
- [Home](https://wsai.iitm.ac.in/)
- [Preprints](https://wsai.iitm.ac.in/preprints/)
- [AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference …](#)
## AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
<https://doi.org/10.48550/arXiv.2509.11155>
Authors
S, Santhosh G
,
Prakash, Saurav
,
Ravindran, Balaraman
Preprint Server
arXiv
Santhosh G S, Saurav Prakash, Balaraman Ravindran, AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
Preprint link: <https://arxiv.org/abs/2509.11155>