SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching
Published in arXiv, 2025
Yuxuan Zhu, Ali Falahati, David H Yang, Mohammad Mohammadi Amiri.
Download here
Published in AISec '24: 17th ACM Workshop on Artificial Intelligence and Security, 2024
Yuxuan Zhu, Michael Mandulak, Kerui Wu, George Slota, Yuseok Jeon, Ka-Ho Chow, Lei Yu.
Download here
Published in ACM Transactions on Knowledge Discovery from Data, 2023
Zhong Li, Yuxuan Zhu, Matthijs Van Leeuwen.
Download here