[Feature]: PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference
March 26, 2026 · #38279
Python
Difficulty: Easy
Labels
feature request
Parent Repository
vllm-project/vllm
Python repository
75,721 15,332