[Feature]: PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

March 26, 2026 ยท #38279
View on GitHub
Python Difficulty: Easy

Labels

feature request

Sign in required

Authenticate to use favourites & bookmarks

5