[RFC]: TurboQuant — Sub-4-bit KV Cache Quantization for Long-Context Omni Models
March 26, 2026 · #2215
Python
Difficulty: Easy
Labels
enhancement
Parent Repository
vllm-project/vllm-omni
Python repository
4,212 730