[RFC]: TurboQuant — Sub-4-bit KV Cache Quantization for Long-Context Omni Models

March 26, 2026 · #2215
View on GitHub
Python Difficulty: Easy

Labels

enhancement

Sign in required

Authenticate to use favourites & bookmarks

5