[Performance]: llmcompressor W8A8 Inference: decoding stage speed is lower than FP16

April 1, 2026 ยท #38697
View on GitHub
Python Difficulty: Medium

Labels

performance

Sign in required

Authenticate to use favourites & bookmarks

5