[Bug]: Ngram speculative decoding produces corrupted output on hybrid GDN (Qwen3.5) models
April 8, 2026 ยท #39273
Python
Difficulty: Easy
Parent Repository
vllm-project/vllm
Python repository
75,721 15,332
vllm-project/vllm
Python repository
Sign in required
Authenticate to use favourites & bookmarks