[Bug]: TP=2 spec-decode (qwen3_next_mtp / DFlash) on Qwen3.6 hybrid GDN crashes at gpu_model_runner.py:1927 num_accepted_tokens_event.synchronize() (cudaErrorIllegalAddress)
April 29, 2026 ยท #41190
Python
Difficulty: Medium
Parent Repository
vllm-project/vllm
Python repository
78,579 16,248