[Bug]: TP=2 spec-decode (qwen3_next_mtp / DFlash) on Qwen3.6 hybrid GDN crashes at gpu_model_runner.py:1927 num_accepted_tokens_event.synchronize() (cudaErrorIllegalAddress)

April 29, 2026 ยท #41190
View on GitHub
Python Difficulty: Medium

Sign in required

Authenticate to use favourites & bookmarks

5