[Bug]: Eagle3 speculative decoding CUDA device-side assert crash with gpt-oss-120b under concurrent requests (TP=8, H20)
April 8, 2026 ยท #39295
Python
Difficulty: Easy
Parent Repository
vllm-project/vllm
Python repository
75,721 15,332