[Bug] MTP speculative decoding always rejects draft tokens for NemotronH (accept_rate=0.33)
March 22, 2026 ยท #21138
Python
Difficulty: Medium
Parent Repository
sgl-project/sglang
Python repository
25,689 5,301
sgl-project/sglang
Python repository
Sign in required
Authenticate to use favourites & bookmarks