[Bug] MTP speculative decoding always rejects draft tokens for NemotronH (accept_rate=0.33)

March 22, 2026 ยท #21138
View on GitHub
Python Difficulty: Medium

Sign in required

Authenticate to use favourites & bookmarks

5