[Bug] MiniMax M2.5 throws a CUDA out of memory error when running with speculative decoding
March 20, 2026 · #20966
Python
Difficulty: Medium
Parent Repository
sgl-project/sglang
Python repository
25,689 5,301