[Bug] Piecewise CUDA graph replay crashes with FlashInfer ≥0.6.6: q.shape[0] does not match qo_indptr[-1] in paged prefill
March 23, 2026 · #21218
Python
Difficulty: Easy
Parent Repository
sgl-project/sglang
Python repository
25,689 5,301