[Bug] Piecewise CUDA graph replay crashes with FlashInfer ≥0.6.6: q.shape[0] does not match qo_indptr[-1] in paged prefill

March 23, 2026 · #21218
View on GitHub
Python Difficulty: Easy

Sign in required

Authenticate to use favourites & bookmarks

5