[Bug] DeepSeek-V4-Pro-FP8, PD Disaggregation Input Length Hard-Capped at 18,432 Tokens Due to Incorrect SWA Limit Enforcement on Prefill Node
May 6, 2026 · #24523
Python
Difficulty: Easy
Parent Repository
sgl-project/sglang
Python repository
27,500 5,782