[Bug] DeepSeek-V4-Pro-FP8, PD Disaggregation: Input Length Hard-Capped at 18,432 Tokens Due to Incorrect SWA Limit Enforcement on Prefill Node

May 6, 2026 · #24523
Python Difficulty: Easy
