[CONTRIBUTION]: Hybrid Worker: Utilize Idle Prefill GPU Capacity for Overflow Decode in Disaggregated Serving

May 8, 2026 ยท #9311
View on GitHub
Rust Difficulty: Medium

Labels

enhancement language::rust dynamo-runtime backend::vllm performance

Sign in required

Authenticate to use favourites & bookmarks

5