[Bug]: Shared Expert output is incorrect under Sequence Parallel MoE (EP + TP > 1 + DP > 1) for Qwen3.5 MoE models

March 23, 2026 ยท #37856
View on GitHub
Python Difficulty: Easy

Labels

bug

Sign in required

Authenticate to use favourites & bookmarks

5