Qwen3.5 DeltaNet breaks sample independence when using sequence packing
May 5, 2026 ยท #2131
Python
Difficulty: Medium
Labels
bug community-request waiting-on-customer
Parent Repository
NVIDIA-NeMo/Automodel
Python repository
503 150