[Bug]: NVFP4 MoE produces garbage output on SM120 (RTX 5080) with CPU Weight Offloading — Nemotron-Cascade-2-30B-A3B

April 1, 2026 · #38718
View on GitHub
Python Difficulty: Medium

Sign in required

Authenticate to use favourites & bookmarks

5