[Bug] Gemma 4 31B crashes on k_eq_v full-attention layers (QKV split shape mismatch)

April 29, 2026 · #41283
Python Difficulty: Medium
