[BUG] `quantized_matmul` produces wrong results for GQA `expand_dims` broadcasting when M < `vector_limit`
May 4, 2026 ยท #3480
cpp
Difficulty: Medium
Labels
bug
Parent Repository
ml-explore/mlx
cpp repository
25,988 1,754