[RFC] [kernel] Paged attention kernels should be MLX primitives to eliminate per-layer sync barriers

March 21, 2026 · #188
Python Difficulty: Medium

