[RFC] [kernel] Paged attention kernels should be MLX primitives to eliminate per-layer sync barriers
March 21, 2026 ยท #188
Python
Difficulty: Medium
Parent Repository
vllm-project/vllm-metal
Python repository
738 76
vllm-project/vllm-metal
Python repository
Sign in required
Authenticate to use favourites & bookmarks