Mac Metal generation throughput 5-7 tok/s (45x slower than CUDA) — vendored llama.cpp Metal kernel coverage gap

April 24, 2026 · #960
View on GitHub
TypeScript Difficulty: Medium

Labels

bug performance

Sign in required

Authenticate to use favourites & bookmarks

5