Eval bug: Vulkan/RADV GFX1200: token generation drops from ~27 tok/s to ~9 tok/s at long context, fixed by server restart
May 10, 2026 · #22898
cpp
Difficulty: Medium
Labels: bug-unconfirmed
Parent Repository: ggml-org/llama.cpp
cpp repository
109,350 18,030