Add llama.cpp as a CPU-optimized ServingRuntime for LLM inference
April 2, 2026 · #5334
Go
Difficulty: Medium
Parent Repository
kserve/kserve
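In KServe, a ServingRuntime is a cluster-scoped custom resource that tells the controller which container image and arguments to use for a given model format. A minimal sketch of what a CPU-oriented llama.cpp runtime might look like is below; the image tag, model path, port, and resource figures are illustrative assumptions, not details taken from the issue:

```yaml
# Illustrative sketch only: image tag, paths, and resource sizes are assumptions.
apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
  name: kserve-llamacpp
spec:
  # Advertise support for GGUF, the quantized model format llama.cpp consumes.
  supportedModelFormats:
    - name: gguf
      autoSelect: true
  containers:
    - name: kserve-container
      image: ghcr.io/ggml-org/llama.cpp:server   # hypothetical image reference
      args:
        - --model
        - /mnt/models/model.gguf                 # KServe mounts the model here by convention
        - --host
        - "0.0.0.0"
        - --port
        - "8080"
      # CPU-only sizing; tune threads/memory to the quantization level in use.
      resources:
        requests:
          cpu: "4"
          memory: 8Gi
        limits:
          cpu: "8"
          memory: 16Gi
```

An InferenceService could then reference this runtime by setting its model format to `gguf` (or naming the runtime explicitly), and llama.cpp's built-in HTTP server would handle completion requests on the configured port.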