CPU input embedding adds an extra graph split in multi-GPU layer-split configurations
May 10, 2026 ยท #22926
cpp
Difficulty: Medium
Parent Repository
ggml-org/llama.cpp
cpp repository
109,429 18,048
ggml-org/llama.cpp
cpp repository
Sign in required
Authenticate to use favourites & bookmarks