ONNX Runtime running on CPU (MLAS) instead of Metal — 800-900% CPU spike during chat from fastembed/TTS/STT/vision-bridge

April 24, 2026 · #964
Language: TypeScript · Difficulty: Medium

Labels: bug, performance
