Inconsistent numerical correctness in scaled dot-product attention (QK matmul) depending on shape configuration

April 21, 2026 ยท #1730
View on GitHub
cpp Difficulty: Medium

Sign in required

Authenticate to use favourites & bookmarks

5