[Bug]: GPT-OSS-20B on RTX PRO 6000 (SM120) falls back to TRITON_ATTN + Marlin; forcing FLASHINFER fails with sink setting not supported
April 17, 2026 ยท #40153
Python
Difficulty: Easy
Labels
bug
Parent Repository
vllm-project/vllm
Python repository
77,116 15,764