[Bug] FP8 quantization and HSDP cannot be enabled simultaneously for Diffusion models (dynamic_scaled_fp8_quant called on CPU tensor)
March 25, 2026 ยท #2159
Python
Difficulty: Medium
Labels
bug
Parent Repository
vllm-project/vllm-omni
Python repository
4,212 730