[Bug] Training on an ChatML-like dataset somehow uses much, much more VRAM than on an Alpaca-like dataset
March 21, 2026 ยท #4504
Python
Difficulty: Medium
Labels
bug
Parent Repository
unslothai/unsloth
Python repository
60,135 5,153