19
My fine-tuning run kept crashing until I set the gradient accumulation steps to 4
It was failing on a 12GB VRAM card after about 20 minutes every single time. Anyone know other tricks for memory issues with smaller models?
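For anyone landing here later, a rough sketch of why accumulation helped: it splits one large batch into smaller micro-batches, so only one micro-batch needs to fit in memory at a time, while the averaged gradient stays the same. The toy model, data, and function names below are made up for illustration and are not from my actual training script. It is plain Python with no framework, and it assumes equal-sized micro-batches.

```python
# Toy model: y_hat = w * x, loss = mean squared error.
# Everything here is a hypothetical illustration, not a real training setup.

def grad_mse(w, batch):
    """Gradient of mean((w*x - y)^2) with respect to w over one batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def accumulated_grad(w, data, micro_batch_size):
    """Build the same averaged gradient from equal-sized micro-batches."""
    chunks = [data[i:i + micro_batch_size]
              for i in range(0, len(data), micro_batch_size)]
    accum_steps = len(chunks)
    total = 0.0
    for chunk in chunks:
        # Scale each micro-batch gradient by 1/accum_steps so the running
        # sum equals the mean over the full batch.
        total += grad_mse(w, chunk) / accum_steps
    return total

data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2),
        (5.0, 9.8), (6.0, 12.1), (7.0, 13.9), (8.0, 16.0)]
w = 0.5

full = grad_mse(w, data)               # one batch of 8 samples
accum = accumulated_grad(w, data, 2)   # 4 micro-batches of 2 samples
print(abs(full - accum) < 1e-9)
```

So with 4 accumulation steps, the effective batch size is 4 times the micro-batch size, but peak activation memory is roughly what a single micro-batch needs. It does not reduce the memory taken by the weights or optimizer state.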
2 comments
gavin_kim35h ago
Try lowering your batch size to one.
6
ericfox 5h ago
Honestly, lowering the batch size just makes things slower for me.
4