Hello Users, With the new version 3.1.1 it is possible to finetune a bigger model. As you know the issue is as follow: When training / finetuning a 3B parameters in fp16 mode, it will require: 3B x 2 bytes = 6 GB just to load the model about the...
Reading time: 1 mins 🕑
Likes: 2 ❤