Naahraf27/npo_llama-3.2-1b-instruct_forget10_ep10_lr5e-5_alpha1.0_beta0.1 Text Generation • 1B • Updated 3 days ago • 660
Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1 Text Generation • 3B • Updated 1 day ago • 537
Naahraf27/npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1 Text Generation • 8B • Updated 3 days ago • 664