A work-in-progress collection of experimental supervised fine-tuning (SFT) runs. Primary focus: minimizing catastrophic forgetting and comparing LoRA with full-parameter tuning.
Francesco Albanese
Francesco-A
Recent Activity
updated a model 10 days ago: Francesco-A/mistral-7b-instruct-v0.3-bnb-4bit
published a model 10 days ago: Francesco-A/mistral-7b-instruct-v0.3-bnb-4bit
updated a Space 11 days ago: Francesco-A/GaiaAgent_Final_Assignment