deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated about 6 hours ago • 65.7k • • 766
Running 3.81k The Ultra-Scale Playbook 🌌 3.81k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated about 6 hours ago • 138k • • 2.98k
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 28 days ago • 50