The Ultra-Scale Playbook 🌌 • Running • 3.65k • The ultimate guide to training LLMs on large GPU clusters
inclusionAI/LLaDA2.0-mini-preview • Text Generation • 16B • Updated about 1 month ago • 2.78k • 86
RouteFinder • Collection • Towards Foundation Models for Vehicle Routing Problems • 3 items • Updated May 15, 2025
PARCO • Collection • Parallel AutoRegressive Combinatorial Optimization • 3 items • Updated May 15, 2025