Article Luth: Efficient French Specialization for Small Language Models Aug 11, 2025 • 20
VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors Paper • 2604.02486 • Published 30 days ago • 10
Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts? Paper • 2503.18018 • Published Mar 23, 2025 • 7
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 189
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published Mar 5 • 37
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks • Evaluate multilingual models using FineTasks • 92