Multilingual Language Model Pretraining using Machine-translated Data Paper • 2502.13252 • Published Feb 18, 2025
Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation Paper • 2510.02306 • Published Oct 2, 2025 • 3
TransWebLLM Collection A collection of training corpora and models for "Multilingual Language Model Pretraining using Machine-translated Data". • 5 items • Updated Apr 21, 2025 • 1
Improving Language Plasticity via Pretraining with Active Forgetting Paper • 2307.01163 • Published Jul 3, 2023 • 6