Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution • Paper • 2509.21072 • Published Sep 25, 2025
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling • Paper • 2401.16380 • Published Jan 29, 2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models • Paper • 2401.01335 • Published Jan 2, 2024
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages • Paper • 2401.05811 • Published Jan 11, 2024
YaRN: Efficient Context Window Extension of Large Language Models • Paper • 2309.00071 • Published Aug 31, 2023
Data Selection for Language Models via Importance Resampling • Paper • 2302.03169 • Published Feb 6, 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity • Paper • 2305.13169 • Published May 22, 2023
RedPajama: an Open Dataset for Training Large Language Models • Paper • 2411.12372 • Published Nov 19, 2024
LLaMA: Open and Efficient Foundation Language Models • Paper • 2302.13971 • Published Feb 27, 2023
OpenAssistant Conversations -- Democratizing Large Language Model Alignment • Paper • 2304.07327 • Published Apr 14, 2023
SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner • Paper • 2406.05498 • Published Jun 8, 2024
Universal and Transferable Adversarial Attacks on Aligned Language Models • Paper • 2307.15043 • Published Jul 27, 2023