SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Paper • 2406.13975 • Published Jun 20, 2024
Effi-Code: Unleashing Code Efficiency in Language Models Paper • 2410.10209 • Published Oct 14, 2024 • 2
SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving Paper • 2505.23932 • Published May 29, 2025
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization Paper • 2405.15189 • Published May 24, 2024
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 26 days ago • 22
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published Dec 23, 2025 • 9
Bridging the Long-Term Gap: A Memory-Active Policy for Multi-Session Task-Oriented Dialogue Paper • 2505.20231 • Published May 26, 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning Paper • 2508.19996 • Published Aug 27, 2025
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published Dec 23, 2025 • 9
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction Paper • 2509.07403 • Published Sep 9, 2025 • 35
Electrocardiogram Instruction Tuning for Report Generation Paper • 2403.04945 • Published Mar 7, 2024 • 2
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models Paper • 2404.02657 • Published Apr 3, 2024 • 2