interest_need_read
updated
ProcessBench: Identifying Process Errors in Mathematical Reasoning
Paper
• 2412.06559
• Published
• 86
Maya: An Instruction Finetuned Multilingual Multimodal Model
Paper
• 2412.07112
• Published
• 28
Paper
• 2412.16720
• Published
• 37
Diving into Self-Evolving Training for Multimodal Reasoning
Paper
• 2412.17451
• Published
• 42
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Paper
• 2412.17256
• Published
• 47
Multi-LLM Text Summarization
Paper
• 2412.15487
• Published
• 6
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper
• 2412.16145
• Published
• 38
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
Paper
• 2412.14475
• Published
• 57
Progressive Multimodal Reasoning via Active Retrieval
Paper
• 2412.14835
• Published
• 73
Paper
• 2412.15115
• Published
• 377
VidTok: A Versatile and Open-Source Video Tokenizer
Paper
• 2412.13061
• Published
• 8
Paper
• 2412.13501
• Published
• 29
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for
Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper
• 2412.13663
• Published
• 161
Compressed Chain of Thought: Efficient Reasoning Through Dense
Representations
Paper
• 2412.13171
• Published
• 35
Reliable, Reproducible, and Really Fast Leaderboards with Evalica
Paper
• 2412.11314
• Published
• 2
The Open Source Advantage in Large Language Models (LLMs)
Paper
• 2412.12004
• Published
• 10
SPaR: Self-Play with Tree-Search Refinement to Improve
Instruction-Following in Large Language Models
Paper
• 2412.11605
• Published
• 18
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained
Evidence within Generation
Paper
• 2412.11919
• Published
• 36
Smaller Language Models Are Better Instruction Evolvers
Paper
• 2412.11231
• Published
• 28
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper
• 2412.10360
• Published
• 147
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper
• 2412.08635
• Published
• 49
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Paper
• 2412.09501
• Published
• 48
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity
Visual Descriptions
Paper
• 2412.08737
• Published
• 54
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for
Long-term Streaming Video and Audio Interactions
Paper
• 2412.09596
• Published
• 97
Paper
• 2412.08905
• Published
• 122
Chimera: Improving Generalist Model with Domain-Specific Experts
Paper
• 2412.05983
• Published
• 9
Evaluating and Aligning CodeLLMs on Human Preference
Paper
• 2412.05210
• Published
• 50
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
• 2412.21187
• Published
• 40
CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge
Graphs in the LLM Era
Paper
• 2412.18702
• Published
• 8
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Paper
• 2412.18619
• Published
• 60
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper
• 2412.18925
• Published
• 107
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks
Paper
• 2412.18072
• Published
• 18
YuLan-Mini: An Open Data-efficient Language Model
Paper
• 2412.17743
• Published
• 66
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via
Collective Monte Carlo Tree Search
Paper
• 2412.18319
• Published
• 39
Bridging the Data Provenance Gap Across Text, Speech and Video
Paper
• 2412.17847
• Published
• 12
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic
Retrieval
Paper
• 2412.15443
• Published
• 10
Ensembling Large Language Models with Process Reward-Guided Tree Search
for Better Complex Reasoning
Paper
• 2412.15797
• Published
• 18
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks
with Reinforcement Fine-Tuning
Paper
• 2412.16849
• Published
• 9
Outcome-Refining Process Supervision for Code Generation
Paper
• 2412.15118
• Published
• 19
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
Paper
• 2412.17498
• Published
• 22