CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published 7 days ago • 16
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 8 days ago • 27
Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models Paper • 2602.15772 • Published 16 days ago • 6
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 16 days ago • 26
Revisiting the Platonic Representation Hypothesis: An Aristotelian View Paper • 2602.14486 • Published 17 days ago • 11
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 18 days ago • 26
Quantifying the Gap between Understanding and Generation within Unified Multimodal Models Paper • 2602.02140 • Published about 1 month ago • 12
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models Paper • 2601.18744 • Published Jan 26 • 10
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published Dec 29, 2025 • 25
Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction Paper • 2512.18880 • Published Dec 21, 2025 • 25
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers Paper • 2512.17351 • Published Dec 19, 2025 • 28
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 34
FrameDiffuser: G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering Paper • 2512.16670 • Published Dec 18, 2025 • 4
FrontierCS: Evolving Challenges for Evolving Intelligence Paper • 2512.15699 • Published Dec 17, 2025 • 5
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions Paper • 2512.11995 • Published Dec 12, 2025 • 10
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield Paper • 2511.22677 • Published Nov 27, 2025 • 34
Exploring MLLM-Diffusion Information Transfer with MetaCanvas Paper • 2512.11464 • Published Dec 12, 2025 • 15