The Station: An Open-World Environment for AI-Driven Discovery Paper • 2511.06309 • Published 30 days ago • 35
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning Paper • 2507.16815 • Published Jul 22 • 39
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies Paper • 2508.08113 • Published Aug 11 • 11
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published Jun 8 • 114
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Paper • 2503.14478 • Published Mar 18 • 48
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue Paper • 2305.05290 • Published May 9, 2023
Self-Detoxifying Language Models via Toxification Reversal Paper • 2310.09573 • Published Oct 14, 2023
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Paper • 2502.13946 • Published Feb 19 • 10
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors Paper • 2502.13311 • Published Feb 18 • 2
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue Paper • 2402.06967 • Published Feb 10, 2024
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Paper • 2502.13946 • Published Feb 19 • 10