Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations Paper • 2602.05885 • Published 4 days ago • 24
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Paper • 2601.21420 • Published 11 days ago • 42
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods Paper • 2601.21821 • Published 11 days ago • 57
ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback Paper • 2601.10156 • Published 25 days ago • 26
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published 14 days ago • 122
AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 12 days ago • 99
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published 14 days ago • 122
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published 14 days ago • 122
AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 12 days ago • 99
Geometrically-Constrained Agent for Spatial Reasoning Paper • 2511.22659 • Published Nov 27, 2025 • 41
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation Paper • 2511.20635 • Published Nov 25, 2025 • 32
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization Paper • 2510.13554 • Published Oct 15, 2025 • 58