David Garcia's picture

David Garcia

davidgarcia14

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Language Models Need Sleep

liked a model 2 days ago

openbmb/MiniCPM5-1B

upvoted a paper 6 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Language Models Need Sleep

Paper • 2605.26099 • Published 5 days ago • 10

upvoted a paper 6 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 10 days ago • 204

upvoted a paper 7 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 18 days ago • 193

upvoted a paper 8 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 16 days ago • 145

upvoted a paper 15 days ago

Omni-Persona: Systematic Benchmarking and Improving Omnimodal Personalization

Paper • 2605.09996 • Published 19 days ago • 8

upvoted a paper 17 days ago

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Paper • 2605.07465 • Published 22 days ago • 29

upvoted a paper 18 days ago

PianoCoRe: Combined and Refined Piano MIDI Dataset

Paper • 2605.06627 • Published 23 days ago • 6

upvoted a paper about 1 month ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 121

upvoted 9 papers about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published Apr 3 • 34

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU

Paper • 2603.16428 • Published Mar 17 • 51

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published Apr 2 • 31

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published Apr 5 • 53

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

Falcon Perception

Paper • 2603.27365 • Published Mar 28 • 16

upvoted 3 papers 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211