1 16 13

Norris Wheeler

wheeler404

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

upvoted a paper 12 days ago

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

upvoted a paper 12 days ago

TRACE: Capability-Targeted Agentic Training

View all activity

Organizations

None yet

upvoted 14 papers 12 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 15 days ago • 13

Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization

Paper • 2604.11259 • Published 15 days ago • 12

TRACE: Capability-Targeted Agentic Training

Paper • 2604.05336 • Published 21 days ago • 13

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks

Paper • 2604.11753 • Published 15 days ago • 14

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published 30 days ago • 18

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 14 days ago • 99

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 14 days ago • 87

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 17 days ago • 76

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 15 days ago • 138

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 15 days ago • 141

liked a model about 1 month ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • Updated Sep 17, 2025 • 36.7k • 1.61k

liked a dataset 4 months ago

opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Jan 28 • 958M • 12.3k • 73

liked a model 5 months ago

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23, 2025 • 155M • 374

updated a dataset 5 months ago

wheeler404/catgirl_sft_15k

Viewer • Updated Nov 14, 2025 • 15k • 38 • 1

published a dataset 5 months ago

wheeler404/catgirl_sft_15k

Viewer • Updated Nov 14, 2025 • 15k • 38 • 1

liked a dataset 6 months ago

databricks/databricks-dolly-15k

Viewer • Updated Jun 30, 2023 • 15k • 33.5k • 954

Norris Wheeler

AI & ML interests

Recent Activity

Organizations

wheeler404's activity