yujin731's Collections

agent

updated Sep 17, 2025

  • Scaling Test-time Compute for LLM Agents

    Paper • 2506.12928 • Published Jun 15, 2025 • 63

  • AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs

    Paper • 2507.08616 • Published Jul 11, 2025 • 14

  • ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge

    Paper • 2507.21990 • Published Jul 29, 2025 • 26

  • DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

    Paper • 2508.14460 • Published Aug 20, 2025 • 85

  • AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

    Paper • 2508.16153 • Published Aug 22, 2025 • 160

  • Scaling Agents via Continual Pre-training

    Paper • 2509.13310 • Published Sep 16, 2025 • 117