4 10 8

Wenqi Shi

wshi83

https://wshi83.github.io

AI & ML interests

LLMs, Generative AI, Data-Centric AI

Recent Activity

authored a paper 20 days ago

CellForge: Agentic Design of Virtual Cell Models

authored a paper 20 days ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

authored a paper 20 days ago

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

View all activity

Organizations

authored 4 papers 20 days ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4 • 39

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Paper • 2509.24193 • Published Sep 29 • 6

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

Paper • 2509.21882 • Published Sep 26

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published 21 days ago • 9

upvoted a paper 20 days ago

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published 21 days ago • 9

commented a paper 20 days ago

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published 21 days ago • 9 •

upvoted a paper 3 months ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Paper • 2509.24193 • Published Sep 29 • 6

liked a dataset 3 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24 • 6 • 1

liked 2 models 3 months ago

MedAgentGym/MedCopilot-7B

8B • Updated Jun 1 • 40 • 4

MedAgentGym/MedCopilot-14B

15B • Updated Jun 1 • 27 • 2

liked a dataset 3 months ago

MedAgentGym/SampledTrajs

Viewer • Updated Jun 1 • 21.4k • 610 • 4

upvoted a paper 4 months ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4 • 39

updated a dataset 5 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24 • 6 • 1

upvoted a paper 5 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20 • 85

published a dataset 5 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24 • 6 • 1

upvoted a paper 5 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

liked a dataset 5 months ago

wshi83/EHRAgent-treqs

Updated Feb 13, 2024 • 72 • 3

authored 2 papers 6 months ago

Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration

Paper • 2504.04915 • Published Apr 7

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Paper • 2506.04405 • Published Jun 4 • 7

upvoted a paper 6 months ago

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Paper • 2506.04405 • Published Jun 4 • 7

Wenqi Shi

AI & ML interests

Recent Activity

Organizations

wshi83's activity