Jiawei Wang's picture

Jiawei Wang

Jarvis1111

·

https://jarvisustc.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

upvoted a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper about 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

View all activity

Organizations

None yet

commented a paper 3 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47 •

commented a paper 4 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47 •

commented a paper 5 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 110 •

New activity in Jarvis1111/DoctorAgent-RL-SFT-1k-Thinking 5 months ago

Improve model card: Update pipeline tag, add `transformers` library, and enhance content with paper/code links

#1 opened 5 months ago by

New activity in Jarvis1111/DoctorAgent-RL 5 months ago

Add comprehensive model card

#1 opened 5 months ago by

commented a paper 8 months ago

DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue

Paper • 2505.19630 • Published May 26, 2025 • 7 •

New activity in Jarvis1111/llava-v1.5-7b-RobustVLGuard 8 months ago

Add pipeline tag and library name

#1 opened 9 months ago by

New activity in Jarvis1111/InternVL2-8B-RobustVLGuard 9 months ago

Add base model

#2 opened 9 months ago by

New activity in Jarvis1111/MiniGPT4-RobustVLGuard 9 months ago

Add pipeline tag, library name, and project page link

#1 opened 9 months ago by

New activity in Jarvis1111/InternVL2-8B-RobustVLGuard 9 months ago

Add pipeline tag and library name

#1 opened 9 months ago by

New activity in Jarvis1111/RobustVLGuard 9 months ago

Fix paper link

#2 opened 9 months ago by

commented a paper 9 months ago

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Paper • 2504.01308 • Published Apr 2, 2025 • 14 •

commented a paper 10 months ago

UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis

Paper • 2503.15893 • Published Mar 20, 2025 • 2 •

New activity in jordyvl/DUDE_loader over 2 years ago

The difference between azure_due and azure_original

#3 opened over 2 years ago by