ShiqiangWoo 's Collections 20250901
updated
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
• 2508.21365
• Published
• 29
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model
Pre-training
Paper
• 2508.17677
• Published
• 14
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper
• 2508.21767
• Published
• 12
AHELM: A Holistic Evaluation of Audio-Language Models
Paper
• 2508.21376
• Published
• 9
Efficient Code Embeddings from Code Generation Models
Paper
• 2508.21290
• Published
• 20
Morae: Proactively Pausing UI Agents for User Choices
Paper
• 2508.21456
• Published
• 5
Model-Task Alignment Drives Distinct RL Outcomes
Paper
• 2508.21188
• Published
• 8
CLIPSym: Delving into Symmetry Detection with CLIP
Paper
• 2508.14197
• Published
• 8
HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data
for Mobile Dexterous Manipulation
Paper
• 2508.20085
• Published
• 1
Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula
Discovery
Paper
• 2508.17380
• Published
• 7
EduRABSA: An Education Review Dataset for Aspect-based Sentiment
Analysis Tasks
Paper
• 2508.17008
• Published