P7n2c1dvlqk6

p7n2c1dvlqk6

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a dataset about 1 hour ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 859 • 761

upvoted a paper 2 days ago

Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs

Paper • 2605.24681 • Published 10 days ago • 5

upvoted a paper 4 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 6 days ago • 413

upvoted a paper 5 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 8 days ago • 133

liked a dataset 6 days ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 39.1k • 1.76k

upvoted 2 papers 11 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 21 days ago • 195

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 13 days ago • 204

liked a model 11 days ago

google/electra-base-discriminator

Updated Feb 29, 2024 • 58M • 121

upvoted a paper 12 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 14 days ago • 40

liked 2 models 19 days ago

RoMALab/pi05_libero_local_only_sanity_v2_client_6

Robotics • 4B • Updated 19 days ago • 40 • 1

jackxinning/Leanly_AI

Question Answering • 15B • Updated 3 days ago • 3.18k • 120

upvoted a paper 22 days ago

Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

Paper • 2605.05558 • Published 25 days ago • 3

liked a model 26 days ago

sentence-transformers/all-mpnet-base-v2

upvoted a paper about 1 month ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published Apr 30 • 57

liked 2 datasets about 1 month ago

harshal3099/apex-food-rd-chatml

Viewer • Updated May 1 • 5.5k • 117 • 1

WindyCh/SyntheticDataset

Viewer • Updated Apr 23 • 223 • 229 • 1

liked a dataset about 2 months ago

allenai/dolma

Updated Apr 17, 2024 • 4.1k • 1.04k

upvoted 3 papers about 2 months ago

Experience Transfer for Multimodal LLM Agents in Minecraft Game

Paper • 2604.05533 • Published Apr 7 • 16

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630
