Sukesh Perla

hitchhiker3010

AI & ML interests

None yet

Recent Activity

upvoted 6 papers 11 days ago

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

Paper • 2604.19572 • Published 15 days ago • 21

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 21 days ago • 31

DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation

Paper • 2604.20841 • Published 14 days ago • 24

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

Paper • 2604.10268 • Published 25 days ago • 12

Context Unrolling in Omni Models

Paper • 2604.21921 • Published 13 days ago • 12

TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale

Paper • 2604.21889 • Published 13 days ago • 12

liked a Space 13 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

231

Explore synthetic data experiments on a virtual bookshelf

liked a model 19 days ago

NucleusAI/Nucleus-Image

Text-to-Image • Updated 20 days ago • 2.43k • 244

liked a dataset about 1 month ago

purvanshi/lica-data

Preview • Updated Mar 27 • 349 • 31

updated a collection about 2 months ago

to_read

Collection

97 items • Updated Mar 17

updated a collection 3 months ago

AI Ads

Collection

6 items • Updated Feb 10 • 1

reacted to sergiopaniego's post with 🔥 3 months ago

Post

2644

New TRL + OpenEnv example! 💥

Fine tune an LLM for playing Sudoku using an RL env via OpenEnv

Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.

Enjoy!

Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb

Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py

1 reply

upvoted 3 articles 3 months ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Article • Jan 27

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

Article • Jan 21

We Got Claude to Build CUDA Kernels and teach open models!

Article • Jan 28 • 155

liked a model 3 months ago

RuneXX/LTX-2-Workflows

Updated Mar 28 • 280

liked 2 models 4 months ago

Sri-Vigneshwar-DJ/hawky-ai-H1-4b-PM

Updated Jan 11 • 4

Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 724k • 1.7k

updated a collection 4 months ago

AI Agents

Collection

3 items • Updated Jan 8

liked a model 4 months ago

JaydenLu666/Reward-Forcing-T2V-1.3B

Updated Dec 5, 2025 • 10
