GPT-1900 Collection • Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7). • 11 items • Updated Apr 2 • 9
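The temperature hint above controls how sharply a model's logits are converted into sampling probabilities. A minimal sketch of temperature scaling, assuming made-up logit values for three candidate tokens:

```python
import math

def temperature_softmax(logits, temperature):
    """Turn logits into a probability distribution, scaled by temperature.
    Lower temperature (e.g. 0.6) sharpens the distribution toward the top
    token; temperature 1.0 leaves the logits unscaled."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for illustration only
logits = [2.0, 1.0, 0.5]
p_low = temperature_softmax(logits, 0.6)   # within the suggested 0.6-0.7 range
p_high = temperature_softmax(logits, 1.0)
# At T=0.6 the top token receives more probability mass than at T=1.0
```

The suggested 0.6-0.7 range trades off determinism against diversity: low enough to keep physics reasoning focused, high enough to avoid degenerate repetition.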
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published 28 days ago • 119
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 350
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 • 231 • Explore synthetic data experiments on a virtual bookshelf
Post • 2301 • We just released our latest Shisa V2.1 Japanese multilingual models: https://huggingface.co/collections/shisa-ai/shisa-v21. Besides updates to our 14B and 70B, we have a new LFM2-based 1.2B, a Llama 3.2-based 3B, and a Qwen 3-based 8B, all with class-leading Japanese language capabilities. As usual, there are plenty of details in the Model Cards for those interested.
Featured • The Smol Training Playbook 📚 • 3.15k • The secrets to building world-class LLMs
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 514