Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji PRO
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a model 5 days ago
openai/privacy-filter liked a model 13 days ago
Qwen/Qwen3.6-27B liked a model 16 days ago
deepseek-ai/DeepSeek-V4-Pro