Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji PRO
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a model about 18 hours ago
openai/privacy-filter liked a model 9 days ago
Qwen/Qwen3.6-27B liked a model 12 days ago
deepseek-ai/DeepSeek-V4-Pro