Elie Bakouch's picture

Elie Bakouch PRO

eliebak

·

AI & ML interests

Training LLM's @ 🤗

Recent Activity

liked a model about 5 hours ago

EssentialAI/rnj-1-instruct

liked a model about 5 hours ago

EssentialAI/rnj-1

upvoted a paper 1 day ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

liked 2 models about 5 hours ago

EssentialAI/rnj-1-instruct

8B • Updated about 13 hours ago • 3.15k • 34

EssentialAI/rnj-1

8B • Updated about 13 hours ago • 115k • 16

upvoted a paper 1 day ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 5 days ago • 75

upvoted a paper 4 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 13 days ago • 238

upvoted an article 5 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

6 days ago

•

223

upvoted a collection 8 days ago

INTELLECT-3

INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated 8 days ago • 11

liked a dataset 8 days ago

PrimeIntellect/INTELLECT-3-RL

Viewer • Updated 28 days ago • 70.7k • 1.01k • 2

liked a model 9 days ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated 9 days ago • 8.96k • 636

upvoted an article 16 days ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

17 days ago

•

26

liked a dataset 18 days ago

sinatras/pmpp-eval

Viewer • Updated Oct 15 • 199 • 151 • 4

upvoted a collection 21 days ago

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 8 items • Updated 3 days ago • 8

commented a paper 23 days ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published 29 days ago • 38 •

upvoted a paper 23 days ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published 29 days ago • 38

liked a model 23 days ago

PrimeIntellect/Qwen3-4B-Instruct-2507-SFT-DeepDive

Updated 24 days ago • 309 • 3

liked a model 30 days ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated 28 days ago • 395k • • 1.5k

liked a Space about 1 month ago

The Smol Training Playbook

The secrets to building world-class LLMs

upvoted 2 collections about 1 month ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 389

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29 • 58

liked a model about 1 month ago

marin-community/marin-32b-base

Text Generation • 33B • Updated Nov 3 • 4.35k • 31

New activity in marin-community/marin-32b-base about 1 month ago

fix link to retrospective

#1 opened about 1 month ago by