Nathan Lambert
natolambert
287 followers · 38 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
liked a model 17 days ago: openbmb/BitCPM-CANN-3B-unquantized
liked a model 23 days ago: perplexity-ai/pplx-embed-v1-late-0.6b
liked a model about 1 month ago: inclusionAI/Ling-2.6-flash
natolambert's datasets (66)
Sort: Recently updated
natolambert/rlhf-library • Viewer • Updated Sep 17, 2025 • 864 • 41 • 3
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 16
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 35
natolambert/rlhf-library-tulu-2-dpo-7b • Viewer • Updated Sep 15, 2025 • 48 • 16
natolambert/rlhf-library-OLMo-2-0425-1B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 28
natolambert/rlhf-library-OLMo-2-0425-1B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 24
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 15
natolambert/rlhf-library-tulu-2-7b • Viewer • Updated Sep 15, 2025 • 48 • 15
natolambert/rlhf-library-OLMo-7B-0424-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 16
natolambert/rlhf-library-OLMo-7B-0424-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 24
natolambert/rlhf-library-OLMo-7B-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 12
natolambert/rlhf-library-OLMo-7B-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-OLMo-2-0325-32B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 23
natolambert/rlhf-library-OLMo-2-0325-32B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 21
natolambert/rlhf-library-OLMo-2-1124-13B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 10
natolambert/rlhf-library-OLMo-2-1124-13B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 13
natolambert/rlhf-library-OLMo-2-1124-7B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 17
natolambert/rlhf-library-OLMo-2-1124-7B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 9
natolambert/rlhf-book-prompts-v2 • Viewer • Updated Sep 14, 2025 • 16 • 12
natolambert/coconot-r1-debug-debug • Viewer • Updated Jun 30, 2025 • 10 • 28
natolambert/tulu_v3.9_wildchat_100k_english-r1 • Viewer • Updated Jun 30, 2025 • 57.4k • 233
natolambert/acecoder-r1 • Viewer • Updated Jun 29, 2025 • 63.6k • 29
natolambert/rlvr-code-data-python-r1 • Viewer • Updated Jun 29, 2025 • 80k • 127
natolambert/tulu_v3.9_wildchat_100k_english-r1-debug • Viewer • Updated Jun 29, 2025 • 9 • 16
natolambert/hardcoded-test • Viewer • Updated Jun 29, 2025 • 24 • 13
natolambert/rlvr_acecoder_filtered-r1 • Updated Jun 28, 2025 • 8
natolambert/the-algorithm-python-r1 • Viewer • Updated Jun 28, 2025 • 608 • 20
natolambert/the-algorithm-python-r1-debug • Viewer • Updated Jun 28, 2025 • 10 • 18
natolambert/GeneralThought-430K-filtered • Viewer • Updated Mar 26, 2025 • 338k • 2.6k • 35