Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
updated a model about 3 hours ago
mnoukhov/nuevamol-46M-6Btok-wd1 published a model about 13 hours ago
mnoukhov/nuevamol-360M-6Btok-wd8 published a model about 13 hours ago
mnoukhov/nuevamol-46M-6Btok-wd1Organizations
models 53
mnoukhov/nuevamol-46M-6Btok-wd1
Text Generation • 46.2M • Updated
mnoukhov/nuevamol-360M-6Btok-wd8
Updated
mnoukhov/nuevamol-360m-init
0.4B • Updated
mnoukhov/nuevamol-135M-wsd-6Btok-wd2.0
Text Generation • 0.1B • Updated • 19
mnoukhov/nuevamol-135m-6B-wd3
Text Generation • 0.1B • Updated • 100
mnoukhov/nuevamol-80m-reinvent-sft
Text Generation • 78.1M • Updated • 329
mnoukhov/nuevamol-80m-base
Text Generation • 78.1M • Updated • 112
mnoukhov/nuevamol-220m-reinvent-sft
Text Generation • 0.2B • Updated • 331
mnoukhov/nuevamol-80m-init
Text Generation • 0.1B • Updated • 27
mnoukhov/nuevamol-135m-reinvent-sft
Text Generation • 0.1B • Updated • 600
datasets 102
mnoukhov/chembl_filtered
Viewer • Updated • 1.18M • 50
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 16
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 10
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 15
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 9
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 25.3k • 88
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples
Viewer • Updated • 12.6k • 33
mnoukhov/gsm8k-train-harder-quartiles
Viewer • Updated • 11.2k • 9
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128
Viewer • Updated • 874 • 8
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128-completions
Viewer • Updated • 874 • 49