2 24 76

Mann Patel

manncodes

AI & ML interests

NLP, Mech Interp, Reasoning, MLSystems

Recent Activity

liked a dataset about 20 hours ago

nvidia/AceReason-Math

updated a collection about 20 hours ago

RL data

liked a dataset about 20 hours ago

meta-math/MetaMathQA

View all activity

Organizations

None yet

upvoted a paper 19 days ago

HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Paper • 2505.11475 • Published May 16 • 4

upvoted a paper 25 days ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted a collection about 2 months ago

Apertus LLM

Collection

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 304

upvoted a paper 2 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1 • 117

upvoted an article 2 months ago

Article

PipelineRL

Apr 25

•

upvoted a collection 3 months ago

— Long-context post-training 🧶 —

Collection

Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14 • 5

upvoted 4 papers 4 months ago

upvoted an article 5 months ago

Article

Everything About Long Context Fine-tuning

May 10, 2024

•

upvoted a paper 5 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 301

upvoted 2 articles 6 months ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

•

103

Article

KV Cache from scratch in nanoVLM

Jun 4

•

103

upvoted a paper 7 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 35

upvoted 3 articles 8 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

•

181

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

•

473

Article

Distributed Training with JAX and Flax NNX: A Practical Guide to Sharding

Mar 26

•

upvoted a paper 9 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 286

upvoted a paper 12 months ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 80

Mann Patel

AI & ML interests

Recent Activity

Organizations

manncodes's activity

PipelineRL

Everything About Long Context Fine-tuning

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

KV Cache from scratch in nanoVLM

🪆 Introduction to Matryoshka Embedding Models

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Distributed Training with JAX and Flax NNX: A Practical Guide to Sharding