Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a paper about 1 month ago

The Massive Legal Embedding Benchmark (MLEB)

upvoted an article about 1 month ago

Australian-made LLM beats OpenAI and Google at legal retrieval

View all activity

Organizations

None yet

upvoted an article 1 day ago

Article

We Got Claude to Fine-Tune an Open Source LLM

3 days ago

•

220

upvoted a paper about 1 month ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published Oct 22 • 17

upvoted an article about 1 month ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

Oct 23

•

upvoted an article 2 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25

•

updated a dataset 3 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17 • 359 • 21

published a dataset 3 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17 • 359 • 21

upvoted 2 papers 3 months ago

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8 • 16

updated a dataset 4 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15 • 93.6k • 19

published a dataset 4 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15 • 93.6k • 19

liked a model 5 months ago

Menlo/Lucy-128k

Text Generation • 2B • Updated Aug 4 • 285 • 108

liked a model 6 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25 • 2.11k • 184

upvoted a paper 6 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30 • 14

upvoted a paper 7 months ago

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Paper • 2505.11711 • Published May 16 • 11

upvoted 2 articles 7 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

•

234

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted a paper 8 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 138

upvoted an article 8 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18

•

upvoted a collection 9 months ago

Gemma 3

Collection

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 55 items • Updated 4 days ago • 96

updated a dataset 9 months ago

soates/australian-insurance-pii-dataset-corrected

Viewer • Updated Feb 25 • 1.55k • 19

Stephen Oates PRO

AI & ML interests

Recent Activity

Organizations

soates's activity

We Got Claude to Fine-Tune an Open Source LLM

Australian-made LLM beats OpenAI and Google at legal retrieval

There is no such thing as a tokenizer-free lunch

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents: an MCP-powered agent in 50 lines of code

Gotchas in Tokenizer Behavior Every Developer Should Know