WTF GENIUS PAPERS Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. Continuous Latent Diffusion Language Model Paper ⢠2605.06548 ⢠Published 6 days ago ⢠69 Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠229 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠155 Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠229
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠155
Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
HUMAN-WRITTEN & LEGALLY-SOURCED* Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. BramVanroy/CommonCrawl-CreativeCommons Viewer ⢠Updated Aug 28, 2025 ⢠739M ⢠3.18k ⢠34 PleIAs/common_corpus Viewer ⢠Updated 7 days ago ⢠69.9k ⢠65.8k ⢠399 common-pile/comma_v0.1_training_dataset Viewer ⢠Updated Jun 6, 2025 ⢠784M ⢠18.8k ⢠40 crumb/openstax-text Viewer ⢠Updated Jul 14, 2023 ⢠3.35M ⢠2.13k ⢠5
WTF GENIUS PAPERS Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. Continuous Latent Diffusion Language Model Paper ⢠2605.06548 ⢠Published 6 days ago ⢠69 Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠229 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠155 Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
Scaling Latent Reasoning via Looped Language Models Paper ⢠2510.25741 ⢠Published Oct 29, 2025 ⢠229
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper ⢠2502.05171 ⢠Published Feb 7, 2025 ⢠155
Pretraining Language Models to Ponder in Continuous Space Paper ⢠2505.20674 ⢠Published May 27, 2025 ⢠3
HUMAN-WRITTEN & LEGALLY-SOURCED* Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. BramVanroy/CommonCrawl-CreativeCommons Viewer ⢠Updated Aug 28, 2025 ⢠739M ⢠3.18k ⢠34 PleIAs/common_corpus Viewer ⢠Updated 7 days ago ⢠69.9k ⢠65.8k ⢠399 common-pile/comma_v0.1_training_dataset Viewer ⢠Updated Jun 6, 2025 ⢠784M ⢠18.8k ⢠40 crumb/openstax-text Viewer ⢠Updated Jul 14, 2023 ⢠3.35M ⢠2.13k ⢠5