-
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 75 -
SmolVLM: Redefining small and efficient multimodal models
Paper • 2504.05299 • Published • 200 -
YourBench: Easy Custom Evaluation Sets for Everyone
Paper • 2504.01833 • Published • 22 -
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper • 2502.02737 • Published • 249
Collections including paper arxiv:2211.15533
-
Magicoder: Source Code Is All You Need
Paper • 2312.02120 • Published • 82 -
InstructCoder: Empowering Language Models for Code Editing
Paper • 2310.20329 • Published • 2 -
LLM-Assisted Code Cleaning For Training Accurate Code Generators
Paper • 2311.14904 • Published • 5 -
Nova^+: Generative Language Models for Binaries
Paper • 2311.13721 • Published • 3
-
dphn/dolphin-2.5-mixtral-8x7b
Text Generation • 47B • Updated • 1.68k • 1.24k -
The Stack: 3 TB of permissively licensed source code
Paper • 2211.15533 • Published • 6 -
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper • 2402.14658 • Published • 83 -
Design2Code: How Far Are We From Automating Front-End Engineering?
Paper • 2403.03163 • Published • 98
-
WizardLMTeam/WizardCoder-Python-34B-V1.0
Text Generation • Updated • 164 • 771 -
TheBloke/WizardLM-70B-V1.0-GPTQ
Text Generation • 9B • Updated • 298 • 37 -
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer • Updated • 70k • 674 • 195 -
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer • Updated • 143k • 1.46k • 241