Inference Providers
Active filters: RL
Teen-Different/Tabular_RL_For_Multi_Env
Reinforcement Learning
• Updated NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 46
• 4
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 54
• 7
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 49
• 9
prithivMLmods/Mensa-Beta-14B-Instruct
Text Generation
• 15B • Updated • 6
mradermacher/Mensa-Beta-14B-Instruct-GGUF
15B • Updated • 63
mradermacher/Mensa-Beta-14B-Instruct-i1-GGUF
15B • Updated • 301
prithivMLmods/Venatici-Coder-14B-Y.2
Text Generation
• 15B • Updated • 2
mradermacher/Venatici-Coder-14B-Y.2-GGUF
15B • Updated • 134
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 70
• 18
mradermacher/Venatici-Coder-14B-Y.2-i1-GGUF
15B • Updated • 302
prithivMLmods/Camelopardalis-650-14B-Instruct
Text Generation
• 15B • Updated • 2
mradermacher/Camelopardalis-650-14B-Instruct-GGUF
15B • Updated • 97
mradermacher/Camelopardalis-650-14B-Instruct-i1-GGUF
15B • Updated • 99
prithivMLmods/Fomalhaut-QwenR-1.5B
Text Generation
• 2B • Updated • 2
prithivMLmods/Horologium-QwenC-1.5B
Text Generation
• 2B • Updated • 5
prithivMLmods/Pictor-1338-QwenP-1.5B
Text Generation
• 2B • Updated • 3
prithivMLmods/Monoceros-QwenM-1.5B
Text Generation
• 2B • Updated • 1
prithivMLmods/Pisces-QwenR1-1.5B
Text Generation
• 2B • Updated • 6
prithivMLmods/Octantis-QwenR1-1.5B
Text Generation
• 2B • Updated • 4
adriey/Pictor-1338-QwenP-1.5B-Q8_0-GGUF
Text Generation
• 2B • Updated mradermacher/Pisces-QwenR1-1.5B-GGUF
2B • Updated • 59
mradermacher/Horologium-QwenC-1.5B-GGUF
2B • Updated • 146
mradermacher/Pictor-1338-QwenP-1.5B-GGUF
2B • Updated • 60
mradermacher/Octantis-QwenR1-1.5B-GGUF
2B • Updated • 88
mradermacher/Monoceros-QwenM-1.5B-GGUF
2B • Updated • 56
mradermacher/Horologium-QwenC-1.5B-i1-GGUF
2B • Updated • 296
mradermacher/Fomalhaut-QwenR-1.5B-GGUF
2B • Updated • 97
mradermacher/Pictor-1338-QwenP-1.5B-i1-GGUF
2B • Updated • 96
mradermacher/Monoceros-QwenM-1.5B-i1-GGUF
2B • Updated • 71