-
-
-
-
-
-
Inference Providers
Active filters:
redhat
Text Generation
•
15B
•
Updated
•
107
•
1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
•
402B
•
Updated
•
259
•
2
RedHatTraining/AI296-m3diterraneo-hotels
8B
•
Updated
•
22
•
1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
•
104B
•
Updated
•
333
•
12
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
•
59B
•
Updated
•
198
•
1
Image-Text-to-Text
•
109B
•
Updated
•
4
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
•
146B
•
Updated
•
259
•
12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
•
1.0B
•
Updated
•
6
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B
•
Updated
•
333
•
1
Text-to-Image
•
Updated
•
4
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
•
24B
•
Updated
•
368
•
4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
•
24B
•
Updated
•
29
•
1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
•
4B
•
Updated
•
31
•
1
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
•
1B
•
Updated
•
113
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
•
2B
•
Updated
•
937
•
4
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
•
2B
•
Updated
•
1.19k
•
1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
•
1.0B
•
Updated
•
6.68k
•
1
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
•
1B
•
Updated
•
44.1k
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
Text Generation
•
2B
•
Updated
•
224
•
3
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
•
1B
•
Updated
•
332
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
•
1B
•
Updated
•
770
inference-optimization/Qwen3-4B-Thinking-2507.w4a16
Text Generation
•
1B
•
Updated
•
498
inference-optimization/Qwen3-4B-Instruct-2507.w4a16
Text Generation
•
1B
•
Updated
•
98
inference-optimization/Qwen3-30B-A3B-Thinking-2507.w4a16
Text Generation
•
5B
•
Updated
•
35
inference-optimization/Qwen3-30B-A3B-Instruct-2507.w4a16
Text Generation
•
5B
•
Updated
•
56
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
•
0.5B
•
Updated
•
71
•
1