Inference Providers
Active filters: awq
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 8.05k
• 9
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 380k
• 41
Brooooooklyn/Qwen3.6-35B-A3B-UD-Q4_K_XL-mlx
Text Generation
• 7B • Updated • 188
• 4
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 22.6k
• 9
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 85.8k
• 6
Qwen/Qwen2.5-14B-Instruct-AWQ
Text Generation
• 15B • Updated • 1.88M
• 33
mratsim/MiniMax-M2.5-FP8-INT4-AWQ
Text Generation
• 39B • Updated • 9.8k
• 21
QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ
Image-Text-to-Text
• 28B • Updated • 45.5k
• 12
demon-zombie/MiniMax-M2.7-AWQ-4bit
Text Generation
• 229B • Updated • 4.3k
• 2
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
• Updated • 149k
• 109
Qwen/Qwen2.5-Coder-32B-Instruct-AWQ
Text Generation
• 33B • Updated • 656k
• 35
casperhansen/llama-3.3-70b-instruct-awq
Text Generation
• 71B • Updated • 277k
• 42
curiousmind147/microsoft-phi-4-AWQ-4bit-GEMM
Text Generation
• 15B • Updated • 401
• 2
kaitchup/QwQ-32B-AWQ-4bit
Text Generation
• 33B • Updated • 223
• 3
Text Generation
• 8B • Updated • 103
• 1
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
Text Generation
• 8B • Updated • 679
• 6
qdzzzxc/RuadaptQwen3-32B-Instruct-AWQ
33B • Updated • 706
• 3
twhitworth/gpt-oss-120b-awq-w4a16
117B • Updated • 14.5k
• 24
sionic-ai/bge-reasoner-embed-qwen3-8b-0923-AWQ-4bit
Text Ranking
• 8B • Updated • 11
• 6
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 758k
• 42
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 90.2k
• 12
openbmb/MiniCPM-o-4_5-awq
Any-to-Any
• 9B • Updated • 2.64k
• 19
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 75.2k
• 25
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 88k
• 15
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 151k
• 17
Text Generation
• 586B • Updated • 5.37k
• 6
Image-Text-to-Text
• 5B • Updated • 42k
• 8
Brooooooklyn/Qwen3.5-35B-A3B-UD-Q8_K_XL-mlx
Text Generation
• 10B • Updated • 399
• 3
Brooooooklyn/Qwen3.5-9B-UD-Q8_K_XL-mlx
Text Generation
• 3B • Updated • 455
• 1
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 11.9k
• 7