roshniramesh's Collections
int4 llm
Text Generation • 26 • 1
nvidia/Gemma-2b-it-ONNX-INT4
nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4 • 26 • 6
nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4
nvidia/Phi-3.5-mini-Instruct-ONNX-INT4
nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4
nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4
meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8 • Text Generation • 92 • 38
hugging-quants/gemma-2-9b-it-AWQ-INT4 • Text Generation • 9B • 2.02k • 7
Qwen/Qwen2-7B-Instruct-GPTQ-Int4 • Text Generation • 8B • 690 • 29
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 • Text Generation • 8B • 395k • 85
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 • Text Generation • 8B • 26.9k • 30
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit • Text Generation • 8B • 106
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF • Text Generation • 3B • 18.1k • 26
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 • Text Generation • 71B • 85.4k • 107
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF • Text Generation • 1B • 30.8k • 19
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 • Text Generation • 71B • 700 • 23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4 • Text Generation • 8B • 11.3k • 40
meta-llama/Llama-Guard-3-1B-INT4 • Text Generation • 8 • 27
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8 • Text Generation • 95 • 71
meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8 • Text Generation • 94 • 37
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8 • Text Generation • 113 • 47
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit • Text Generation • 7B • 52.7k • 23
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16 • Text Generation • 7B • 71 • 2
RedHatAI/Llama-2-7b-chat-quantized.w4a16 • Text Generation • 7B • 25
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16 • Text Generation • 8B • 66 • 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16 • Text Generation • 71B • 261 • 2
RedHatAI/gemma-2-2b-it-quantized.w4a16 • Text Generation • 1B • 56 • 1
RedHatAI/gemma-2-9b-it-quantized.w4a16 • Text Generation • 3B • 86 • 2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16 • Text Generation • 3B • 1.28k • 4
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 • Text Generation • 71B • 1.37k • 32
nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4
OpenVINO/mistral-7b-instruct-v0.1-int4-ov • Text Generation • 5
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov • Text Generation • 522 • 1
Text Generation • 72B • 195 • 47
Text Generation • 14B • 159 • 100
Text Generation • 8B • 702 • 75
Text Generation • 2B • 255 • 36
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4 • Text Generation • 111B • 64.1k • 18
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4 • Text Generation • 2B • 147 • 7
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 • Text Generation • 14B • 470 • 50
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4 • Text Generation • 4B • 113 • 6
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4 • Text Generation • 72B • 2.22k • 37
Qwen/Qwen1.5-4B-Chat-GGUF • Text Generation • 4B • 706 • 16
Qwen/Qwen1.5-0.5B-Chat-GGUF • Text Generation • 0.6B • 4.76k • 35
Qwen/Qwen1.5-7B-Chat-GGUF • Text Generation • 8B • 2.7k • 70
Qwen/CodeQwen1.5-7B-Chat-GGUF • Text Generation • 7B • 757 • 109
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int4 • Text Generation • 2B • 803 • 3
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4 • Text Generation • 0.5B • 412 • 9
Qwen/Qwen2.5-0.5B-Instruct-GGUF • Text Generation • 0.6B • 38.6k • 71
Qwen/Qwen2-1.5B-Instruct-GGUF • Text Generation • 2B • 6.34k • 27
Qwen/Qwen2-0.5B-Instruct-GGUF • Text Generation • 0.5B • 15.8k • 71
Qwen/Qwen2-7B-Instruct-GGUF • Text Generation • 8B • 5.53k • 177
Qwen/Qwen2-0.5B-Instruct-GPTQ-Int4 • Text Generation • 0.6B • 89 • 15
Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4 • Text Generation • 2B • 15.5k • 5
Qwen/Qwen2-72B-Instruct-GPTQ-Int4 • Text Generation • 73B • 59 • 33
Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4 • Text Generation • 57B • 198 • 23