Instructions to use tensorblock/MoE-Merging-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/MoE-Merging-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/MoE-Merging-GGUF", filename="MoE-Merging-Q2_K.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use tensorblock/MoE-Merging-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/MoE-Merging-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/MoE-Merging-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/MoE-Merging-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/MoE-Merging-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/MoE-Merging-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/MoE-Merging-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/MoE-Merging-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/MoE-Merging-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/MoE-Merging-GGUF:Q2_K
- LM Studio
- Jan
- Ollama
How to use tensorblock/MoE-Merging-GGUF with Ollama:
ollama run hf.co/tensorblock/MoE-Merging-GGUF:Q2_K
- Unsloth Studio
How to use tensorblock/MoE-Merging-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/MoE-Merging-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/MoE-Merging-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/MoE-Merging-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/MoE-Merging-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/MoE-Merging-GGUF:Q2_K
- Lemonade
How to use tensorblock/MoE-Merging-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/MoE-Merging-GGUF:Q2_K
Run and chat with the model
lemonade run user.MoE-Merging-GGUF-Q2_K
List all available models
lemonade list
Remove .gguf files (keep Q2_K.gguf)
Browse files- MoE-Merging-Q3_K_L.gguf +0 -3
- MoE-Merging-Q3_K_M.gguf +0 -3
- MoE-Merging-Q3_K_S.gguf +0 -3
- MoE-Merging-Q4_0.gguf +0 -3
- MoE-Merging-Q4_K_M.gguf +0 -3
- MoE-Merging-Q4_K_S.gguf +0 -3
- MoE-Merging-Q5_0.gguf +0 -3
- MoE-Merging-Q5_K_M.gguf +0 -3
- MoE-Merging-Q5_K_S.gguf +0 -3
- MoE-Merging-Q6_K.gguf +0 -3
- MoE-Merging-Q8_0.gguf +0 -3
MoE-Merging-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:289b9e7f4443ad17dc31dd996bf255d511a890c1058d1ab60189ef9da8c84790
|
| 3 |
-
size 12544083936
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q3_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:a83b0a93317c50bf61172fca42e7d32d632988f36527be1916684e8a1da4bfbd
|
| 3 |
-
size 11580442592
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:034bc32e03cbcdc3457d0c84b253bd4d504fac2d54d051fdb4ccef22bef7488b
|
| 3 |
-
size 10433300448
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:4c145afdfd9cd665952daf988f19baf629e00712796abfeb6269f2b880c348ea
|
| 3 |
-
size 13623699424
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:bf7e6db346c27e5caa0f5e94115135a6c8bc23dffcfbcab9e3e755b056b53919
|
| 3 |
-
size 14609885152
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:4f455ac6911d64606ccde4208af4c83abac02fe9e88c36242809f00e47178971
|
| 3 |
-
size 13743237088
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:ee4e2432d1872dbd40b727072c63478e41aed2ce55836fc7fe071ebb828e8923
|
| 3 |
-
size 16626427872
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:cd3c008ccea36ea8a602c7b0b096c654c004a892f92b5b80df3865fc6b769ab6
|
| 3 |
-
size 17134462944
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:14232f8f70af9bd1ff5842ab7fd6dcb95762ead4759f599317ad5a4d6bd5d348
|
| 3 |
-
size 16626427872
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:9bc92d1dee5260b8027d8a796dfdecf158da1b3d0c7f4fbbce19c6bfa32574d5
|
| 3 |
-
size 19816826848
|
|
|
|
|
|
|
|
|
|
|
|
MoE-Merging-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:5a88acdbe0596a197faeaec0c9f790735c09c70c806f60d98112f04fdcc9fb24
|
| 3 |
-
size 25666357216
|
|
|
|
|
|
|
|
|
|
|
|