Zen Embedding 0.6B GGUF

A high-performance text embedding model from the Zen model family, optimized for efficient inference.

Downloads

  • HuggingFace: hf download zenlm/zen-embedding-0.6B-GGUF
  • Direct: https://download.hanzo.ai/llm-models/zen-embedding-0.6B-Q8_0.gguf

Features

  • 100+ language support
  • Optimized for semantic search and retrieval
  • GGUF format for efficient CPU/GPU inference
  • Q8_0 quantization (639 MB)
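Since the model targets semantic search and retrieval, a typical workflow is to embed a query and a set of documents, then rank documents by cosine similarity. Below is a minimal, model-free sketch of that ranking step; the toy vectors are hypothetical stand-ins for real embeddings produced by the model.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def rank(query_vec, doc_vecs):
    """Return document indices sorted by similarity to the query, best first."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)

# Toy 3-dimensional vectors (hypothetical; real embeddings are much larger).
query = [0.9, 0.1, 0.0]
docs = [[0.8, 0.2, 0.1], [0.0, 0.1, 0.9]]
print(rank(query, docs))  # the first document is closest to the query
```

In production the vectors would come from the model's embedding output, and the linear scan would usually be replaced by an approximate nearest-neighbor index for large corpora.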

Usage

Works with llama.cpp and compatible inference engines.
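A minimal sketch of running the model with llama.cpp's embedding tool; the model filename is assumed to match the Q8_0 file listed under Downloads, and the binary name follows current llama.cpp builds.

```shell
# Fetch the quantized model (download command from the Downloads section).
hf download zenlm/zen-embedding-0.6B-GGUF

# Embed a sentence; filename assumed from the Direct download link above.
llama-embedding \
  -m zen-embedding-0.6B-Q8_0.gguf \
  -p "Zen embeddings enable fast semantic search."
```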

License

Apache 2.0

Model details

  • Format: GGUF
  • Model size: 0.6B params
  • Architecture: qwen3
  • Quantization: 8-bit (Q8_0)

