city96
/

umt5-xxl-encoder-gguf

Model card Files Files and versions

umt5-xxl-encoder-gguf / README.md

city96's picture

Update README.md

b535255 verified about 1 year ago

|

history blame contribute delete

775 Bytes

	---
	base_model: google/umt5-xxl
	library_name: gguf
	license: apache-2.0
	quantized_by: city96
	language: en
	---

	This is a GGUF conversion of [Google's UMT5 xxl model](https://huggingface.co/google/umt5-xxl), specifically the encoder part.

	The weights can be used with [`./llama-embedding`](https://github.com/ggerganov/llama.cpp/tree/master/examples/embedding) or with the [ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) custom node together with image/video generation models.

	This is a non imatrix quant as llama.cpp doesn't support imatrix creation for T5 models at the time of writing. It's therefore recommended to use Q5_K_M or larger for the best results, although smaller models may also still provide decent results in resource constrained scenarios.