Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

9,310

Full-text search

Active filters: dpo, trl

Shifusen/Qwen3-Next-80B-A3B-Instruct-Decensored

Text Generation • 80B • Updated 7 days ago • 35 • 2

HumanLLMs/Human-Like-Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 13, 2025 • 78 • 13

ConicCat/Role-mo-V2-7B

Text Generation • 7B • Updated about 23 hours ago • 28 • 1

mradermacher/Role-mo-V2-7B-GGUF

7B • Updated 4 days ago • 368 • 1

mradermacher/Role-mo-V2-7B-i1-GGUF

7B • Updated 4 days ago • 2.16k • 1

mradermacher/Qwen3-Next-80B-A3B-Instruct-Decensored-GGUF

80B • Updated 4 days ago • 2.09k • 1

wololoo/Llama-3.2-3B-TR-Instruct-DPO

Text Generation • 3B • Updated 1 day ago • 203 • 1

jazztoasty101/qwen3-md-natural

Text Generation • 2B • Updated about 19 hours ago • 21 • 1

lewtun/zephyr-7b-dpo-full

Text Generation • 7B • Updated Jan 5, 2024 • 16

alignment-handbook/zephyr-7b-dpo-full

Text Generation • 7B • Updated Jan 10, 2024 • 29 • 3

alignment-handbook/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 15 • 9

amirali1985/gpt-neo-125m_hh_reward

Text Generation • 0.1B • Updated Apr 27, 2024 • 5

lewtun/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 5

sambar/zephyr-7b-ipo-lora

Text Generation • Updated Jan 5, 2024 • 2

nlee282/moai-dpo-1.0

Updated Jan 5, 2024 • 4

nikkoyabut/merged_model_dpo

Updated Jan 5, 2024 • 2

sambar/zephyr-7b-ipo-lora-5ep

Text Generation • Updated Jan 6, 2024 • 5

alexredna/TinyLlama-1.1B-Chat-v1.0-reasoning-v2-dpo

Text Generation • 1B • Updated Jan 7, 2024 • 12 • 2

AlbelTec/mistral-dpo-old

Updated Jan 7, 2024

Yaxin1992/mixtral-dpo-1000

Updated Jan 9, 2024 • 1

adhi29/openhermes-mistral-dpo-gptq

Updated Jan 10, 2024

ybelkada/test-tags-model

Text Generation • 1.03M • Updated Jan 9, 2024 • 6

ybelkada/test-tags-model-2

Text Generation • 1.03M • Updated Jan 9, 2024 • 5

justinj92/dpoplatypus-phi2

Text Generation • 3B • Updated Jan 10, 2024

Belred/mistral-dpo

Updated Jan 9, 2024

lewtun/zephyr-7b-dpo-qlora-8e0975a

Updated Jan 10, 2024

mecoaoge2/results

Updated Jan 10, 2024 • 2

mecoaoge2/fununun

Updated Jan 10, 2024 • 2

akashkumarbtc/openhermes-mistral-dpo-gptq

Updated Jan 10, 2024

darshan8950/openhermes-mistral-dpo-gptq

Updated Jan 10, 2024