Models: 148

Active filters: efficient-inference

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF

Image-Text-to-Text • 9B • Updated 6 days ago • 151k • 261

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash

Image-Text-to-Text • 10B • Updated May 2 • 3.09k • 39

owensong/Inflect-Nano-v1

Text-to-Speech • Updated 16 days ago • 215

nota-ai/Qwen3.5-4B-QAD-W4A16

Text Generation • 5B • Updated 4 days ago • 10 • 1

vhab10/llama_3.1_8b_Q4_K_M-gguf

Text Generation • 8B • Updated Oct 6, 2024 • 41

saytes/SoT_DistilBERT

Text Classification • 67M • Updated Mar 11, 2025 • 510 • 7

stiger1000/TC-MoE

Text Generation • 2B • Updated Jul 25, 2025 • 48 • 1

agentlans/Qwen3-4B-multilingual-sft-GGUF

Text Generation • 4B • Updated Jun 29, 2025 • 27

sudeshmu/fine_tune

Text Generation • Updated Aug 28, 2025 • 26 • 9

weathermanj/Nemotron-nano-9b-fp8

Text Generation • 9B • Updated Aug 29, 2025 • 14 • 6

jackal79/gpt2-ibce-lowrank-192

Text Generation • Updated Sep 19, 2025 • 10

huawei-csl/Qwen3-1.7B-3bit-SINQ

Text Generation • 0.5B • Updated Feb 2 • 7 • 7

huawei-csl/Qwen3-1.7B-3bit-ASINQ

Text Generation • 0.5B • Updated Feb 2 • 6 • 7

huawei-csl/Qwen3-14B-3bit-SINQ

Text Generation • 3B • Updated Feb 2 • 16 • 5

huawei-csl/Qwen3-14B-3bit-ASINQ

Text Generation • 3B • Updated Feb 2 • 6 • 5

huawei-csl/Qwen3-32B-3bit-SINQ

Text Generation • 6B • Updated Feb 2 • 6 • 6

huawei-csl/Qwen3-32B-3bit-ASINQ

Text Generation • 6B • Updated Feb 2 • 4 • 5

huawei-csl/Qwen3-1.7B-4bit-SINQ

Text Generation • 1B • Updated Feb 2 • 6 • 5

huawei-csl/Qwen3-1.7B-4bit-ASINQ

Text Generation • 1B • Updated Feb 2 • 7 • 5

huawei-csl/Qwen3-32B-4bit-SINQ

Text Generation • 18B • Updated Feb 2 • 7 • 7

huawei-csl/Qwen3-14B-4bit-SINQ

Text Generation • 9B • Updated Feb 2 • 6 • 5

huawei-csl/Qwen3-14B-4bit-ASINQ

Text Generation • 9B • Updated Feb 2 • 10 • 6

huawei-csl/Qwen3-32B-4bit-ASINQ

Text Generation • 18B • Updated Feb 2 • 5 • 8

huawei-csl/Qwen3-235B-A22B-3bit-SINQ

Text Generation • Updated Feb 2 • 6 • 2

huawei-csl/Apertus-8B-2509-4bit-SINQ

Text Generation • 5B • Updated Feb 2 • 1 • 2

huawei-csl/Apertus-8B-2509-4bit-ASINQ

Text Generation • 5B • Updated Feb 2 • 35 • 3

huawei-csl/Kimi-Linear-48B-A3B-Instruct-4bit-SINQ

Text Generation • 27B • Updated Feb 2 • 4 • 3

huawei-csl/Qwen3-Next-80B-A3B-Instruct-4bit-SINQ

Text Generation • Updated Feb 2 • 17 • 2

huawei-csl/Kimi-Linear-48B-A3B-Instruct-3bit-SINQ

Text Generation • 7B • Updated Feb 2 • 12 • 1

huawei-csl/Qwen3-Next-80B-A3B-Instruct-3bit-SINQ

Text Generation • Updated Feb 2 • 84 • 2