cyankiwi
/

Devstral-Small-2-24B-Instruct-2512-AWQ-4bit

compressed-tensors

Model card Files Files and versions

Resources

View closed (0)

AWQ 4-bit produces repetitive gibberish on long outputs with vLLM v0.15.1 — root cause identified

#5 opened 4 months ago by

Error: Cannot set `add_generation_prompt`

#4 opened 5 months ago by

Fast Start - Docker Compose

#3 opened 6 months ago by

Loading Mistral Models in vLLM

#2 opened 6 months ago by

Thanks

#1 opened 6 months ago by