AWQ 4-bit produces repetitive gibberish on long outputs with vLLM v0.15.1 β root cause identified
#5 opened 4 months ago
by
BigBlueWhale
Error: Cannot set `add_generation_prompt`
1
#4 opened 5 months ago
by
SlavikF
Fast Start - Docker Compose
ππ 2
1
#3 opened 6 months ago
by
Bellesteck
Loading Mistral Models in vLLM
1
#2 opened 6 months ago
by
BuiDoan
Thanks
3
#1 opened 6 months ago
by
Max-and-Omnis