PolarQuant Gemma Models Collection Google Gemma models quantized with PolarQuant (Hadamard + Lloyd-Max Q5 weights + Q3 KV cache). Full-stack compression for consumer GPU inference. • 4 items • Updated 3 days ago • 3
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated 8 days ago • 20