Hi! Thanks for uploading this quant. In the readme for ERNIE 4.5 it says the model is trained with 4/2 bit awareness. Even stating it would lead to lossless quantization. Could you therefore also make a 2 bit quant of this model? Thanks in advance!
Β· Sign up or log in to comment