Pushing the Limits of LLM Quantization via the Linearity Theorem

(arxiv.org)

73 points | by felineflock 16 hours ago

2 comments