

ThinkLLM



Int4/Int8 Mixed Quantization

deployment

A quantization strategy that stores some of a model's weights as 4-bit integers and the rest as 8-bit integers, trading the larger memory savings of 4-bit storage against the better accuracy of 8-bit storage.
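
To make the trade-off concrete, here is a minimal sketch in plain Python. It rests on several assumptions that the definition above does not specify: symmetric per-tensor scaling, a caller-supplied set of layer names that should stay at 8 bits, and toy weight lists. All function and layer names (`quantize`, `mixed_quantize`, `"attention"`, `"mlp"`) are hypothetical and do not correspond to any particular library.

```python
def quantize(weights, bits):
    """Symmetric per-tensor quantization to signed `bits`-bit integers (assumed scheme)."""
    qmax = 2 ** (bits - 1) - 1                 # 7 for int4, 127 for int8
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level and clamp to the representable range.
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Map integer levels back to approximate floating-point weights."""
    return [v * scale for v in q]


def mixed_quantize(layers, int8_layers):
    """Quantize layers named in `int8_layers` to 8 bits and all others to 4 bits."""
    out = {}
    for name, weights in layers.items():
        bits = 8 if name in int8_layers else 4
        q, scale = quantize(weights, bits)
        out[name] = {"bits": bits, "q": q, "scale": scale}
    return out


def total_bits(quantized):
    """Storage for the integer weights only (scales are ignored here)."""
    return sum(entry["bits"] * len(entry["q"]) for entry in quantized.values())


def max_abs_error(weights, entry):
    """Largest absolute difference between original and dequantized weights."""
    restored = dequantize(entry["q"], entry["scale"])
    return max(abs(a - b) for a, b in zip(weights, restored))


# Toy example: identical weights in both layers, so only the bit width differs.
layers = {
    "attention": [0.12, -0.5, 0.33, 0.9],   # hypothetical layer kept at int8
    "mlp": [0.12, -0.5, 0.33, 0.9],         # hypothetical layer stored as int4
}
quantized = mixed_quantize(layers, int8_layers={"attention"})

print(total_bits(quantized))  # 8*4 + 4*4 = 48 bits, versus 64 for all-int8
for name, weights in layers.items():
    print(name, quantized[name]["bits"], max_abs_error(weights, quantized[name]))
```

In this sketch the 4-bit layer takes half the storage of the 8-bit layer but reconstructs its weights less precisely, which is the balance the definition describes. How a real system decides which weights get which precision varies; the name-based rule here is only a placeholder for that decision.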
