Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/INT8 Quantization

INT8 Quantization

deployment

A compression technique that reduces a model's precision from full floating-point numbers to 8-bit integers, making it faster and smaller with minimal accuracy loss.

INT8 Quantization — Glossary — ThinkLLM