
ThinkLLM



FP8 Static Quantization

deployment

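A quantization method that converts model weights (and, in many implementations, activations) to 8-bit floating-point (FP8) values using scaling factors that are fixed ahead of time, typically from a calibration pass, rather than recomputed at inference time. Storing weights in 8 bits roughly halves their size relative to 16-bit formats, but the reduced precision can lower accuracy on complex tasks.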
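
As a rough illustration of the definition above, the following NumPy sketch simulates static FP8 quantization of a weight tensor. It assumes the E4M3 variant of FP8 (4 exponent bits, 3 mantissa bits, largest finite value 448) and a single per-tensor scale derived from the maximum absolute weight. The function names (`round_to_e4m3`, `compute_static_scale`, `quantize_fp8_static`, `dequantize`) are hypothetical and chosen for this example; real inference stacks use hardware FP8 types and may pick scales differently.

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite magnitude in the assumed E4M3 format


def round_to_e4m3(x):
    """Simulate rounding to the nearest E4M3 value, saturating at +/-448."""
    x = np.clip(np.asarray(x, dtype=np.float32), -FP8_E4M3_MAX, FP8_E4M3_MAX)
    _, exp = np.frexp(np.abs(x))            # |x| = m * 2**exp, m in [0.5, 1)
    exp = np.maximum(exp - 1, -6)           # floor(log2|x|), clamped at min normal exponent
    step = np.ldexp(np.float32(1.0), exp - 3)  # spacing with 3 mantissa bits
    return (np.round(x / step) * step).astype(np.float32)


def compute_static_scale(weights):
    """Compute one fixed per-tensor scale, once, ahead of inference."""
    max_abs = float(np.max(np.abs(weights)))
    return max_abs / FP8_E4M3_MAX if max_abs > 0 else 1.0


def quantize_fp8_static(weights, scale):
    """Map weights into the FP8 range using the precomputed scale."""
    return round_to_e4m3(np.asarray(weights, dtype=np.float32) / np.float32(scale))


def dequantize(q, scale):
    """Recover approximate original values from simulated FP8 values."""
    return q * np.float32(scale)


# Usage: the scale is computed once and reused for every later call.
weights = np.array([0.013, -0.42, 0.27, 1.9, -0.0008], dtype=np.float32)
scale = compute_static_scale(weights)
restored = dequantize(quantize_fp8_static(weights, scale), scale)
max_error = float(np.max(np.abs(restored - weights)))
```

The key point is that `scale` is a stored constant: nothing about it depends on the data seen at inference time, which is what makes the scheme "static". The rounding error in `max_error` gives a small-scale view of where accuracy loss can come from.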
