A compression technique that reduces a model's precision from full floating-point numbers to 8-bit integers, making it faster and smaller with minimal accuracy loss.