By rotating activations into a normalized basis before quantization, OrbitQuant eliminates the need to recalibrate for different inputs, timesteps, or models, enabling practical low-bit quantization of diffusion transformers without per-checkpoint tuning.
OrbitQuant is a quantization method for diffusion transformers that requires no recalibration across inputs or models. It rotates and normalizes activations so that they stay consistent across timesteps and prompts, allowing a single quantization scheme to be reused throughout.
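To make the rotate-then-quantize idea concrete, here is a minimal NumPy sketch. It is not OrbitQuant's actual implementation: the summary above does not specify the rotation, the normalization, or the quantizer, so every detail is an assumption. The sketch assumes a random orthogonal rotation (built by QR decomposition), per-token L2 normalization, and a fixed symmetric uniform grid clipped at three standard deviations. The names `random_orthogonal`, `quantize_rotated`, and `dequantize_rotated` are hypothetical.

```python
import numpy as np


def random_orthogonal(dim, seed=0):
    """Return a random orthogonal matrix (assumed stand-in for the method's rotation)."""
    rng = np.random.default_rng(seed)
    q, r = np.linalg.qr(rng.standard_normal((dim, dim)))
    # Flip column signs so the result does not depend on QR sign conventions.
    return q * np.sign(np.diag(r))


def _step_size(dim, bits):
    # A rotated unit vector has coordinates with variance about 1/dim,
    # so a fixed clip of 3/sqrt(dim) needs no data-dependent calibration.
    clip = 3.0 / np.sqrt(dim)
    levels = 2 ** (bits - 1) - 1
    return clip / levels, levels


def quantize_rotated(x, rotation, bits=4):
    """Normalize each row of x, rotate it, and round onto a fixed uniform grid."""
    norms = np.maximum(np.linalg.norm(x, axis=-1, keepdims=True), 1e-12)
    z = (x / norms) @ rotation
    step, levels = _step_size(x.shape[-1], bits)
    codes = np.clip(np.round(z / step), -levels, levels).astype(np.int8)
    return codes, norms


def dequantize_rotated(codes, norms, rotation, bits=4):
    """Invert the grid, the rotation, and the normalization."""
    step, _ = _step_size(rotation.shape[0], bits)
    return ((codes * step) @ rotation.T) * norms
```

Because the grid depends only on the hidden dimension and the bit width, the same `quantize_rotated` call can be applied to activations from any timestep or prompt in this sketch, which is the property the summary attributes to the method. Whether the real method stores per-token norms, or which family of rotations it uses, is not stated in the source.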