A compact mixture-of-experts reasoning model from NVIDIA that activates only 3 billion of its 30 billion parameters per token, keeping per-token compute low while tackling structured reasoning tasks. It runs in FP8 precision, trading a small amount of numerical fidelity for a smaller memory footprint and higher throughput on hardware with native FP8 support.
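
To make the two headline figures concrete, here is a rough back-of-the-envelope sketch of what they imply. The parameter counts come from the description above. Everything else is an assumption for illustration: the helper names are hypothetical, every weight is assumed to be stored at one byte in FP8 (real checkpoints may keep some layers in higher precision), and activations and the KV cache are ignored.

```python
# Rough arithmetic only; not a measurement of the actual model.
TOTAL_PARAMS = 30e9   # total parameters (from the description)
ACTIVE_PARAMS = 3e9   # parameters activated at a time (from the description)

# Storage width per weight. Assumes uniform precision across all weights,
# which is a simplification.
BYTES_PER_PARAM = {"fp8": 1, "bf16": 2}


def active_fraction(active: float, total: float) -> float:
    """Share of the weights that take part in a single pass."""
    return active / total


def weight_memory_gb(params: float, fmt: str) -> float:
    """Approximate weight storage in GB (10^9 bytes), weights only."""
    return params * BYTES_PER_PARAM[fmt] / 1e9


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"active fraction: {frac:.0%}")
    for fmt in BYTES_PER_PARAM:
        print(f"{fmt}: ~{weight_memory_gb(TOTAL_PARAMS, fmt):.0f} GB of weights")
```

Under these assumptions, about a tenth of the weights are exercised at a time, yet all 30 billion still have to be resident in memory. That is why the byte width per weight matters for deployment as well as for speed.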