A compact but capable text model that punches above its weight class through NVIDIA's FP4 quantization — trading a small amount of precision for significant gains in memory efficiency and inference speed. It handles general reasoning and instruction-following solidly, though its text-only input means it won't help with images or documents. The quantized format makes it particularly practical for deployment on consumer or edge hardware.