A compact but capable member of Google's Gemma 4 family, this 12B parameter model runs in a quantization-aware trained (QAT) Q4_0 configuration, meaning it was trained to handle the precision loss of 4-bit quantization gracefully rather than having quantization applied as an afterthought. It handles text-in, text-out tasks with the efficiency you'd expect from a model designed to run on consumer hardware.