A multimodal model that handles both text and image inputs, producing text output. As a quantized FP8 variant of Gemma 4 31B, it trades some numerical precision for reduced memory footprint and faster inference, making the 31B parameter scale more accessible on constrained hardware. It carries the open-weight Apache 2.0 license, so weights can be inspected, modified, and redistributed freely.