A mid-sized multimodal model from Google's Gemma family that handles both text and image inputs. It operates at the 26B parameter scale in a quantized form (Q4_0 IQ4 NL), making it more accessible for local deployment while trading some precision for reduced memory footprint. Expect solid general-purpose reasoning and vision understanding within the constraints of aggressive quantization.