A compact multimodal model that accepts text and image inputs and produces text output. It belongs to Google's efficiency-focused Gemma family, and its open weights are released under Apache 2.0, so it can be freely inspected and deployed. The IQ4_0 quantization variant stores weights at reduced precision to shrink the memory footprint, so output quality may be somewhat lower than that of the full-precision version.
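
To give a rough sense of the memory trade-off described above, here is a minimal back-of-the-envelope sketch. The parameter count is hypothetical and chosen only for illustration, since the description does not state the model size. The figure of about 4.5 bits per weight is also an assumption, borrowed from common 4-bit block-quantized layouts (4-bit values plus a per-block scale); the actual IQ4_0 storage cost may differ. The estimate covers weights only and ignores activations and cache.

```python
def weight_memory_gib(n_params: int, bits_per_weight: float) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Hypothetical parameter count, for illustration only (not from the model card).
N_PARAMS = 4_000_000_000

# 16 bits per weight stands in for a half-precision baseline.
fp16_gib = weight_memory_gib(N_PARAMS, 16)

# Assumed ~4.5 bits per weight for a 4-bit block-quantized format.
q4_gib = weight_memory_gib(N_PARAMS, 4.5)

print(f"16-bit weights: {fp16_gib:.2f} GiB")
print(f"4-bit weights:  {q4_gib:.2f} GiB")
print(f"size ratio:     {q4_gib / fp16_gib:.2f}")
```

Under these assumptions the quantized weights take a bit under a third of the half-precision size, which is the saving that is traded against possible quality loss.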