A multimodal model that handles both text and image inputs, producing text output. As an FP8 Static quantized variant of Gemma 4 26B, it trades some numerical precision for reduced memory footprint and faster inference, which can occasionally affect output quality on nuanced tasks. Published by cloud19 as an open-weight release under Apache 2.0.