A focused document reader that takes in images and extracts text from them. It handles a 32K context window, suggesting it can process lengthy documents or multiple pages in a single pass. As an open-weight model released under MIT license, it's transparent and freely adaptable, though its capabilities are narrowly scoped around optical character recognition rather than general-purpose reasoning.