Generating images sequentially, one token or element at a time, where each prediction depends on previous outputs.