A reward signal derived from visual question answering that uses language-vision reasoning to evaluate image quality.