A reward signal that measures quality by having an LLM recover the original task specification from generated outputs.