The gradient of prediction uncertainty with respect to visual embeddings, used to identify ambiguous regions.