The task of identifying where specific objects are located within an image and describing their positions.