A question-answering evaluation framework that tests whether models can retrieve factual information about specific entities.
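
As a rough illustration of what such an evaluation might look like, the sketch below scores a model on entity-centric questions using normalized exact match. The source does not specify the framework's metric, data format, or API, so everything here is an assumption: the `normalize` and `evaluate` functions, the `(question, accepted_answers)` example format, and the toy model are all hypothetical, and exact match is simply one common choice for this kind of task.

```python
import string
from typing import Callable, Iterable, List, Tuple

# Hypothetical example format: (question, list of accepted answers).
Example = Tuple[str, List[str]]


def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())


def evaluate(model: Callable[[str], str], examples: Iterable[Example]) -> float:
    """Return the fraction of questions whose prediction matches any
    accepted answer after normalization (exact-match accuracy)."""
    examples = list(examples)
    if not examples:
        return 0.0
    correct = 0
    for question, answers in examples:
        prediction = normalize(model(question))
        if any(normalize(answer) == prediction for answer in answers):
            correct += 1
    return correct / len(examples)


# Toy data for illustration only; not taken from the framework.
EXAMPLES: List[Example] = [
    ("Where was Marie Curie born?", ["Warsaw"]),
    ("Who wrote Pride and Prejudice?", ["Jane Austen"]),
]


def toy_model(question: str) -> str:
    """Stand-in for a real model: knows one fact, guesses otherwise."""
    lookup = {"Where was Marie Curie born?": "warsaw."}
    return lookup.get(question, "unknown")


accuracy = evaluate(toy_model, EXAMPLES)
print(f"exact-match accuracy: {accuracy:.2f}")
```

Any callable that maps a question string to an answer string can be passed as `model`, so a real system would only need a thin wrapper around its inference call.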