A benchmark methodology using a fixed document collection, queries, and human relevance judgments to evaluate retrieval systems.