A neural retrieval model design that stores multiple token-level embeddings per document and uses late interaction to achieve higher retrieval accuracy than single-vector approaches.
World knowledge accuracy, recall of facts and relationships