Mixedbread

mxbai-colbert-large-v1

Parameters 335M
Context Window 512
Price / 1M tokens -
Languages EN

Model Description

is a state-of-the-art ColBERT (Contextualized Late Interaction BERT) model for reranking and retrieval tasks. It is based on the model and achieves state-of-the-art performance on 13 publicly available BEIR benchmarks.

ColBERT combines the benefits of vector search and cross-encoders. Queries and documents are encoded separately, but instead of creating a single embedding for the entire document, ColBERT generates contextualized embeddings for each token in the document. During search, the token-level query embeddings are compared with the token-level embeddings of the documents using the lightweight scoring function MaxSim. This allows ColBERT to capture nuanced matching signals while being computationally efficient.

mxbai-colbert-large-v1 is initialized from the model, which was trained on over 700 million samples from various domains. The ColBERT model was then fine-tuned on around 96 million samples to adapt it to the late interaction mechanism. This extensive training enables the model to be used for a wide range of tasks and domains.

On the BEIR benchmark, mxbai-colbert-large-v1 outperforms other ColBERT models on average and directly in most tasks. Its exceptionally high reranking score even surpasses typical scores for cross-encoder based reranker models on the benchmark, despite the advantages of the ColBERT architecture regarding resource efficiency. The model also demonstrates state-of-the-art retrieval performance when compared to other currently available ColBERT models.

Compare with other models

ModelContext WindowDimensionsPrice / 1M tokens
mxbai-colbert-large-v1512 1024-
mxbai-embed-large-v1512 1024$0.10
deepset-mxbai-embed-de-large-v1512 1024$0.10
mxbai-embed-2d-large-v1512 1024$0.10
mxbai-embed-xsmall-v14.1K 384-

Last updated: July 15, 2025

On this page