mxbai-embed-xsmall-v1
Explore mxbai-embed-xsmall-v1, Mixedbread AI's smallest and most efficient English embedding model optimized for retrieval. Discover its competitive performance, long context support and capabilities in resource-constrained applications.
Model Description
Mxbai-embed-xsmall-v1 is Mixedbread AI's smallest and most efficient English embedding model, specifically optimized for retrieval tasks. Despite its compact size with only 22.7 million parameters and 384 dimensions, it delivers competitive performance, making it an ideal choice for applications where computational resources are limited. It is licensed under Apache 2.0.
The model is based on all-MiniLM-L6-v2 and was fine-tuned using the AnglE loss function and Espresso to enhance its capabilities for generating high-quality embeddings, particularly for retrieval scenarios like search, recommendation systems, and Retrieval-Augmented Generation (RAG).
On the Massive Text Embedding Benchmark (MTEB), mxbai-embed-xsmall-v1 shows improved performance over its base model on average (42.80 vs 41.56) across retrieval tasks. It also demonstrates significant gains on long context benchmarks like LoCo (avg. 76.34 vs 67.34) and LongEmb (avg. 45.94 vs 36.10) compared to all-MiniLM-L6-v2
. The small size translates to faster inference, lower resource consumption, and cost-effectiveness, especially beneficial for edge devices or large-scale deployments.
Compare with other models
Model | Context Window | Dimensions | Input Price (/1M tokens) |
---|---|---|---|
mxbai embed xsmall v1 | 4.1K | 384 | $0.00 |
mxbai Embed Large v1 | 512 | 1024 | $0.00 |
deepset mxbai embed german large v1 | 512 | 1024 | $0.00 |
mxbai embed 2d large v1 | 512 | 1024 | $0.00 |
mxbai colbert large v1 | 512 | 1024 | $0.00 |
Calculate Sentence Similarities
The following code illustrates how to compute similarities between sentences using the cosine similarity score function.
Last updated: May 6, 2025