Mixedbread

mxbai-embed-xsmall-v1

Parameters 22.7M
Context Window 4.1K
Price / 1M tokens -
Languages EN

Model Description

Mxbai-embed-xsmall-v1 is Mixedbread AI's smallest and most efficient English embedding model, specifically optimized for retrieval tasks. Despite its compact size with only 22.7 million parameters and 384 dimensions, it delivers competitive performance, making it an ideal choice for applications where computational resources are limited. It is licensed under Apache 2.0.

The model is based on all-MiniLM-L6-v2 and was fine-tuned using the AnglE loss function and Espresso to enhance its capabilities for generating high-quality embeddings, particularly for retrieval scenarios like search, recommendation systems, and [Retrieval-Augmented Generation (RAG).

On the Massive Text Embedding Benchmark (MTEB), mxbai-embed-xsmall-v1 shows improved performance over its base model on average (42.80 vs 41.56) across retrieval tasks. It also demonstrates significant gains on long context benchmarks like LoCo (avg. 76.34 vs 67.34) and LongEmb (avg. 45.94 vs 36.10) compared to all-MiniLM-L6-v2. The small size translates to faster inference, lower resource consumption, and cost-effectiveness, especially beneficial for edge devices or large-scale deployments.

Compare with other models

ModelContext WindowDimensionsPrice / 1M tokens
mxbai-embed-xsmall-v14.1K 384-
mxbai-embed-large-v1512 1024$0.10
deepset-mxbai-embed-de-large-v1512 1024$0.10
mxbai-embed-2d-large-v1512 1024$0.10
mxbai-colbert-large-v1512 1024-

Last updated: July 14, 2025