Mixedbread

Our Embedding Models

Here's an overview of our current model lineup:

Model CardContext LengthDimensionMTEB Average
512102464.68
5121024 (base)63.25 (base)
5121024-
409638442.80

Why Choose Mixedbread Embeddings

The Mixedbread embed family offers several advantages:

  1. Powerful Performance: State-of-the-art results on benchmarks
  2. Size Efficiency: Optimized for resource utilization
  3. Open-Source: Fully accessible and customizable
  4. Versatility: Suitable for various NLP tasks

Performance Comparison

Our new model outperforms other similarly sized open models and even surpasses some closed-source models on the MTEB benchmark:

ModelAvg (56 datasets)
mxbai-embed-large-v164.68
bge-large-en-v1.564.23
jina-embeddings-v2-base-en60.38
OpenAI text-embedding-3-large (Proprietary)64.58
Cohere embed-english-v3.0 (Proprietary)64.47

API Benefits

While you can use our open-source models directly, our API offers additional advantages:

  1. Enhanced Performance: API-exclusive versions offer improvements like better int8-quantization and take advantage of our optimized inference pipeline.
  2. Calibration Data: Generated using over 50 million samples for more accurate float32 to int8 mapping
  3. Faster Response Times: Optimized for low-latency retrieval tasks

Last updated: July 14, 2025