Haystack
Enhance your Haystack NLP pipelines with Mixedbread's comprehensive AI platform. Build production-ready search and retrieval workflows using enterprise-grade semantic understanding, multimodal document processing, and state-of-the-art embedding models designed for Haystack's modular architecture.
Quick Start
- Install the package:
- Save your API key to an environment variable:
Components
This integration provides five production-ready Haystack components for enterprise NLP workflows:
TextEmbedder
: Generate semantic embeddings for queries using state-of-the-art models, optimized for Haystack's pipeline architectureDocumentEmbedder
: Transform document collections into dense vector representations with batch processing capabilities for large-scale workflowsReranker
: Improve retrieval precision by semantically reordering candidate documents within your Haystack pipelinesDocumentParser
: Extract structured content from multimodal documents (PDF, PPTX, HTML) with layout-aware intelligence, outputting Haystack Document objectsVectorStoreRetriever
: Connect to Mixedbread's managed vector databases as a retrieval component in your Haystack search pipelines
Text Embedder
Generate embeddings for text queries:
Document Embedder
Generate embeddings for documents:
Document Parser
Parse documents from various formats:
Reranker
Rerank documents based on query relevance:
Vector Store Retriever
Retrieve documents from Mixedbread vector stores:
Pipeline Examples
RAG Pipeline with vector store retriever
Need Help?
- Join our Discord community
- Contact support
Happy baking with Mixedbread and Haystack!
Langchain
Transform your LangChain applications with Mixedbread's AI-native search and semantic intelligence. This guide covers seamless integration with LangChain's retrieval ecosystem, chain composition with LCEL, and advanced RAG workflows using state-of-the-art embedding and reranking models.
SDKs
Access Mixedbread's API with our easy-to-use SDKs for Python and TypeScript.
Last updated: June 28, 2025