Search
Reranking
Reranking is an advanced feature that improves the quality and relevance of search results by using specialized models to re-evaluate and reorder the initial results. This second-pass approach significantly enhances result accuracy at the cost of slightly increased latency.
Basic Configuration
The simplest way to enable reranking is to set rerank=True
in your search request:
This uses our default reranking model (mixedbread-ai/mxbai-rerank-large-v2
) with standard settings to improve your search results.
Advanced Configuration
Configuration Options:
- Reranking Models: Use any supported reranking model from our inference endpoint. See Inference Models for available models and their capabilities. Defaults to
"mixedbread-ai/mxbai-rerank-large-v2"
when not specified. - Top-k Constraint: The
top_k
value must be smaller than or equal to the first-stage retrieval count (your maintop_k
parameter). Defaults tonull
, which returns all results from the first stage (respects your maintop_k
parameter). - Metadata Inclusion: Defaults to
false
. Set totrue
to include all metadata, or provide a list of specific metadata keys to include.
Question Answering
Get AI-powered answers to your questions using Vector Store content as context, with optional citations and multimodal support.
Metadata Filtering
Learn how to filter Vector Store files and search results using powerful metadata queries with logical and comparison operators.
Last updated: August 29, 2025