Data Models
Understanding the core data structures in Mixedbread Vector Stores helps you work effectively with the API and understand how your content is organized and retrieved.
Vector Store
A Vector Store is the primary container for your searchable content. It holds your files, manages access permissions, and provides the foundation for semantic search operations.
Vector Store Properties
Property | Type | Description |
---|---|---|
id | string | Unique identifier for the Vector Store |
name | string | User-defined name that serves as an identifier |
description | string | Optional description of the Vector Store's purpose |
is_public | boolean | Whether the Vector Store is publicly accessible |
metadata | object | Additional metadata associated with the Vector Store |
file_counts | object | Counts of files in different processing states |
expires_after | object | Expiration configuration based on activity |
status | enum | Current status: expired , in_progress , completed |
created_at | string | ISO timestamp when the Vector Store was created |
updated_at | string | ISO timestamp when the Vector Store was last updated |
last_active_at | string | ISO timestamp of the last activity |
usage_bytes | integer | Total storage space used by indexed content |
expires_at | string | Computed expiration timestamp (if expires_after is set) |
object | string | Always "vector_store" |
File Counts Object
The file_counts
object provides detailed breakdown of file processing states:
Property | Type | Description |
---|---|---|
pending | integer | Number of files waiting to be processed |
in_progress | integer | Number of files currently being processed |
cancelled | integer | Number of files whose processing was cancelled |
completed | integer | Number of successfully processed files |
failed | integer | Number of files that failed processing |
total | integer | Total number of files |
For detailed configuration options including expiration policies and public access, see Vector Store Configuration.
Vector Store Example
Vector Store File
A Vector Store File represents a complete file that you've uploaded to a Vector Store. It tracks the file's processing status, metadata, and relationship to the searchable chunks created from its content.
File Properties
Property | Type | Description |
---|---|---|
id | string | Unique identifier for the file within the Vector Store |
filename | string | Original name of the uploaded file |
metadata | object | Custom key-value pairs you've attached to the file |
status | enum | Current processing status of the file |
last_error | object | Details about any processing errors that occurred |
vector_store_id | string | ID of the Vector Store containing this file |
created_at | string | ISO timestamp when the file was added to the Vector Store |
version | integer | Version number of the file within the Vector Store |
usage_bytes | integer | Storage space used by the file's indexed data |
object | string | Always "vector_store.file" |
For detailed information on file processing lifecycle and status meanings, see Vector Store File Status.
For guidance on metadata structure and types, see Metadata Types.
File Example
Vector Store Chunk
A Vector Store Chunk represents a searchable segment of content created from a Vector Store File. When you search, you get back chunks that contain the most relevant portions of your files.
Chunk Properties
Property | Type | Description |
---|---|---|
chunk_index | integer | Position of this chunk within the source file |
mime_type | string | Content type of the chunk (text/plain, image/png, etc.) |
model | string | Model used to generate the chunk's vector |
score | number | Relevance score for this chunk (in search results) |
file_id | string | ID of the file this chunk came from |
filename | string | Name of the source file |
vector_store_id | string | ID of the Vector Store containing this chunk |
metadata | object | Metadata inherited from the source file |
type | enum | Type of content: text , image_url , audio_url , video_url |
Content-Specific Properties
Text Chunks
Property | Type | Description |
---|---|---|
text | string | Text content of the chunk |
Image Chunks
Property | Type | Description |
---|---|---|
image_url | object | Image URL and format information |
ocr_text | string | Text extracted from images via OCR |
summary | string | AI-generated summary of the image content |
Audio Chunks
Property | Type | Description |
---|---|---|
audio_url | object | Audio URL and format information |
transcription | string | Speech-to-text transcription of the audio |
summary | string | AI-generated summary of the audio content |
Video Chunks
Property | Type | Description |
---|---|---|
video_url | object | Video URL and format information |
transcription | string | Speech-to-text transcription of the video |
summary | string | AI-generated summary of the video content |
Chunk Types
Text Chunks
Image Chunks
Audio Chunks
Video Chunks
Complete Chunk Example
Next Steps
Now that you understand the data models, explore how they're used:
- Create Vector Stores: Work with Vector Store objects
- Ingest Files: Work with Vector Store Files
- Search: Work with Vector Store Chunks
- API Reference: Complete API details
Pro Tip: Understanding these data models helps you design better metadata schemas and write more effective search queries.
Last updated: July 14, 2025