
Supported Metadata Types

Metadata is structured information about your files that enables powerful filtering, organization, and search capabilities. This page covers what types of metadata you can store and how to structure them effectively.

Understanding Metadata

  • File-Level: Metadata attached to entire files
  • Inheritance: Chunks inherit metadata from their parent file
  • Filtering: Use metadata to narrow search results
  • Content Organization: Structure content for easier discovery
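
The inheritance rule above can be pictured as each chunk carrying a copy of its parent file's metadata. The sketch below is a local illustration only: `Chunk` and `split_into_chunks` are hypothetical names, and the fixed-size splitting is an assumption, not how the service actually chunks files.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class Chunk:
    text: str
    metadata: dict[str, Any] = field(default_factory=dict)


def split_into_chunks(text: str, file_metadata: dict[str, Any], size: int = 20) -> list[Chunk]:
    """Split text into fixed-size chunks, each carrying a copy of the file metadata."""
    return [
        Chunk(text=text[i:i + size], metadata=dict(file_metadata))
        for i in range(0, len(text), size)
    ]


# Hypothetical file-level metadata, attached once to the whole file.
file_metadata = {"category": "documentation", "status": "published"}
chunks = split_into_chunks("Metadata is attached to files, not to chunks.", file_metadata)

# Because chunks inherit the metadata, a file-level filter selects chunks too.
published = [c for c in chunks if c.metadata["status"] == "published"]
```

Under this model a filter such as `status == "published"` never needs per-chunk bookkeeping: setting the value once on the file is enough.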

Metadata Types

String Values

{
  "category": "documentation",
  "author": "john.doe",
  "department": "engineering",
  "status": "published"
}

Best For: Categories, names, tags, status values

Filtering: Equality, inequality, pattern match (like, not like)
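
To make the three kinds of string condition concrete, here is a minimal local sketch of how they could be evaluated. It assumes that `like` follows SQL-style wildcards (`%` for any run of characters, `_` for a single character) and is case-sensitive; the page does not specify this, and the operator names `eq`, `not_eq`, `like`, and `not_like` are illustrative rather than taken from the API.

```python
import re


def like_to_regex(pattern: str) -> re.Pattern[str]:
    """Translate an SQL-style LIKE pattern (% = any run, _ = one char) into a regex."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)


def match_string(value: str, op: str, operand: str) -> bool:
    """Evaluate one string condition; the operator names are hypothetical."""
    if op == "eq":
        return value == operand
    if op == "not_eq":
        return value != operand
    if op == "like":
        return like_to_regex(operand).fullmatch(value) is not None
    if op == "not_like":
        return like_to_regex(operand).fullmatch(value) is None
    raise ValueError(f"unsupported operator: {op}")


metadata = {"category": "documentation", "author": "john.doe"}
print(match_string(metadata["category"], "eq", "documentation"))  # True
print(match_string(metadata["author"], "like", "john.%"))         # True
print(match_string(metadata["author"], "not_like", "%.smith"))    # True
```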

Numeric Values

{
  "priority": 8,
  "version": 2.1,
  "page_count": 45,
  "file_size": 1048576
}

Best For: Scores, versions, counts, measurements

Filtering: Comparisons (>, <, >=, <=)
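
The four comparison operators can be sketched locally as follows. This is an illustration of the semantics under simple assumptions, not the service's filter syntax; `compare` and the sample `files` list are hypothetical.

```python
import operator
from typing import Callable, Union

Number = Union[int, float]

# Map each operator symbol listed above to its Python equivalent.
COMPARATORS: dict[str, Callable[[Number, Number], bool]] = {
    ">": operator.gt,
    "<": operator.lt,
    ">=": operator.ge,
    "<=": operator.le,
}


def compare(value: Number, op: str, threshold: Number) -> bool:
    """Apply one of the four comparison operators to a numeric metadata value."""
    return COMPARATORS[op](value, threshold)


files = [
    {"name": "a.pdf", "priority": 8, "page_count": 45},
    {"name": "b.pdf", "priority": 3, "page_count": 120},
    {"name": "c.pdf", "priority": 9, "page_count": 12},
]

# Keep files with priority >= 8 and fewer than 50 pages.
selected = [
    f["name"]
    for f in files
    if compare(f["priority"], ">=", 8) and compare(f["page_count"], "<", 50)
]
print(selected)  # ['a.pdf', 'c.pdf']
```

Storing these values as JSON numbers rather than strings matters for comparisons: as strings, `"10"` sorts before `"9"`.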

Boolean Values

{
  "is_public": true,
  "requires_auth": false,
  "is_deprecated": false,
  "featured": true
}

Best For: Flags, permissions, feature toggles

Filtering: True/false conditions
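
A true/false filter only behaves predictably when the stored value is a real JSON boolean, not the string `"false"` or the number `1`. The sketch below is one possible pre-upload check; `non_boolean_flags` is a hypothetical helper, and whether the service coerces such values is not stated on this page.

```python
import json


def non_boolean_flags(metadata: dict, flag_keys: list[str]) -> list[str]:
    """Return the flag keys whose values are missing or not real booleans."""
    return [k for k in flag_keys if not isinstance(metadata.get(k), bool)]


# Two of these three flags look boolean but are not.
raw = '{"is_public": true, "requires_auth": "false", "featured": 1}'
metadata = json.loads(raw)

bad = non_boolean_flags(metadata, ["is_public", "requires_auth", "featured"])
print(bad)  # ['requires_auth', 'featured']
```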

Date/Time Values

{
  "created_at": "2024-01-15T10:30:00Z",
  "last_updated": "2024-01-20",
  "expires_on": "2024-12-31",
  "published_date": "2024-01-15"
}

Best For: Timestamps, deadlines, publishing dates

Filtering: Comparisons (>, <, >=, <=)

Format: ISO 8601 recommended
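
The sample above mixes full timestamps with date-only values. Both are valid ISO 8601, but comparing them locally requires normalizing to one representation first. The following sketch shows one way to do that with the Python standard library; treating values without an offset as UTC is an assumption made here for illustration, and `parse_iso8601` is a hypothetical helper.

```python
from datetime import datetime, timezone


def parse_iso8601(value: str) -> datetime:
    """Parse an ISO 8601 date or timestamp into a timezone-aware UTC datetime."""
    # Older Python versions reject a trailing "Z", so rewrite it as an offset.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    parsed = datetime.fromisoformat(value)
    # Assumption: values without an offset (including date-only ones) are UTC.
    if parsed.tzinfo is None:
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


metadata = {
    "created_at": "2024-01-15T10:30:00Z",
    "last_updated": "2024-01-20",
    "expires_on": "2024-12-31",
}

cutoff = parse_iso8601("2024-01-16")
updated_after_cutoff = parse_iso8601(metadata["last_updated"]) >= cutoff
created_after_cutoff = parse_iso8601(metadata["created_at"]) >= cutoff
print(updated_after_cutoff, created_after_cutoff)  # True False
```

Choosing a single format for every date field (for example, always a full UTC timestamp) avoids this normalization step entirely.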

Array/List Values

{
  "tags": ["python", "web", "api"],
  "authors": ["alice", "bob"],
  "categories": ["tutorial", "guide"],
  "languages": ["en", "es"]
}

Best For: Multiple values, collections
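
This page does not list filter operators for arrays. As a purely local illustration of how multi-valued fields are typically queried, the sketch below assumes "contains any" and "contains all" semantics; both function names are hypothetical and may not correspond to anything in the API.

```python
from typing import Iterable


def contains_any(values: Iterable[str], wanted: Iterable[str]) -> bool:
    """True if the array shares at least one element with `wanted`."""
    return not set(values).isdisjoint(wanted)


def contains_all(values: Iterable[str], wanted: Iterable[str]) -> bool:
    """True if every element of `wanted` appears in the array."""
    return set(wanted).issubset(values)


metadata = {"tags": ["python", "web", "api"], "languages": ["en", "es"]}
print(contains_any(metadata["tags"], ["api", "cli"]))     # True
print(contains_all(metadata["languages"], ["en", "fr"]))  # False
```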

Last updated: January 6, 2026