Upload Seeds

Upload documents and text content to create searchable seeds. The endpoint accepts PDF, Word, Excel, PowerPoint, image, Markdown, CSV, JSON, HTML, and plain-text files, as well as inline text. Content is processed automatically: text is extracted, split into chunks of roughly 300 tokens, and embedded with Jina Embeddings v4.

POST {baseAPIUrl}/seeds

Authentication

Required. Include your API key in the Authorization header:

Authorization: Bearer YOUR_API_KEY

Query Parameters

Parameter	Type	Required	Description
appId	string	Required	Application identifier
externalUserId	string	Required	External user identifier, used for data isolation
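
As a rough illustration of how the endpoint, the Authorization header, and the two required query parameters fit together, the following sketch assembles the request URL and headers without sending anything. The helper name `build_upload_request`, the host `https://api.example.com`, and the sample identifiers are placeholders for illustration, not values from this API.

```python
from urllib.parse import urlencode


def build_upload_request(base_api_url, api_key, app_id, external_user_id):
    """Return (url, headers) for POST {baseAPIUrl}/seeds."""
    # Both query parameters are documented as required.
    query = urlencode({"appId": app_id, "externalUserId": external_user_id})
    url = f"{base_api_url.rstrip('/')}/seeds?{query}"
    # The API key travels as a Bearer token in the Authorization header.
    headers = {"Authorization": f"Bearer {api_key}"}
    return url, headers


# Placeholder values; substitute your own base URL, key, and identifiers.
url, headers = build_upload_request(
    "https://api.example.com/", "YOUR_API_KEY", "my-app", "user-42"
)
print(url)
print(headers)
```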

Request Body

Content-Type: multipart/form-data

Field	Type	Description
files	File[]	Binary files (max 10 files, 50MB each, 100MB total)
fileTypes	string[]	Type for each file
fileSources	string[]	Origin for each file
fileTitles	string[]	Display name for each file
text	string[]	Text content to upload as seeds (max 50 items)
textTypes	string[]	Type for each text
textSources	string[]	Origin for each text
textTitles	string[]	Display name for each text
bundleIds	string[]	Bundle UUIDs to associate
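
The sketch below shows one way the form fields above might be assembled on the client. It assumes two things the reference does not state: that array fields are sent by repeating the field name once per item, and that the per-file and per-text arrays are matched to their items by position. The helper `build_form_parts` and the sample values are hypothetical.

```python
def build_form_parts(file_items=(), text_items=(), bundle_ids=()):
    """Return (data, files) lists for a multipart/form-data upload.

    file_items: dicts with keys "name", "content" (bytes), "type", "source", "title".
    text_items: dicts with keys "text", "type", "source", "title".
    """
    data = []   # plain form fields as (field_name, value) pairs
    files = []  # file parts as (field_name, (filename, content)) pairs

    for item in file_items:
        # Assumption: metadata arrays line up with files by position.
        files.append(("files", (item["name"], item["content"])))
        data.append(("fileTypes", item["type"]))
        data.append(("fileSources", item["source"]))
        data.append(("fileTitles", item["title"]))

    for item in text_items:
        data.append(("text", item["text"]))
        data.append(("textTypes", item["type"]))
        data.append(("textSources", item["source"]))
        data.append(("textTitles", item["title"]))

    for bundle_id in bundle_ids:
        data.append(("bundleIds", bundle_id))

    return data, files


data, files = build_form_parts(
    file_items=[{"name": "report.pdf", "content": b"%PDF-1.7 ...", "type": "pdf",
                 "source": "shared-drive", "title": "Q3 Report"}],
    text_items=[{"text": "# Notes\nKickoff is on Monday.", "type": "markdown",
                 "source": "manual-entry", "title": "Meeting notes"}],
)
```

With the third-party `requests` library, lists in this shape can be passed as the `data=` and `files=` arguments of `requests.post`, which then sets the multipart Content-Type header and boundary itself.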

Supported File Types

Use these values for the fileTypes array:

Type	Extensions	Description
pdf	.pdf	PDF documents
word	.doc, .docx	Microsoft Word documents
excel	.xls, .xlsx	Microsoft Excel spreadsheets
powerpoint	.ppt, .pptx	Microsoft PowerPoint presentations
image	.jpg, .jpeg, .png, .gif, .webp	Images (OCR + visual embedding)
markdown	.md	Markdown files
csv	.csv	Comma-separated values
json	.json	JSON files
html	.html, .htm	HTML documents
txt	.txt	Plain text files
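
A client might derive the fileTypes value from a filename's extension. The mapping below is copied from the table above; the function name `file_type_for` is hypothetical, and rejecting unknown extensions is a design choice of this sketch rather than documented server behavior.

```python
from pathlib import PurePath

# Extension -> fileTypes value, taken from the Supported File Types table.
FILE_TYPE_BY_EXTENSION = {
    ".pdf": "pdf",
    ".doc": "word", ".docx": "word",
    ".xls": "excel", ".xlsx": "excel",
    ".ppt": "powerpoint", ".pptx": "powerpoint",
    ".jpg": "image", ".jpeg": "image", ".png": "image",
    ".gif": "image", ".webp": "image",
    ".md": "markdown",
    ".csv": "csv",
    ".json": "json",
    ".html": "html", ".htm": "html",
    ".txt": "txt",
}


def file_type_for(filename):
    """Return the fileTypes value for a filename, or raise ValueError."""
    extension = PurePath(filename).suffix.lower()
    try:
        return FILE_TYPE_BY_EXTENSION[extension]
    except KeyError:
        raise ValueError(f"unsupported file extension: {extension!r}") from None


print(file_type_for("Quarterly Report.PDF"))
print(file_type_for("slides.pptx"))
```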

Supported Text Types

Use these values for the textTypes array:

Type	Description
markdown	Markdown-formatted text content
claude_chat	Exported Claude conversation history
gpt_chat	Exported ChatGPT conversation history
plain	Plain text content
html	HTML content (will be converted to markdown)
code	Source code snippets

Upload Limits

Max files per request: 10
Max total size: 100MB per request
Max file size: 50MB per individual file
Max text items: 50 per request
Chunk size: ~300 tokens with a 50-token overlap
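
Checking these limits on the client before uploading can avoid a rejected request. The following is a minimal sketch; it assumes "MB" means 1024 * 1024 bytes, which the reference does not specify, and the function name `check_upload_limits` is hypothetical.

```python
MB = 1024 * 1024  # assumption: the documented MB is 2**20 bytes

MAX_FILES = 10
MAX_TOTAL_BYTES = 100 * MB
MAX_FILE_BYTES = 50 * MB
MAX_TEXT_ITEMS = 50


def check_upload_limits(file_sizes, text_count):
    """Return a list of limit violations; an empty list means the request fits."""
    problems = []
    if len(file_sizes) > MAX_FILES:
        problems.append(f"{len(file_sizes)} files exceeds the limit of {MAX_FILES}")
    for index, size in enumerate(file_sizes):
        if size > MAX_FILE_BYTES:
            problems.append(f"file {index} is {size} bytes, over the 50MB per-file limit")
    if sum(file_sizes) > MAX_TOTAL_BYTES:
        problems.append("total file size is over the 100MB per-request limit")
    if text_count > MAX_TEXT_ITEMS:
        problems.append(f"{text_count} text items exceeds the limit of {MAX_TEXT_ITEMS}")
    return problems


print(check_upload_limits([10 * MB, 20 * MB], text_count=3))
print(check_upload_limits([60 * MB], text_count=51))
```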

Processing Pipeline

1. Upload: Files are uploaded to temporary storage.
2. Extract: A background job extracts text and images.
3. Convert: Content is converted to markdown.
4. Chunk: Content is chunked with dynamic token-based splitting.
5. Embed: Embeddings are generated with Jina Embeddings v4.
6. Store: Results are stored in the database and cloud storage.

Typical processing time: 5-30 seconds depending on file size and type.
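
To give a feel for what the Chunk step produces, here is a fixed-window sketch using the documented figures of roughly 300 tokens per chunk with a 50-token overlap. It is an illustration only: the service describes its splitting as dynamic, and its tokenizer is not specified, so the function `chunk_tokens` and the integer stand-in tokens are assumptions.

```python
def chunk_tokens(tokens, size=300, overlap=50):
    """Split a token list into windows of `size` that share `overlap` tokens."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap  # each new chunk starts 250 tokens after the previous one
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # this window already reached the end of the input
    return chunks


# Stand-in tokens: integers 0..699 instead of real tokenizer output.
chunks = chunk_tokens(list(range(700)))
print([len(chunk) for chunk in chunks])
```

With these defaults, a 700-token input yields three chunks, and the last 50 tokens of each chunk reappear at the start of the next.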
