Batch Embeddings

Generates embeddings for multiple text inputs using Google's Gemini API.

Common Properties

Name - The custom name of the node.
Color - The custom color of the node.
Delay Before (sec) - Waits in seconds before executing the node.
Delay After (sec) - Waits in seconds after executing node.
Continue On Error - Automation will continue regardless of any error. The default value is false.

info

If the ContinueOnError property is true, no error is caught when the project is executed, even if a Catch node is used.

TaskType - The type of task for which the embeddings will be used. Options include:
- Retrieval Query
- Retrieval Document
- Semantic Similarity
- Classification
- Clustering
- Question Answering
- Fact Verification
Embedding Model - The model to use for generating embeddings. Default is "text-embedding-004".
Title - Title for RETRIEVAL_DOCUMENT task type.
Output Dimensionality - Reduced dimension count (e.g., 256).
Truncate - Whether to truncate inputs longer than max tokens. Default is true.

Embeddings - The generated embeddings as an array of values with associated metadata.

The Batch Embeddings node generates embeddings for multiple text inputs using Google's Gemini API. When executed, the node:

Validates the provided connection ID and content inputs
Ensures the content array is not empty and doesn't exceed 100 texts
Retrieves the configured embedding model (defaults to "text-embedding-004" if not specified)
Builds the embedding configuration with optional parameters like task type, title, and output dimensionality
Generates embeddings for each text in the content array
Returns a structured response containing the embeddings, model information, and metadata

The node will return specific errors in the following cases:

The maximum number of texts allowed per batch is 100
The Title option is only supported for RETRIEVAL_DOCUMENT task type
Output dimensionality must be a positive integer if specified
Each text in the contents array must be non-empty
The embeddings output includes statistics like token count and truncation information when available