Extract Text

Extracts text from documents using Google Document AI.

Common Properties

Name - The custom name of the node.
Color - The custom color of the node.
Delay Before (sec) - Waits in seconds before executing the node.
Delay After (sec) - Waits in seconds after executing node.
Continue On Error - Automation will continue regardless of any error. The default value is false.

info

If the ContinueOnError property is true, no error is caught when the project is executed, even if a Catch node is used.

File Path - The local file path of the document to process.
MIME Type - The MIME type of the document. If not provided, it will be auto-detected.

Credentials - Google Document AI credentials used to authenticate with the service.
Project Id - The Google Cloud project ID associated with your Document AI processor.
Location - The location of the Document AI processor. Default is "us".
Processor Id - The ID of the Document AI processor to use for text extraction.

The Extract Text node integrates with Google Document AI to extract text from documents. When executed, the node:

The node will return specific errors in the following cases:

The File Path should point to a local document file (PDF, images, etc.)
If MIME Type is not provided, it will be auto-detected from the file content
The Project Id, Location, and Processor Id are required for Document AI processing
The output includes both the full extracted text and text organized by pages
This node is useful for converting documents to plain text for further processing
The Location option specifies the geographic location of your Document AI processor (e.g., "us", "eu")
The extracted text preserves the reading order of the original document