Process Image

Processes an image file using ABBYY Cloud OCR to extract text with specified language and export format.

Common Properties

Name - The custom name of the node.
Color - The custom color of the node.
Delay Before (sec) - Time to wait, in seconds, before executing the node.
Delay After (sec) - Time to wait, in seconds, after executing the node.
Continue On Error - Automation will continue regardless of any error. The default value is false.

Inputs

Image Path - Path to the image file to process with OCR. The path can be absolute or relative to the project directory.

Options

Language - Language of the text in the image for OCR recognition (default: English). Choose from over 200 supported languages including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, and many more.
Export Format - Output format for the OCR results (default: Text File, i.e. txt). Available formats:
- Text File - Plain text output
- Unstructured Text File - Text without formatting
- Rich Text File - RTF with formatting
- Microsoft Word File - DOCX format
- Microsoft Excel File - XLSX format
- Microsoft PowerPoint File - PPTX format
- Searchable PDF File - PDF with text layer
- PDF with Text And Images - Combined PDF
- PDF A - PDF/A archival format
- XML File - Structured XML
- XML For Corrected Image - XML with image corrections
- Alto - ALTO XML format
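
The display names above presumably correspond to `exportFormat` identifiers in the ABBYY Cloud OCR SDK. The sketch below shows one way such a lookup could work. The identifier values are taken from ABBYY's API as an assumption and should be checked against the ABBYY reference; the `EXPORT_FORMATS` table and the `export_format_id` helper are hypothetical names for illustration, not the node's actual source.

```python
# Assumed mapping from the node's display names to ABBYY Cloud OCR SDK
# exportFormat identifiers; verify against the ABBYY API reference.
EXPORT_FORMATS = {
    "Text File": "txt",
    "Unstructured Text File": "txtUnstructured",
    "Rich Text File": "rtf",
    "Microsoft Word File": "docx",
    "Microsoft Excel File": "xlsx",
    "Microsoft PowerPoint File": "pptx",
    "Searchable PDF File": "pdfSearchable",
    "PDF with Text And Images": "pdfTextAndImages",
    "PDF A": "pdfa",
    "XML File": "xml",
    "XML For Corrected Image": "xmlForCorrectedImage",
    "Alto": "alto",
}


def export_format_id(display_name: str) -> str:
    """Look up the API identifier, rejecting unknown format names."""
    try:
        return EXPORT_FORMATS[display_name]
    except KeyError:
        raise ValueError(f"unsupported export format: {display_name!r}") from None
```

Rejecting unknown names early mirrors the option validation the node performs before uploading.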

Outputs

Task - ABBYY task object containing processing results, task ID, status, and download URLs.

How It Works

The Process Image node uploads an image to the ABBYY Cloud OCR service and initiates text recognition. When executed, the node:

1. Reads the image file from the specified path
2. Validates the language and export format options
3. Uploads the image to ABBYY Cloud servers
4. Initiates OCR processing with the specified language
5. Returns a task object with task ID and status

The task object can be used with Wait Task to retrieve results.
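
The steps above can be sketched roughly as follows. This is a minimal illustration, not the node's implementation: the function `build_process_image_request` is hypothetical, `service_url` must be supplied by the caller because the correct ABBYY endpoint depends on the account's region, and the query parameter names `language` and `exportFormat` are assumed from ABBYY's processImage API. The sketch only prepares the request; sending it would additionally require authenticating with the ABBYY credentials.

```python
from pathlib import Path
from urllib.parse import urlencode


def build_process_image_request(service_url: str, image_path: str,
                                language: str = "English",
                                export_format: str = "txt") -> dict:
    """Prepare (but do not send) a processImage upload request."""
    # Step 1: read the image file; raises OSError if it cannot be read.
    data = Path(image_path).read_bytes()

    # Step 2: minimal option check (the real node validates against
    # its lists of supported languages and formats).
    if not language or not export_format:
        raise ValueError("language and export format must be non-empty")

    # Steps 3-4: the image bytes become the POST body, and the OCR
    # options travel as query parameters (assumed parameter names).
    query = urlencode({"language": language, "exportFormat": export_format})
    return {
        "method": "POST",
        "url": f"{service_url.rstrip('/')}/processImage?{query}",
        "body": data,
    }
```

The response to such a request would carry the task ID and status that the node exposes as its Task output.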

Requirements

Valid ABBYY Cloud credentials configured in credentials.yaml
Valid image file at the specified path (supported formats: JPG, PNG, BMP, TIFF, PDF)
Active internet connection for cloud API access
Valid language and export format selections

Error Handling

The node will return specific errors in the following cases:

Robomotion.ABBYYCloud.ErrImagePath - Image path is invalid or file not found
Robomotion.ABBYYCloud.ErrImageData - Cannot read image file data (file may be corrupted or inaccessible)
Robomotion.ABBYYCloud.ErrOption - Invalid language or export format option selected
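
To make the three cases concrete, the sketch below classifies inputs into the error codes listed above. The function `classify_input_error`, its parameters, and the order of the checks are assumptions made for illustration; only the error code strings come from this page.

```python
import os

ERR_IMAGE_PATH = "Robomotion.ABBYYCloud.ErrImagePath"
ERR_IMAGE_DATA = "Robomotion.ABBYYCloud.ErrImageData"
ERR_OPTION = "Robomotion.ABBYYCloud.ErrOption"


def classify_input_error(image_path, language, export_format,
                         known_languages, known_formats):
    """Return the matching error code, or None when all checks pass."""
    # Missing or empty path, or no regular file at that location.
    if not image_path or not os.path.isfile(image_path):
        return ERR_IMAGE_PATH
    # The file exists but its contents cannot be read.
    try:
        with open(image_path, "rb") as handle:
            handle.read()
    except OSError:
        return ERR_IMAGE_DATA
    # Language or export format is not among the supported values.
    if language not in known_languages or export_format not in known_formats:
        return ERR_OPTION
    return None
```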

Usage Example

Scenario: Extract text from a scanned receipt image

1. Process Image node:
   - Image Path: "C:/receipts/receipt_001.jpg"
   - Language: English
   - Export Format: Text File

2. Wait Task node:
   - Task: {{ $.task }} (from Process Image output)
   - Timeout: 60 seconds

3. Download the recognized text from the result URL in the completed task
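
The waiting in step 2 amounts to polling the task until it finishes or the timeout elapses. The following is a minimal sketch of that idea, not the Wait Task node's code: `wait_for_task` and the `get_status` callback are hypothetical, and the status names are assumed from ABBYY's task status values.

```python
import time

# Assumed ABBYY task statuses after which polling can stop.
TERMINAL_STATUSES = {"Completed", "ProcessingFailed", "Deleted", "NotEnoughCredits"}


def wait_for_task(get_status, timeout_sec=60.0, interval_sec=2.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Poll get_status() until a terminal status or the timeout elapses."""
    deadline = clock() + timeout_sec
    while True:
        status = get_status()
        if status in TERMINAL_STATUSES:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"task still {status!r} after {timeout_sec} s")
        sleep(interval_sec)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to exercise without real delays; in normal use the defaults apply.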

Common Use Cases

Document Digitization - Convert scanned paper documents to editable text
Receipt Processing - Extract text from receipt photos for expense tracking
Invoice OCR - Recognize invoice details from images
Screenshot Text Extraction - Extract text from application screenshots
ID Card Recognition - Extract text from ID card images
Sign Recognition - Digitize text from photos of signs and labels

Tips and Best Practices

Image Quality: Use high-resolution images (300 DPI or higher) for best accuracy
Lighting: Ensure images are well-lit with minimal shadows
Language Selection: Always specify the correct language for optimal results
Format Choice: Use XML or Alto formats when you need structured output with coordinates
Multi-Language: For mixed-language documents, select the primary language
Preprocessing: Consider image enhancement before OCR for better results
Task Management: Always use Wait Task and Delete Task for complete workflows
Error Handling: Enable Continue On Error for batch processing scenarios
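
The batch-processing tip can be pictured as the pattern below, where one failing image is recorded and the rest of the batch still runs. This is only an analogy for what Continue On Error achieves inside a flow; `process_batch` and `process_one` are hypothetical names.

```python
def process_batch(paths, process_one):
    """Run process_one on every path, collecting failures instead of stopping."""
    results, failures = {}, {}
    for path in paths:
        try:
            results[path] = process_one(path)
        except Exception as exc:
            # Record the error and move on to the next image.
            failures[path] = str(exc)
    return results, failures
```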
