Skip to main content

ABBYY Cloud

ABBYY Cloud package provides cloud-based OCR and document processing capabilities. Process images, documents, barcodes, business cards, receipts, and more using ABBYY's cloud API.

Prerequisites

Before using ABBYY Cloud nodes, you need to:

  1. Sign up for an ABBYY Cloud OCR SDK account at ABBYY Cloud OCR
  2. Obtain your Application ID and Password from the ABBYY Cloud console
  3. Configure credentials in the credentials.yaml file:
applicationId: your-application-id
password: your-password

Available Nodes

Image Processing

  • Process Image - Extract text from images with OCR in various output formats
  • Process Text - Advanced text field recognition with custom settings
  • Submit Image - Submit additional images to existing tasks

Document Processing

  • Process Document - Full document OCR with advanced options and multiple export formats
  • Process Field - Extract specific fields from submitted document images

Specialized Recognition

Task Management

Common Workflow Patterns

Simple Image OCR

  1. Use Process Image to upload and process an image
  2. Use Wait Task to wait for processing to complete
  3. Access the results from the task output
  4. Use Delete Task to clean up

Multi-Page Document Processing

  1. Use Process Image to submit the first page
  2. Use Submit Image to add additional pages to the task
  3. Use Process Document to process all pages together
  4. Use Wait Task to wait for completion
  5. Use Delete Task to clean up

Form Processing with Fields

  1. Use Process Text to recognize specific text fields
  2. Use Process Checkmark to detect checked boxes
  3. Use Process Barcode for barcode fields
  4. Combine results for complete form data extraction

Supported Languages

ABBYY Cloud supports over 200 recognition languages including:

  • English, Spanish, French, German, Italian, Portuguese
  • Chinese (Simplified & Traditional), Japanese, Korean
  • Russian, Arabic, Hebrew, Thai, Vietnamese
  • And many more...

Export Formats

Documents can be exported in various formats:

  • Text: txt, txtUnstructured
  • Documents: rtf, docx, xlsx, pptx
  • PDF: pdfSearchable, pdfTextAndImages, pdfa
  • Structured: xml, xmlForCorrectedImage, alto

Best Practices

  1. Image Quality: Use high-resolution, well-lit images for best OCR accuracy
  2. Language Selection: Always specify the correct language for optimal results
  3. Task Cleanup: Delete tasks after processing to manage your cloud quota
  4. Error Handling: Enable "Continue On Error" for robust automation workflows
  5. Polling Strategy: Adjust poll time based on document complexity and size

Error Handling

All nodes return specific error codes:

  • Robomotion.ABBYYCloud.ErrImagePath - Invalid or missing image file path
  • Robomotion.ABBYYCloud.ErrImageData - Cannot read image file data
  • Robomotion.ABBYYCloud.ErrTaskID - Invalid or missing task ID
  • Robomotion.ABBYYCloud.ErrOption - Invalid option parameters
  • Core.ABBYYCloud.ErrTimeout - Task processing timeout
  • Core.ABBYYCloud.ErrRequest - API request failed