Extract Images

Extracts images from a PDF document and saves them as image files.

Common Properties

Name - The custom name of the node.
Color - The custom color of the node.
Delay Before (sec) - Waits in seconds before executing the node.
Delay After (sec) - Waits in seconds after executing node.
Continue On Error - Automation will continue regardless of any error. The default value is false.

info

If the ContinueOnError property is true, no error is caught when the project is executed, even if a Catch node is used.

PDF File Path - The path to the PDF file from which images will be extracted.

Output Directory - The directory where the extracted images will be saved.
Password - The password for the PDF document if it is encrypted.
Export Type - The format to save the extracted images. Options depend on the system's supported image formats.
Prefix - A prefix to add to the filenames of the extracted images.

The Extract Images node scans a PDF document and extracts all embedded images, saving them as separate image files. When executed, the node:

Validates that the input PDF path is not empty
Checks that the output directory exists
Validates that an export type is specified
Loads the PDF document (using the password if provided)
Iterates through all pages of the document
Extracts images from each page's resources
Saves each extracted image to the output directory with the specified format and prefix
Closes the document

The node will return specific errors in the following cases:

The extracted images will be numbered sequentially (1, 2, 3, etc.)
If a prefix is specified, it will be added to the beginning of each filename
The node can extract images from password-protected PDFs when the correct password is provided
Supported export types depend on the system's image I/O capabilities