PDFExtractImage

Description

Extracts all images out of PDF files.

Each image extracted is numbered consecutively in the following format:  <Image prefix name>_0001.jpg

A new column is created for the results.

Requires the user to specify the PDF as binary data.

Support for JPEG2000 image compression files libraries by default. To extract this type of image, upload a supporting library.

Use

  • Select an argument. (Binary)
  • Enter an image prefix name as the base name for each image file.
  • Enter an output field name.
  • Click OK.

The output for the function is a single record as a list. Use a function like Flatten List to return each element in the list as a separate record.  

Type

Formulas