Work with Invoice Extraction

The Invoice Document Extractor lets users extract pre-selected fields from semi structured invoice documents. (Image and PDF files)

Other than saving and previewing the Extraction, no additional configuration is needed. It is already ready to use. 

The Invoice Extraction model is a fast and easy way for you to quickly grab data from images and PDFs to use in your business automations.

Open Documents

Open AI from the navigation menu and select Documents.

Document Overview Page

View the saved document extractions.

This includes Invoices, Fixed Forms, and Custom Invoice.

The documents overview page displays all of the saved document extractions.

You can view the:

  • Document extraction names - Name given to identify the document extraction.
  • Number of versions - The number of different versions for the document extraction.
  • Created - The date the document extraction was created.
  • Settings:
    • Edit -  Open the extraction in Document Studio to edit the extraction.
    • Clone - Copy the document extraction.
    • Delete -  Delete the document extraction.
    • Configuration - Edit the name of the extraction.

Create a Pre-trained Invoice Extraction

To create a new Invoice Extraction:

  1. After opening Documents in Hero Platform_, click Create Document Model.

  2. Enter a name for the Invoice Extraction and click Next.

  3. Select Invoices and click Next.

  4. Select one or more sample invoice documents from the file browser by clicking Choose file...
    Click Save & Preview.

    A sample invoice document is a reference image. It allows you to view the type of data that will be extracted for future input invoice documents in the Flow Studio.

    Supported file types:

    • .jpg
    • .png
    • .pdf


    Languages supported

    LanguageNote
    EnglishSupported
    German, French, Spanish, Portuguese, ItalianSupported (may take slightly longer to process)


  5. The sample invoice image is displayed in the Document Studio highlighting the data found from the default fields.
    Hero Platform_ scans the documents and matches the values with pre-built fields. Any field found, a value is displayed. Any fields not found are not used in the preview.
    See a list of all pre-built fields.
    There is nothing to configure at this point. Review the highlighted values to see if they match the fields displayed under Results Preview.

    Documents with multiple pages can be viewed by scrolling. View additional invoice documents added to the extraction by clicking the arrows on either side of Document Studio.

    View invoice documents by selecting the Sample Documents tab.
    Add additional invoices (only PDF files are supported) by clicking Choose file... 
         - These files are for preview only and will not affect any Flows using this extractor.
    Remove invoices by clicking the garbage icon next to the invoice's name.

  6. Click Save in the toolbar to add the Invoice Extraction.

Hero Platform_'s Invoice Field List

This is the list of Hero Platform_'s recognized fields in the pre-trained invoice extraction model.

Global values per-document

  • ABA Routing Number
  • Account Name
  • Account Number
  • Amount Due
  • Amount Paid
  • BIC
  • Bill To
  • Company ID
  • Contact Person
  • Customer Address
  • Customer Company ID
  • Customer ID
  • Customer Name
  • Customer Phone Number
  • Customer Tax ID
  • Due Date
  • IBAN
  • Invoice Date
  • Invoice Number
  • Order Date
  • Order Number
  • Payment Reference
  • Payment Terms
  • SWIFT
  • Ship To Address
  • Ship To Name
  • Sort Code
  • Tax ID
  • Tax Rates
  • Total Amount
  • Total Amount Before Tax
  • Total Tax
  • Vendor Address
  • Vendor Name

Line item values

Everything in a line item that does not match one of theses values are combined into a single "Other" value for the line.

  • Item Quantity
  • Item Unit Price
  • Item Cell
  • Item Date
  • Item Number
  • Item Total Price
  • Item Description
  • Item Unit of Measure

Use an Invoice Extraction in a Flow

After an Invoice Extraction has been saved, it can be used as a function in a Flow.

To use an Invoice Extraction in a Flow:

  1. Open and start creating a Flow in the Flow Studio.
  2. View the Document functions in the element browser.
  3. Click and drag the Invoice Extraction from the element browser onto the Flow Studio canvas.
  4. Connect the Invoice Extraction using a cable from an element in the Flow.
  5. Add Input documents.
  6. Configure/review the fields for the Fixed Form model's Docker container deployment.

    1. Capture logs - Select if the Docker function should capture logs.
    2. RAM - Adjust the sliding bar for memory (RAM) allocation for the function.
    3. vCPU - Adjust the sliding bar for CPU consumption. (by cores)
    4. Attempt timeout(s) - Enter the timeout setting (in seconds).
    5. Initial Delay - Enter the initial delay value in seconds for amount of time to between when container starts and when the Flow begins to use it.
    6. Retry attempts - Enter the max retry attempts before failing.

    Automation Hero recommends leaving the Docker container settings at the default levels unless problems arise. 

    An example of when raising the default settings may be beneficial is when the the documents being processed are very large.

  7. Click OK to finish adding the Invoice Extraction to the Flow.