Extract Data

You can select this activity to extract data from a document for the selected fields that are created using the Manage Document Type functionality on the Design tab. Data can be in the form of key-value pair, table, or string.

You can use one of these options to extract data:

  • Regular Expression Based: This option enables you to extract data based on the pattern in the string that is obtained from the OCR engine. For example, the date field pattern is (\d\/\d\/\d+\). This is the most accurate extraction method. See, Extracting Data using Regular Expression Based Extraction section.
  • Layout Based: This option enables you to define and manage templates for extracting data. See, Navigating through the RPA Studio application section.
  • Configure: This option enables you to select the type of extractor you want to use.
  • Flow Based: This option enables you to extract data based on the Flows provided by the IDP service. You cannot use the Regular Expression or Layout Based extraction methods, when using this option. See, Extracting data using Flow Based extraction.

This table lists the properties for the activity.

Property Type Property Name Data Type Description
Input Document Path String The location of file from which data must be extracted.
Output Response JToken Output for the response.
Response code Int32 Response code for the activity. Possible values:
  • 200 to 290: Indicates a successful response or valid output.
  • 400 to 499: Indicates client error responses.
  • 500 to 599: Indicates server error responses.