Extract PDF Text
You can select this activity to extract plain text from a PDF document.
Note: Text from a table is also extracted but not formatted.
This table lists the properties for the activity.
Property Type | Property Name | Data Type | Description |
---|---|---|---|
Common | Continue on error | Boolean | The option to continue the RPA flow even if the activity fails. This check box is selected by default. |
Input | File Path | String | The location of the PDF document from which text must be extracted. For example, C:\RPA\test.pdf |
Page Range | String | The pages from which text must be extracted. You can specify a single page or a
range of pages. For example,1-3 or 5 |
|
Misc | DisplayName | String | The name to be displayed for the activity. |
Output | Extracted Text | String | The text extracted from the specified page range. You must create a variable to store this value. |
Response Code | Int32 | Response code for the activity. Possible values:
Note: You must create and specify the int32 variable to
view the response code.
|