SA Data Extraction for Data Lake and Birst overview
Function acronym: SAABE
Use this report to create an extract of Distribution SX.e tables for use by Infor Distribution Analytics Content for Birst. The report creates a text file, data files, or a bulk extraction. The text file is for each table selected for extraction that can be uploaded in Birst. Data files can be placed in the Data Lake queue (DLQUEUE) to be loaded into the Infor Data Lake repository. The bulk extraction can be used to load up to 20 GB of data into the Data Lake queue (DLQUEUE) table.
Use SA Audit Processing Administration to monitor and purge the records in the DLQUEUE table.
When you run the SA Data Extraction for Data Lake and Birst report, you can optionally specify a date range, and then select the table or tables to be extracted.
Specify a value in the Days to Extract option to extract data from the current day back to the specified number of days. This is useful for setting up the SA Data Extraction for Data Lake and Birst as a stored report to do incremental data loads to Birst.
Select an Extract Type.
- When you select Text File as the Extract Type, the report
extracts data as text files based on the standard temp-tables built for the REST API
FetchWhere service. Not all tables are used to populate the domains supported in the
current version of Distribution Analytics Content for Birst. Specify a value in the
File Delimiter: (C)omma or (P)ipe
option to determine the field delimiter used in the text file. If you select
Text as the Extract
Type option for an Distribution SX.e
on-premises environment, then you must select
Pipe as the File
Delimiter.
See more information about the supported data load sources for Birst in Supported tables for text extract to Birst.
- When you select Data Lake as the Extract Type, the report
extracts data for the selected tables and loads them into the Data Lake queue
(DLQUEUE). The Data Connector web service sends the data through ION Messaging
Service to the Infor Data Lake. The Data Lake
Extract Type is currently
available if you are running CloudSuite Distribution only.
See more information about the supported data load sources for the Data Lake in Supported tables for Data Lake
- When you select Bulk as the Extract Type, a Data Fabric bulk ingestion API
service is used to load up to 20 GB of data into the Data Lake queue (DLQUEUE) table
within 48 hours.Note: The bulk option in SA Data Extraction for Data Lake and Birst is typically used for an initial load, but you can also use it to reload data after large amounts of data are purged from Data Lake.
See the Infor CloudSuite Distribution Configuration Guide for instructions for configuring the Data Lake integration.
You can print the extract file output to view, email, or Dropbox. If you are running Distribution SX.e on-premises, you can also print the output to a local file.