SA Data Extraction for Data Lake and Birst overview

System Administrator > Administration > SAA Reports > SAABE

Function acronym: SAABE

Use this report to create an extract of Distribution SX.e tables for use by Infor Distribution Analytics Content for Birst. The report creates text files, data files, or a bulk extraction. A text file is created for each table selected for extraction and can be uploaded to Birst. Data files can be placed in the Data Lake queue (DLQUEUE) to be loaded into the Infor Data Lake repository. The bulk extraction can be used to load up to 20 GB of data into the Data Lake queue (DLQUEUE) table.

Use SA Audit Processing Administration to monitor and purge the records in the DLQUEUE table.

When you run the SA Data Extraction for Data Lake and Birst report, you can optionally specify a date range, and then select the table or tables to be extracted.

Specify a value in the Days to Extract option to extract data from the current day back through the specified number of days. This is useful when you set up the SA Data Extraction for Data Lake and Birst report as a stored report that performs incremental data loads to Birst.
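
The following is a minimal sketch, not part of the product, of how a Days to Extract value could translate into a date window. It assumes the window ends on the current day and starts the specified number of days earlier; the exact boundary handling used by the report is not documented here. The function name extraction_window is hypothetical.

```python
from datetime import date, timedelta
from typing import Optional, Tuple


def extraction_window(days_to_extract: int,
                      today: Optional[date] = None) -> Tuple[date, date]:
    """Return (start, end) for a look-back window.

    Assumption: the window ends on the current day and begins
    days_to_extract days before it. The report's actual boundary
    rules may differ.
    """
    if days_to_extract < 0:
        raise ValueError("days_to_extract must not be negative")
    end = today or date.today()
    start = end - timedelta(days=days_to_extract)
    return start, end


# Example: a stored report scheduled daily with a 7-day look-back.
start, end = extraction_window(7, today=date(2024, 3, 10))
print(start, end)
```

A look-back that is longer than the schedule interval means consecutive runs overlap, so a downstream load would likely need to tolerate records that appear in more than one extract.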

Select an Extract Type.

When you select Text File as the Extract Type, the report extracts data as text files based on the standard temp-tables built for the REST API FetchWhere service. Not all tables are used to populate the domains supported in the current version of Distribution Analytics Content for Birst. Specify a value in the File Delimiter: (C)omma or (P)ipe option to determine the field delimiter used in the text file. If you select Text File as the Extract Type for a Distribution SX.e on-premises environment, then you must select Pipe as the File Delimiter.
See more information about the supported data load sources for Birst in Supported tables for text extract to Birst.
When you select Data Lake as the Extract Type, the report extracts data for the selected tables and loads the data into the Data Lake queue (DLQUEUE). The Data Connector web service sends the data through ION Messaging Service to the Infor Data Lake. The Data Lake Extract Type is currently available only if you are running CloudSuite Distribution.
See more information about the supported data load sources for the Data Lake in Supported tables for Data Lake.
When you select Bulk as the Extract Type, a Data Fabric bulk ingestion API service is used to load up to 20 GB of data into the Data Lake queue (DLQUEUE) table within 48 hours.
Note: The bulk option in SA Data Extraction for Data Lake and Birst is typically used for an initial load, but you can also use it to reload data after large amounts of data are purged from Data Lake.
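
For the Text File extract type, the File Delimiter option determines how a consumer must split each line of the file. The sketch below is an illustration only: it maps the C and P option values to their delimiter characters and reads rows with the Python standard csv module. The sample rows, the function name read_extract, and the absence of a header row are assumptions; the actual layout and quoting of an extract file are not described here.

```python
import csv
import io

# Maps the File Delimiter option value to the character it selects.
DELIMITERS = {"C": ",", "P": "|"}


def read_extract(text_stream, delimiter_option="P"):
    """Read a delimited extract into a list of rows (lists of strings).

    Assumption: one record per line, fields separated by the selected
    delimiter, no header row.
    """
    delimiter = DELIMITERS[delimiter_option.upper()]
    reader = csv.reader(text_stream, delimiter=delimiter)
    return [row for row in reader]


# Hypothetical sample data; a real extract would be opened with open(path, newline="").
sample = io.StringIO("1|ACME, Inc.|100.50\n2|Widgets|20.00\n")
for row in read_extract(sample, "P"):
    print(row)
```

In this made-up sample, the comma inside the second field stays within one field because the pipe is the separator.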

Note: SA Data Extraction for Data Lake and Birst sends only an initial data load to the Data Lake. To update Data Lake data objects when database records are added, changed, or deleted after the initial load, you must activate database table replication in SA Administrator Options-System-Options. The tables selected for updating must be marked for replication in SA Audit Processing Administration.

See the Infor CloudSuite Distribution Configuration Guide for instructions for configuring the Data Lake integration.

You can print the extract file output to view, email, or Dropbox. If you are running Distribution SX.e on-premises, you can also print the output to a local file.