Sending data to Data Lake

Several data ingestion methods are available, depending on the scope and workflow of a particular use case.

This can include direct replication from applications and services, integration-based data acquisition with Infor ION, or data migration projects moving data from on-premises systems to the Cloud.

These are the primary methods of data movement to Data Lake:

  • Data Fabric
    • Batch ingestion APIs and services
  • Infor ION
    • Data Lake Flows
    • Data Loader

For more information about Data Lake flows, see the Infor ION Desk User Guide and the Infor ION Development Guide. This guide focuses on the ingestion services found within the Data Fabric application and API suite.

Data Lake uses object storage, making it capable of storing files, or “data objects”, of virtually any data format. Data objects are eligible to be queried only if they conform to one of the recognized formats, which include NDJSON and DSV.
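As an illustration of the two formats named above, this minimal sketch serializes the same records as NDJSON (one JSON object per line) and as DSV (delimiter-separated values). It uses only the Python standard library. The record fields, the pipe delimiter, and the header row are assumptions made for the example, not documented Data Lake requirements.

```python
import csv
import io
import json


def to_ndjson(records):
    """Serialize each record as one compact JSON object per line (NDJSON)."""
    return "".join(json.dumps(r, separators=(",", ":")) + "\n" for r in records)


def to_dsv(records, fieldnames, delimiter="|"):
    """Serialize records as delimiter-separated values with a header row.

    The delimiter and the header row are illustrative choices only.
    """
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=fieldnames, delimiter=delimiter, lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


# Hypothetical sample records; the property names are not from any real schema.
records = [
    {"orderId": "SO-1001", "quantity": 5},
    {"orderId": "SO-1002", "quantity": 12},
]

ndjson_text = to_ndjson(records)
dsv_text = to_dsv(records, ["orderId", "quantity"])
```

In this sketch, each line of ndjson_text parses independently as JSON, which is what lets a line-oriented reader process a data object one record at a time.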

As a best practice, we recommend that developers define and register metadata that describes a data object before data ingestion.

Schemas serve as a data dictionary reference for data objects. Schemas maintained within the Data Catalog help index and model data and content flows throughout the Infor OS platform.
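The idea of describing a data object before ingesting it can be sketched as a simple pre-ingestion check. The dictionary below is a hypothetical stand-in for a data dictionary and is not the Data Catalog's actual schema format. The names ORDER_SCHEMA and conforms are assumptions introduced for this example only.

```python
# Hypothetical data dictionary: property name -> expected Python type.
# This is NOT the Data Catalog schema format, only an illustration.
ORDER_SCHEMA = {"orderId": str, "quantity": int}


def conforms(record, schema):
    """Return True if the record has exactly the schema's properties,
    each holding a value of the expected type."""
    if set(record) != set(schema):
        return False
    return all(
        isinstance(record[name], expected) for name, expected in schema.items()
    )


# Check hypothetical records against the description before sending them.
valid_record = {"orderId": "SO-1001", "quantity": 5}
missing_property = {"orderId": "SO-1002"}

is_valid = conforms(valid_record, ORDER_SCHEMA)
is_invalid = conforms(missing_property, ORDER_SCHEMA)
```

A check of this kind catches records that drift from their description before they are sent, which is the practical benefit of defining the metadata first.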

For more information about object formats, see the Data Catalog topics in the ION user guides.