Data object creation
Data objects are typically created by a source system or Data Lake.
Data object creation by a source system
A data publisher provides data replication modules that accumulate transactions from a
specific period of time. The transactions are stored within a data object, which is later
published to Data Lake. If the data is determined to be replicated
to Data Lake by, for example, a schedule or optimal data object
size, the data publisher can use the Data Lake Flows in ION Connect or Data Lake's Batch API for a direct
upload to Data Lake. The Channel
property in
Atlas and the storage APIs provide information on the ingestion
origin.
Data object creation by Data Lake
When a data publisher streams transactional events in real time with Data Fabric's Streaming Ingestion API, the transactions are accumulated and then micro-batched by Data Lake. Data Lake generates data objects after 5 MB of data are reached or every 5 minutes, whichever happens first. Until then, the transactions are not available in Data Lake, but can be processed immediately, in real time with Infor Stream Pipelines.