Late arriving data
If a dimension entry cannot be found, the fact is omitted. For example, if a sold-to business partner that is listed on a sales order line does not exist in the Sold-to BP dimension, then the Sales Order Line fact is automatically excluded. The related amount, quantity, count measures are not included in the aggregated numbers.
This can happen when:
- Dimension data arrives late.
The fact arrives ahead of the dimension data.
In the example, the sales order line data has arrived in the data warehouse, but the sold-to BP data has not yet arrived. The issue with the late arriving dimension data is resolved the next time data is extracted from Data Lake. In the meantime, the sales order line is excluded from analysis.
- Data is corrupt in LN.
In the example, the sold-to BP does not exist in LN, but a sales order line still refers to the sold-to BP. This data corruption must be corrected in LN by adding the sold-to BP in LN. The sales order lines are included the next time data is extracted from Data Lake and processed. They are handled as (very) late arriving dimension data.