Skip to content

Enhancement: Support for default NULL values in Metadata-driven transformations #12

@thms317

Description

@thms317

Problem

When using pushcart for metadata-driven data ingestion, there are instances where the source data lacks some columns that are described in the metadata table. Currently, this scenario causes the ingestion to fail or process incorrectly because the expected columns are absent.

Suggested Enhancement

It would be beneficial to introduce a mechanism to handle such discrepancies gracefully. One possible solution is to add an optional nullable field to the metadata table. This field would specify whether a missing source column can be considered as null during the data ingestion process.

Proposed Behavior

  1. Metadata Table Adjustment: Add a nullable column in the metadata table where each entry specifies if the corresponding data column can default to null when absent.
  2. Ingestion Logic: Modify the ingestion process to check for the nullable flag. If the source data column is missing and nullable is set to true, the column should be assumed as null in the ingested data.
  3. User Notification: Optionally, the system could log or notify when data columns are set to null due to the absence of source data, keeping the user informed of these adjustments.

Benefits

Implementing this feature would make the framework more robust and flexible in handling various real-world data scenarios, reducing the need for manual data pre-processing and enhancing the overall usability of the Pushcart framework for data ingestion tasks.

Please consider this feature for upcoming releases, as it would significantly enhance error handling and data integrity in the ingestion process.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions