
ETL Tasks

You can manipulate datasources using the following ETL tasks:

  • Transform: Refine and reshape datasources for downstream processing.
    Transform

  • Join: Merge multiple datasources based on specified keys.
    Join

  • Group By: Consolidate rows by grouping based on a key and applying aggregation functions.
    Group By

  • Sort: Arrange rows in a specified order.
    Sort

  • Concat: Combine two datasources with identical schemas into one.
    Concat

  • Remove Duplicates: Eliminate duplicate rows within a datasource.
    Remove Duplicates

  • Write CSV: Save datasource contents in CSV format for external use.
    Write CSV

  • Write JSON: Save datasource contents in JSON format for external use.
    Write JSON

  • Write XLS(X): Save datasource contents in XLS(X) format for external use.
    Write XLS(X)

  • Load: Transfer processed data to target systems for integration and storage.
    Load

  • Log Rows: Display a specified number of rows for debugging and validation.
    Log Rows

  • Filter: Keep only the rows that meet a specified set of conditions for downstream processing.
    Filter

  • Iterate: Configure an iterative process based on a specific datasource.
    Iterate
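
To make the row-level semantics of a few of these tasks concrete, here is a minimal sketch in plain Python. It is not the product's API or configuration format: the function names (`remove_duplicates`, `filter_rows`, `join`, `group_by_sum`, `sort_rows`, `run_pipeline`) and the sample `orders` and `customers` data are hypothetical, and a datasource is assumed to behave like a list of rows with named fields.

```python
from collections import defaultdict

# Hypothetical sample data; a datasource is modeled as a list of dict rows.
orders = [
    {"order_id": 1, "customer_id": "c1", "amount": 30.0},
    {"order_id": 2, "customer_id": "c2", "amount": 12.5},
    {"order_id": 2, "customer_id": "c2", "amount": 12.5},  # duplicate row
    {"order_id": 3, "customer_id": "c1", "amount": 7.5},
]
customers = [
    {"customer_id": "c1", "name": "Ada"},
    {"customer_id": "c2", "name": "Grace"},
]


def remove_duplicates(rows):
    """Remove Duplicates: keep the first occurrence of each identical row."""
    seen = set()
    unique = []
    for row in rows:
        fingerprint = tuple(sorted(row.items()))
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append(row)
    return unique


def filter_rows(rows, predicate):
    """Filter: keep only the rows for which the condition holds."""
    return [row for row in rows if predicate(row)]


def join(left, right, key):
    """Join: inner join of two row lists on a shared key."""
    index = defaultdict(list)
    for right_row in right:
        index[right_row[key]].append(right_row)
    return [
        {**left_row, **right_row}
        for left_row in left
        for right_row in index.get(left_row[key], [])
    ]


def group_by_sum(rows, key, value):
    """Group By: group rows on `key` and sum the `value` field."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row[value]
    return [{key: group, value: total} for group, total in totals.items()]


def sort_rows(rows, key, descending=False):
    """Sort: order rows by the given field."""
    return sorted(rows, key=lambda row: row[key], reverse=descending)


def run_pipeline():
    """Chain the steps the way tasks might be chained in a flow."""
    rows = remove_duplicates(orders)
    rows = filter_rows(rows, lambda row: row["amount"] >= 10)
    rows = join(rows, customers, "customer_id")
    rows = group_by_sum(rows, "name", "amount")
    return sort_rows(rows, "amount", descending=True)


if __name__ == "__main__":
    for row in run_pipeline():
        print(row)
```

The sketch only illustrates what each step does to the rows; how the corresponding tasks are configured and chained in practice is described on the individual task pages.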

Version(s)
1.0.0