Using ETL dataflows
ETL dataflows are the primary tool for joining or merging multiple datasets and for building data transformation pipelines. Domo datasets are the only supported inputs, and the output of a dataflow is one or more datasets. Note that ETL is something of a misnomer here: because the input datasets have already been created by connectors, the actual sequence is extract, load, and then transform (ELT). In this section, we'll walk through how to create an ETL dataflow.
Creating an ETL dataflow
The scenario we are going to walk through uses the Opportunity dataset. We want to add date dimension information and cleanse the lead source type to produce a new enhanced and cleansed dataset.
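To make the goal concrete before building it in Domo, the following is a minimal Python sketch of the same transformation done outside Domo. It is not how the ETL dataflow itself is implemented. The column names (Id, CloseDate, LeadSource), the sample rows, and the cleansing rules are hypothetical, and the sketch derives the date attributes directly from each date rather than joining to a separate date dimension dataset.

```python
from datetime import date

# Hypothetical sample rows; the real Opportunity dataset's columns may differ.
opportunities = [
    {"Id": "001", "CloseDate": date(2021, 3, 15), "LeadSource": " web "},
    {"Id": "002", "CloseDate": date(2021, 11, 2), "LeadSource": "Partner Referral"},
    {"Id": "003", "CloseDate": date(2022, 1, 9), "LeadSource": ""},
]

# Hypothetical cleansing rules: normalized raw value -> standard label.
LEAD_SOURCE_MAP = {"web": "Web", "partner referral": "Partner"}


def date_dimension(d):
    """Derive a few date dimension attributes from a single date."""
    return {
        "Year": d.year,
        "Quarter": f"Q{(d.month - 1) // 3 + 1}",
        "Month": d.month,
    }


def cleanse_lead_source(raw):
    """Trim whitespace, map known variants, and label blanks as Unknown."""
    trimmed = raw.strip()
    if not trimmed:
        return "Unknown"
    return LEAD_SOURCE_MAP.get(trimmed.lower(), trimmed)


def enhance(rows):
    """Return new rows with date attributes added and LeadSource cleansed."""
    return [
        {
            **row,
            **date_dimension(row["CloseDate"]),
            "LeadSource": cleanse_lead_source(row["LeadSource"]),
        }
        for row in rows
    ]


enhanced = enhance(opportunities)
for row in enhanced:
    print(row["Id"], row["Year"], row["Quarter"], row["LeadSource"])
```

The input rows are left unchanged and a new list is produced, which mirrors the way a dataflow reads input datasets and writes a separate output dataset.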
Let's start by making sure that all the input data we need is available as datasets:
- Download the Opportunity.xlsx file from https://github.com/PacktPublishing/Data-Democratization-with-Domo/blob/e92849ebfd14321be6410f88f473dac7838d18b1/Opportunity...