Create a Dataset (Salesforce Object) using Dataset Builder and Dataflow

In my previous blog, I talked about creating a dataset from a csv file (Click Here). Now we would be talking about another approach on creating a dataset using dataset builder and dataflow.

A big question – what is a dataflow ?

A Dataflow in simple terms is just a way in which you can start with a particular set of data and then change it, build on it, combine with other data, derive new columns based on some values in other columns, etc. to come up with your final data. So simply it a set of steps or instructions to be performed on a data to use it for further analytics and insights.

Here we would be creating a dataset for Accounts and its owner information (Full Name and Photo)

STEP 1 : Click on Create Button & Select Dataset

There are multiple ways to start creating a dataset. After entering Analytics Studio, you can either click on the “Create” button (on the top right corner) or you can go into your Einstein Analytics app and click the button, etc. Select Dataset.

Create a dataset

STEP 2 : Choose Data Source

Choose Salesforce Data as the datasource.

Data Source

STEP 3 : Add New Dataset Details

Add the new dataset name and whether you want to create a new dataflow or add it to an existing dataflow. Lets name the dataset as Accounts and choose a new dataflow – Account with Owner Details.

New Dataset

STEP 4 : Select the objects and Fields

Select the root or base object to start building the dataflow. This is the object from which we will start building our dataflow. Ideally you should choose the base or the main object from which you want to create further associations.

Choose Base Object

We will choose Account as the Base object.

Base Object

Click on the plus symbol which will allow you to add fields for the Account object that you want to be included in your dataset.

Account Fields

We wanted Account with Owner Details. Click on the Relationship Tab to Associate with Owner fields.

Relationship Tab

Click on the User (Owner ID) node (plus symbol) and select the fields. In case the list is long, you can even search for the field. We will search for Full Name field. When finished, click on the Next Button.

Choose Fields

STEP 5 : Select the App

Select the App where you want to save your dataset.

Select App

Einstein now does the magic of creating the dataflow and also running the dataflow in the background.

Dataset Creation

You can now see the dataflow that has been created. In case you choose not to perform the above steps, you can directly create the dataflow as well. I will cover the dataflow in the upcoming blogs.

However to give an idea, the dataflow created the following :-

  • 2 SFDCDigest Node – These nodes represent that data is coming from Salesforce Objects
  • 1 Augment Node – This node is to join the Account and the user object i.e. based on the User Id in the Owner field, join the Account data with other user information.
  • 1 SFDCRegister – This is the node for creating the dataset.
Dataflow

You can check the “Monitor” tab and see your dataflow running with stats of every step.

Dataflow Monitor

You can edit the dataflow directly, if required. I will cover that in the upcoming blogs.