Which data flow components should you use to identify modified data?

You are developing a SQL Server Integration Services (SSIS) package.
The package uses a data flow task to source data from a SQL Server database for loading
into a dimension table in a data warehouse.
You need to create a separate data flow path for data that has been modified since it was
last processed.
Which data flow components should you use to identify modified data? (Each correct answer
presents a complete solution. Choose all that apply.)

You are developing a SQL Server Integration Services (SSIS) package.
The package uses a data flow task to source data from a SQL Server database for loading
into a dimension table in a data warehouse.
You need to create a separate data flow path for data that has been modified since it was
last processed.
Which data flow components should you use to identify modified data? (Each correct answer
presents a complete solution. Choose all that apply.)

A.
Multicast

B.
Data Conversion

C.
Lookup

D.
Slowly Changing Dimension

E.
Aggregate

Explanation:
A: The transformation that distributes data sets to multiple outputs.
The transformation that distributes data sets to multiple outputs.
C: Lookup Transformation
The transformation that looks up values in a reference table using an exact match.
Note:

* SQL Server Integration Services provides three different types of data flow components:
sources, transformations, and destinations. Sources extract data from data stores such as
tables and views in relational databases, files, and Analysis Services databases.
Transformations modify, summarize, and clean data. Destinations load data into data stores
or create in-memory datasets.
Incorrect:
Not B: Data Conversion Transformation
The transformation that converts the data type of a column to a different data type.
Not D: Slowly Changing Dimension Transformation
The transformation that configures the updating of a slowly changing dimension.
Not E: The Aggregate transformation applies aggregate functions, such as Average, to
column values and copies the results to the transformation output. Besides aggregate
functions, the transformation provides the GROUP BY clause, which you can use to specify
groups to aggregate across.



Leave a Reply 14

Your email address will not be published. Required fields are marked *


Marge

Marge

A Multicast transformation directs every row to every output.
Not ‘A’

dennis

dennis

the fact that Multicast directs a row to every output doesn’t mean that Multicast is not able to do some logical reasoning, so i think that A is still a valid answer

nmf

nmf

Thats not true! Can you give an example?

Slazenjer_m

Slazenjer_m

…(Each correct answer presents a complete solution. Choose all that apply.)

Without some kind of extra manipulations, Multicast cannot (and will not), create a separate data flow path for data that has been modified since it was loaded. Please, go to MSDN and read up on what Multicast does again. Better yet, try using it in a data flow in SSDT and see what output you get.

Answers: SCD (Changing/Changed Dimension) & Lookup.

Lumi

Lumi

Why not SCD with Changing attributes?

ryahan

ryahan

the question is a bit crap.. for me A doesn t help you identify modified data.. if so give an example ?.. Look up ok why not .. but SCD is actually the first choice you should tick , it s been creating for this very precise purpose , if not then tell me why ?

paeren

paeren

The only correct answers would med Lookup and SCD task. It’s not illegal to think use SSIS to test it.

Slazenjer_m

Slazenjer_m

The scenario described in the question requires an SCD (rows modified since the last processing… any of insert/update/delete). Apart from an SCD, a Lookup transform could also do a row-to-row or column-to-column comparison with a target table to see if data change has occurred.

Correct options are C & D. Multicast is never an option here.

Islam

Islam

agree it makes sense to use a lookup and a SCD

Yuriy

Yuriy

A and C

A(first part of solution) – to …. create a separate data flow path ….
C(second part of solution) – …. to identify modified data ….

Slazenjer_m

Slazenjer_m

Have you ever tried to use an SCD in a project and see what it does?! It actually splits input data into multiple output as described in the question scenario above… I would have uploaded a screen-shot of a project if this textbox would have allowed!! Multicast is completely off-the-mark.

mike

mike

Anyone who think A is a correct answer, do not try this exam…

lol

lol

lol. so true!