You are the data steward for a Business Intelligence project.
You must identify duplicate rows stored in a SQL Server table and output discoveries to a
CSV file. A Data Quality Services (DQS) knowledge base has been created to support this
project.
You need to produce the CSV file with the least amount of development effort.
What should you do?
A.
Create an Integration Services package and use a Fuzzy Lookup transform.
B.
Create a Master Data Services (MDS) model.
C.
Create a Data Quality Project.
D.
Create an Integration Services package and use a DQS Cleansing transform.
E.
Create an Integration Services package and use a Fuzzy Grouping transform.
Explanation:
The DQS Cleansing transformation uses Data Quality Services (DQS) to correct data from a
connected data source, by applying approved rules that were created for the connected data
source or a similar data source.
To determine whether the data has to be corrected, the DQS Cleansing transformation
processes data from an input column when the following conditions are true:
* The column is selected for data correction.
* The column data type is supported for data correction.
* The column is mapped a domain that has a compatible data type.
* The transformation also includes an error output that you configure to handle row-level
errors
C
I agree it should be option C, since you already have a DQS KB.
Question says “you need to produce the CSV file with the least amount of development effort”, so a Data Quality project using the DQS Client would be much easier/quicker to complete than a SSIS DQS Cleansing transform project.
C.
Create a Data Quality Project.