You develop a SQL Server Integration Services (SSIS) package that imports Windows Azure
SQL Database data into a data warehouse every night.
The Windows Azure SQL Database data contains many misspellings and variations of
abbreviations. To import the data, a developer used the Fuzzy Lookup transformation to
choose the closest-matching string from a reference table of allowed values. The number of
rows in the reference table is very large.
If no acceptable match is found, the Fuzzy Lookup transformation passes a null value.
The current setting for the Fuzzy Lookup similarity threshold is 0.50.
Many values are incorrectly matched.
You need to ensure that more accurate matches are made by the Fuzzy Lookup
transformation without degrading performance.
What should you do?
A.
Change the similarity threshold to 0.40.
B.
Decrease the maximum number of matches per lookup.
C.
Change the similarity threshold to 0.85.
D.
Increase the maximum number of matches per lookup.
Explanation:
* Similarity threshold
Set the similarity threshold at the component level by using the slider. The closer the value is
to 1, the closer the resemblance of the lookup value to the source value must be to qualify
as a match. Increasing the threshold can improve the speed of matching since fewer
candidate records need to be considered.
Incorrect:
* Maximum number of matches to output per lookup
Specify the maximum number of matches the transformation can return for each input row.
The default is 1.