You develop a SQL Server Integration Services (SSIS) package that imports Windows Azure SQL Database
data into a data warehouse every night.
The Windows Azure SQL Database data contains many misspellings and variations of abbreviations. To
import the data, a developer used the Fuzzy Lookup transformation to choose the closest-matching
string from a reference table of allowed values. The number of rows in the reference table is very large.
If no acceptable match is found, the Fuzzy Lookup transformation passes a null value.
The current setting for the Fuzzy Lookup similarity threshold is 0.50.
Many values are incorrectly matched.
You need to ensure that more accurate matches are made by the Fuzzy Lookup transformation without
degrading performance.
What should you do?
A.
Change the similarity threshold to 0.40.
B.
Decrease the maximum number of matches per lookup.
C.
Change the similarity threshold to 0.85.
D.
Increase the maximum number of matches per lookup.
Explanation:
* Similarity threshold
Set the similarity threshold at the component level by using the slider. The closer the value is to 1, the
closer the resemblance of the lookup value to the source value must be to qualify as a match. Increasing
the threshold can improve the speed of matching since fewer candidate records need to be considered.