Note: The question is part of a series of questions that present the same scenario. Each question in the series
contains a unique solution that might meet the stated goals. Some question sets might have more than one
correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will
not appear in the review screen.
Your company has multiple databases that contain millions of sales transactions.
You plan to implement a data mining solution to identify purchasing fraud.
You need to design a solution that mines 10 terabytes (TB) of sales date. The solution must meet the following
requirements:
Run the analysis to identify fraud once per week.
Continue to receive new sales transactions while the analysis runs.
Be able to stop computing services when the analysis is NOT running.
Solution: You create a Microsoft Azure HDInsight cluster.
Does this meet the goal?
A.
Yes
B.
No
Explanation:
HDInsight cluster billing starts once a cluster is created and stops when the cluster is deleted. Billing is prorated per minute, so you should always delete your cluster when it is no longer in use.
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-provision-linux-clusters
YES
Not sure if this solution works if the cluster is dropped, but the data is persisted in azure blob storage.