You are analyzing taxi trips in New York City. You leverage the Azure Data Factory to create data pipelines and
to orchestrate data movement.
You plan to develop a predictive model for 170 million rows (37 GB) of raw data in Apache Hive by using
Microsoft R Server to identify which factors contribute to the passenger tipping behavior.
All of the platforms that are used for the analysis are the same. Each worker node has eight processor cores
and 26 GB of memory.
Which type of Azure HDInsight cluster should you use to produce results as quickly as possible?
A.
Hadoop
B.
HBase
C.
Interactive Hive
D.
Spark
Explanation:
https://azure.microsoft.com/en-gb/blog/general-availability-of-hdinsight-interactive-query-blazing-fast-datawarehouse-style-queries-on-hyper-scale-data-2/