What should you configure?

You have an Apache Hive table that contains one billion rows.
You plan to use queries that will filter the data by using the WHERE clause. The values of the columns will be
known only while the data loads into a Hive table.
You need to decrease the query runtime.
What should you configure?

You have an Apache Hive table that contains one billion rows.
You plan to use queries that will filter the data by using the WHERE clause. The values of the columns will be
known only while the data loads into a Hive table.
You need to decrease the query runtime.
What should you configure?

A.
static partitioning

B.
bucket sampling

C.
parallel execution

D.
dynamic partitioning

Explanation:
https://www.qubole.com/blog/5-tips-for-efficient-hive-queries/

← Previous question

Next question →

Leave a Reply 0