Ask the Community
Ask any professional question and get answers from other specialists.
hdfs involve extensive disk i/o in its mapreduce process. Therefore if the number of nodes are high, parallelism will be high and query will be efficient. On the other ha ... See More
It depends - if you have partitioning/bucketing defined on tables. And if so how have you used / defined them. The whole idea is (large) that for table join you need to ... See More
Hash Partion is the default partioner in hadoop which is handled by Hadoop internally if no partioner has been defined.