Monday, February 11, 2019

PySpark - getNumPartitions


It is easy to find the no of partitons in RDD , this can be achieved by using the getnumberPartition() method. We will see the following by writing a simple code :-

In the above example , we are checking the number of partitions and we can see that in both the cases the number of partitions are 1.

No comments:

Post a Comment

Hadoop - What is a Job in Hadoop ?

In the field of computer science , a job just means a piece of program and the same rule applies to the Hadoop ecosystem as wel...