0

Pyspark glom()

I do understand that it returns RDD coalescing all elements within each partition into a list. What happens when we don’t specify the num of partition, is there is a default? where do we actually use it?

18th Jun 2024, 3:28 AM
Chethana
Chethana - avatar
1 Réponse