0

Pyspark glom()

I do understand that it returns RDD coalescing all elements within each partition into a list. What happens when we don’t specify the num of partition, is there is a default? where do we actually use it?

action pyspark transformation dataengineering

18th Jun 2024, 3:28 AM

Chethana

1 Antwort

+ 1

Have you tried looking at the documentation? The glom() method does not have any arguments. https://spark.apache.org/docs/latest/api/JUMP_LINK__&&__python__&&__JUMP_LINK/reference/api/pyspark.RDD.glom.html https://stackoverflow.com/questions/24996302/setting-sparkcontext-for-pyspark https://stackoverflow.com/questions/65489387/whats-the-meaning-of-num-slices-parameter-in-sc-parallelize

18th Jun 2024, 4:20 PM

Tibor Santa

Heute heiß

1 Votes

Does anyone have the solution for this challenge?

1 Votes

How would you solve the part of the C# Intermediate code project that requires operator overloading?

0 Votes

Why does coding take so long to learn

0 Votes

Solved Ai generated practice the last question

0 Votes

0 Votes

Solved# Survey data format in coding for data

0 Votes

How to add unordered lists in HTML.

0 Votes

What is the use of .kt classes in the React Native project

0 Votes

Solved #Relay race coding for unit 9 you are creating code for a relay race

0 Votes