Repartition vs Coalesce in Apache Spark | Rock the JVM

Published: 31 December 2024
Channel: Rock the JVM

Written version: https://blog.rockthejvm.com/repartiti...

This video is for Spark programmers who know the basics and are ready to dive a little deeper. We'll look at a fundamental distinction in Spark between two ways of redistributing data across partitions: repartition and coalesce. We'll see how they are similar and how they differ, run a small performance test, and then explain the results.

This is one of the many techniques we talk about in the Spark Optimization series at Rock the JVM.

Follow Rock the JVM on:
LinkedIn: / rockthejvm
Twitter: / rockthejvm
Blog: https://rockthejvm.com/blog

-------------------------------------------------------------------------
Home: https://rockthejvm.com
-------------------------------------------------------------------------