Written version: https://blog.rockthejvm.com/repartiti...
This video is for Spark programmers who know the basics and are ready to dive a little deeper. We'll talk about a fundamental Spark distinction: the two ways of redistributing data across partitions. We'll see how they're similar and how they differ, run a small performance test, and then explain the results.
This is one of the many techniques we talk about in the Spark Optimization series at Rock the JVM.
Follow Rock the JVM on:
LinkedIn: rockthejvm
Twitter: rockthejvm
Blog: https://rockthejvm.com/blog
-------------------------------------------------------------------------
Home: https://rockthejvm.com
-------------------------------------------------------------------------