PySpark for absolute beginners - part 1

Published: 31 October 2024
on channel: ETL-SQL

This video series will help you learn PySpark from the very beginning.
In this video I explain the key concepts you need to get started:
1) Compute
2) Storage
3) Parallelism
4) Distribution
5) DataFrame
6) Parquet
We will also sign up for Databricks Community Edition and set up an environment to run a PySpark application.
Databricks Community Edition link: https://community.cloud.databricks.com
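
To picture parallelism and distribution (concepts 3 and 4) before opening Databricks, here is a minimal sketch in plain Python. It is not PySpark and not code from the video: the function names are made up for illustration, and threads on one machine stand in for the workers that Spark would run across a cluster. The idea is the same, though: split the data into partitions, process each partition independently, then combine the partial results.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(rows, num_partitions):
    """Distribution: deal the rows round-robin into num_partitions chunks."""
    return [rows[i::num_partitions] for i in range(num_partitions)]


def partial_sum(partition):
    """The work done independently on a single partition."""
    return sum(partition)


def parallel_sum(rows, num_partitions=4):
    """Sum rows by processing each partition in its own worker."""
    partitions = split_into_partitions(rows, num_partitions)
    # Parallelism: every partition is handed to a separate worker thread.
    # (Hypothetical stand-in for Spark executors on different machines.)
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partials = list(pool.map(partial_sum, partitions))
    # Combine the per-partition results into the final answer.
    return sum(partials)


if __name__ == "__main__":
    data = list(range(1, 101))
    print(parallel_sum(data))  # 5050
```

In Spark, a DataFrame is split into partitions in a comparable way, and the per-partition work is scheduled across the cluster for you.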

video timeline:
00:00 Introduction - PySpark for beginners
00:17 First two concepts - Compute and Storage
03:18 Two more concepts - Parallelism & Distribution
05:42 Databricks Community Edition - Signup
11:05 Two more concepts - DataFrame & Parquet