Spark Deployment Modes
Standalone/Cluster/Client
Spark Driver & Executors
Executor allocation and interaction with resource manager
Dynamic Allocation
Code serialisation
When code runs on driver vs executors
Spark Driver & Executor Resources
Choosing resources - executors and drivers
Tuning resources
Problem solving - out-of-memory (OOM) errors etc.
Jobs, Stages and Tasks
Recap on Jobs, Stages and Tasks - show in Spark UI
RDD, DataFrame and Dataset APIs
DataFrames and Row classes
Spark UI & History Server
Walkthrough of Spark jobs
Query Plans