Spark lab / workshop
19 Sep 2016
I did a introduction to Spark lab for the employees of Jayway. It contains method stubs which should be implemented as a way to learn the different parts of Spark. The goal was to minimize setup hassle and thus there’s (almost) no installation / configuration needed and everything run locally.
The workshop was divided into five parts:
- Basic RDD functions.
- Pair RDD functions.
- Shared variables - accumulator and broadcast variables.
- Partitioner.
- Iterative / recursive function.