SUMMARY

Overview
Linking with Spark
Initializing Spark
Using the Shell
Resilient Distributed Datasets (RDDs)
    Parallelized Collections
    External Datasets
    RDD Operations
        Basics
        Passing Functions to Spark
        Working with Key-Value Pairs
        Transformations
        Actions
    RDD Persistence
        Which Storage Level to Choose
        Removing Data