Hacking with Apache Spark
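Apache Spark is a distributed cluster computing framework that enables parallel processing of large amounts of data. Spark is often deployed alongside Hadoop and can read from and write to the Hadoop Distributed File System (HDFS), but it does not depend on HDFS and also works with other storage systems.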
