Source: medium.com

Hacking with Apache Spark

Category: Data

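Apache Spark is a distributed cluster-computing framework that processes large amounts of data in parallel. Spark has no storage layer of its own; it is often deployed alongside Hadoop and reads from the Hadoop Distributed File System (HDFS), but it can also run standalone and read from other storage systems.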
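To get a feel for what "processing data in parallel" means, here is a minimal sketch in plain Python. It does not use the Spark API. It only mimics the general idea of splitting a dataset into partitions, working on each partition independently, and combining the partial results. The function names (`split_into_partitions`, `sum_of_squares`, `parallel_sum_of_squares`) are made up for this illustration, and local threads stand in for the worker processes of a real cluster.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_partitions(data, num_partitions):
    """Split a list into roughly equal chunks (hypothetical helper, loosely like partitions)."""
    size = max(1, -(-len(data) // num_partitions))  # ceiling division, at least 1
    return [data[i:i + size] for i in range(0, len(data), size)]


def sum_of_squares(partition):
    """Work done independently on a single partition."""
    return sum(x * x for x in partition)


def parallel_sum_of_squares(data, num_partitions=4):
    partitions = split_into_partitions(data, num_partitions)
    # Assumption: threads on one machine stand in for workers on a cluster.
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partial_results = list(pool.map(sum_of_squares, partitions))
    # Combine the per-partition results into one final value.
    return reduce(lambda a, b: a + b, partial_results, 0)


if __name__ == "__main__":
    print(parallel_sum_of_squares(list(range(1, 11))))
```

A real Spark job follows the same split, process, combine shape, but the partitions live on different machines and the framework takes care of scheduling and of retrying failed tasks.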