A Resilient Distributed Dataset (RDD) is an immutable, distributed collection of records that is partitioned across the nodes of a cluster. It is the fundamental data structure of Spark and was already the core API in Spark 1.0. In the Scala and Java APIs an RDD carries its element type, so it provides compile-time type safety.
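The three properties above (partitioning, immutability, and compile-time typing) can be illustrated with a minimal sketch. It assumes Spark is on the classpath and runs in local mode; the object name `RddSketch`, the helper names `makeWords` and `lengths`, and the sample words are made up for illustration and are not part of Spark.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

// Hypothetical example object; only SparkConf, SparkContext and RDD come from Spark.
object RddSketch {
  // Distributes a small local collection across the requested number of partitions.
  def makeWords(sc: SparkContext, numPartitions: Int): RDD[String] =
    sc.parallelize(Seq("spark", "rdd", "partition", "immutable"), numPartitions)

  // A transformation returns a new RDD; the input RDD is not modified.
  def lengths(words: RDD[String]): RDD[Int] =
    words.map(_.length)

  def main(args: Array[String]): Unit = {
    // Assumption: local mode with two worker threads, for demonstration only.
    val conf = new SparkConf().setAppName("rdd-sketch").setMaster("local[2]")
    val sc = new SparkContext(conf)
    try {
      val words = makeWords(sc, numPartitions = 2)
      val lens = lengths(words)

      println(s"partitions: ${words.getNumPartitions}")
      println(s"lengths: ${lens.collect().mkString(", ")}")
      println(s"original words still intact: ${words.collect().mkString(", ")}")

      // The element type is part of the RDD's static type, so the compiler
      // rejects the line below (Int has no toUpperCase method):
      // lens.map(_.toUpperCase)
    } finally {
      sc.stop()
    }
  }
}
```

Because `map` returns a new `RDD[Int]` rather than changing `words`, the original RDD can still be collected afterwards, which is what immutability means in practice here.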