Resilience Distributed Dataset, RDD is an immutable distributed dataset which partitioned across the cluster nodes. RDD released with Spark1.0. It is the fundamental data structure of Spark. RDD provides compile time type safety.

Related Articles