sparkRDD RDDResilient Distributed DatasetSparkSpark is a unified analytics engine for large-scale data processing. It provides hh