The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.
This course is meant to be an overview of Spark and its associated ecosystem. For deeper understanding of Spark, we recommend that students take the Spark Fundamentals courses I and II.