Big Data Analytics Using Spark

Jun 2, 2024

—

Analyzing big datasets requires a cluster of tens, hundreds, or thousands of computers. Using such clusters effectively requires distributed file systems, such as the Hadoop Distributed File System (HDFS), and corresponding computational models, such as Hadoop, MapReduce, and Spark.

In this course, part of the Data Science MicroMasters program, you will learn what the bottlenecks are in massively parallel computation and how to use Spark to minimize them.

https://www.classcentral.com/course/big-data-the-university-of-california-san-diego-b-8221

