Author: admin
-
Big Data with Scala and Spark
In this 1-hour class you will learn the basics of using Scala with Apache Spark to process Big Data (large data sets). You will work on a set of bicycle sales data to help the marketing team target different demographics. At the end of the course, you will have a good foundation of how to get…
-
Cleaning and Exploring Big Data using PySpark
By the end of this project, you will have learned how to clean, explore and visualize big data using PySpark. You will be using an open-source dataset containing information on all the water wells in Tanzania. I will teach you various ways to clean and explore your big data in PySpark such as changing column’s…
-
Data Lakes for Big Data
Each day an astounding amount of data is generated from just about everything around us – from our mobile devices to our health care provider to where we shop for groceries – just to name a few. Big Data is a term used to describe the volume of data, variety or type – both structured…
-
Orchestrating Big Data with Azure Data Factory
This data analysis course teaches you how to use Azure Data Factory to coordinate data movement and transformation using technologies such as Hadoop, SQL, and Azure Data Lake Analytics. You will learn how to create data pipelines that will allow you to group activities to perform a certain task. https://www.classcentral.com/course/edx-orchestrating-big-data-with-azure-data-factory-7812
-
Big Data for Smart Cities
Cities run on a stream of data. In the smart city, the innovative use of data helps provide better and more inventive services to improve people’s lives and make the entire city run more smoothly. But the data our cities collect nowadays is more massive and varied, and is accessed at higher speeds than ever…
-
Algorithms for Big Data
In this course, you will learn how to design and analyse algorithms in the streaming and property testing models of computation. The algorithms will be analysed mathematically, so it is intended for a mathematically mature audience with prior knowledge of algorithm design and basic probability theory. Traditional algorithms work well when the input data fits…
-
Run a Big Data Text Processing Pipeline in Cloud Dataflow
This is a self-paced lab that takes place in the Google Cloud console. In this lab you will use Google Cloud Dataflow to create a Maven project with the Cloud Dataflow SDK, and run a distributed word count pipeline using the Google Cloud Platform Console. https://www.classcentral.com/course/googlecloud-run-a-big-data-text-processing-pipeli-81457
-
The Ultimate Hands-On Hadoop: Tame your Big Data!
Learn and master the most popular big data technologies in this comprehensive course, taught by a former engineer and senior manager from Amazon and IMDb. We’ll go way beyond Hadoop itself, and dive into all sorts of distributed systems you may need to integrate with. https://www.classcentral.com/course/skillshare-the-ultimate-hands-on-hadoop-tame-your-big-data-84223
-
Hadoop Administration
End-to-end Apache Hadoop administration. What you’ll learn: In this course I will explain Apache Hadoop administration. First, I will introduce big data and Hadoop. Next, I will explain the Hadoop architecture and all the Hadoop daemons, followed by the HDFS filesystem and Apache Hadoop installation in 3 modes: local…
-
Applied Optimization For Wireless, Machine Learning, Big Data
This course focuses on developing the fundamental tools and techniques of modern optimization and illustrating their applications in diverse fields such as Wireless Communication, Signal Processing, Machine Learning, Big Data and Finance. Topics will be covered in the following areas: Wireless: MIMO/OFDM systems, Beamforming, Cognitive Radio and Cooperative Communication; Signal Processing:…