Scaling Machine Learning Workflows to Big Data with Fugue

This course teaches how to scale machine learning workflows to big data using Fugue. By the end, learners should understand when to move from Pandas to Spark or Dask as data grows, how to use Fugue to port Python code with minimal changes, and how to write framework-agnostic code that runs in different execution environments. Topics include Spark transformations, implementing code with Fugue, Spark's lazy evaluation, partitioning, and decoupling logic from execution. The course is demo-driven, pairing examples with explanations. It is aimed at data scientists, machine learning engineers, and anyone interested in scaling computation from a single machine to a Spark cluster.
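To make "partitioning" and "decoupling logic and execution" concrete, here is a minimal sketch in plain Python. It does not use Fugue, Spark, or Dask, and it is not taken from the course. Every name in it (`demean`, `partition_by`, `run_local`, `run_threaded`) is hypothetical and chosen for illustration. The idea it tries to show is that the business logic is written once as an ordinary function over one partition of rows, while the choice of how to run it across partitions is made separately.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]
Logic = Callable[[List[Row]], List[Row]]


def demean(rows: List[Row]) -> List[Row]:
    """Business logic: subtract the partition mean from each value.

    It knows nothing about how or where it will be executed.
    """
    mean = sum(r["value"] for r in rows) / len(rows)
    return [{**r, "demeaned": r["value"] - mean} for r in rows]


def partition_by(rows: List[Row], key: str) -> List[List[Row]]:
    """Split rows into partitions that share the same key value."""
    groups: Dict[Any, List[Row]] = defaultdict(list)
    for r in rows:
        groups[r[key]].append(r)
    return list(groups.values())


def run_local(logic: Logic, partitions: List[List[Row]]) -> List[Row]:
    """Hypothetical 'engine' 1: apply the logic to partitions one by one."""
    return [row for part in partitions for row in logic(part)]


def run_threaded(logic: Logic, partitions: List[List[Row]]) -> List[Row]:
    """Hypothetical 'engine' 2: apply the same logic on a thread pool."""
    with ThreadPoolExecutor() as pool:
        # pool.map yields results in the order of the input partitions
        return [row for out in pool.map(logic, partitions) for row in out]


data = [
    {"group": "a", "value": 1.0},
    {"group": "a", "value": 3.0},
    {"group": "b", "value": 10.0},
]
parts = partition_by(data, "group")

# Same logic, two execution strategies, same result.
local_result = run_local(demean, parts)
threaded_result = run_threaded(demean, parts)
```

Swapping `run_local` for `run_threaded` changes only how the work is executed; `demean` is untouched. A framework-agnostic layer applies the same separation with a real distributed engine in place of the thread pool.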

https://www.classcentral.com/course/youtube-scaling-machine-learning-workflows-to-big-data-with-fugue-kevin-kho-prefect-han-wang-lyft-238441

