Data Science and Engineering with Spark

Price: 1927 EUR

More details about the program

Description

The Data Science and Engineering with Spark XSeries, created in partnership with Databricks, teaches students how to perform data science and data engineering at scale using Spark, a cluster computing system well suited to large-scale machine learning tasks. It also presents an integrated view of data processing by highlighting the components of a data analysis pipeline: exploratory data analysis, feature extraction, supervised learning, and model evaluation.

Students will gain hands-on experience building and debugging Spark applications. The series also covers the internals of Spark and of distributed machine learning algorithms, giving students intuition about working with big data and developing code for a distributed environment.

This XSeries requires a programming background and experience with Python (or the ability to learn it quickly). All exercises use PySpark (the Python API for Spark), but previous experience with Spark or distributed computing is not required. Familiarity with basic machine learning concepts and exposure to algorithms, probability, linear algebra and calculus are prerequisites for two of the courses in this series.

Specific details

Category of Education: Computer Science and IT

University

University of California, Berkeley


©2023 EDUCOM NET. All Rights Reserved.

If you find an inaccuracy or have comments on the description of the university or program, please let us know at info@educom.net