Quality Seal Emagister EMAGISTER CUM LAUDE

Big Data Analysis with Spark - University of California

edX
Online
5 opinioni

Gratis

Informazione importanti

  • Corso
  • Online
  • Durata:
    4 Weeks
  • Quando:
    Flessible
Descrizione

Learn how to apply data science techniques using parallel programming in Spark to explore big data. With an apprenticeship you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.With this course you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.

Informazione importanti

Requisiti: Programming background and experience with Python required. All exercises will use PySpark (part of Apache Spark). Previous experience with Spark equivalent to CS105x: Introduction to Spark required.

Sedi

Dove e quando

Inizio Luogo
Flessible
Online

Opinioni

X

21/12/2016
Il meglio Great hands-on lab to kick you off rapidly. Be that as it may, the instructional class is not all that identified with the lab. Better bring it with a book on Spark.

Da migliorare No negative aspects.

Corso realizzato: Dicembre 2016 | Recomendarías este centro? Sí.
E

22/12/2016
Il meglio Incredible course association, particularly the harmony amongst hypothesis and practice. A few undertakings were too simple and some were not clear at in the first place, but rather piazza look normally made a difference. I consider this is a decent pyspark instructional exercise with clarification of start key elements.

Da migliorare N/A.

Corso realizzato: Dicembre 2016 | Recomendarías este centro? Sí.
E

23/12/2016
Il meglio A considerable measure of duplicacy with the 2 different courses of the xSerie. I would not prompt taking this course on the off chance that you took them. The remainder of the 4 weeks comprises of just 20 minutes of video clarifying extremely essential statistical ideas.

Da migliorare Nothing.

Corso realizzato: Dicembre 2016 | Recomendarías este centro? Sí.

Cosa impari in questo corso?

Data analysis
Programming
Big Data
Spark
Science Techniques

Programma

Organizations use their data to support and influence decisions and build data-intensive products and services, such as recommendation, prediction, and diagnostic systems. The collection of skills required by organizations to support these functions has been grouped under the term ‘data science’.

This statistics and data analysis course will attempt to articulate the expected output of data scientists and then teach students how to use PySpark (part of Spark) to deliver against these expectations. The course assignments include log mining, textual entity recognition, and collaborative filtering exercises that teach students how to manipulate data sets using parallel processing with PySpark.

This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises will use PySpark (the Python API for Spark), and previous experience with Spark equivalent to Introduction to Spark, is required.