Data Science Novice to Professional Training Course

1795 registered students with an average rating of 5.0
  •  

    Demo Class

    • It takes just 15 seconds to register for a free webinar on the Data Science Novice to Professional course.

  •  

    Course Summary

    • TBD

  •  

    Course Duration

    • TBD

  •  

    Course Objectives

    • TBD

  •  

    Additional Notes

    • TBD

  •  

    Who can attend this course?

    • TBD

  •  

    Course Outline

    • Unit 1:
       Using Hadoop for Data Science
       Use Cases of Data Science
       What is Data Science?
       What is a Data Scientist?
       The Hadoop & Data Science Ecosystem
       The Hortonworks Data Platform (HDP)
       Quiz
       Summary
       
      Unit 2:
       Hadoop Architecture
       What is HDFS?
       HDFS Components
       Understanding Block Storage
       Overview of MapReduce
       Understanding MapReduce
       WordCount in MapReduce
       Hadoop Streaming
       Running a Hadoop Streaming Job
       What is YARN?
       The Components of YARN
       A Cluster View Example
       Quiz
       Summary
       
      Unit 3:
       Machine Learning
       What is Machine Learning?
       The History of Machine Learning
       What’s New in ML?
       Supervised vs Unsupervised Learning
       Six Machine Learning Tasks
       Task #1: Clustering
       Task #2: Outlier Detection
       Task #3: Affinity Analysis
       Task #4: Classification
       Task #5: Regression Analysis
       Task #6: Recommendation
       Machine Learning and Hadoop
       What is Mahout?
       The Mahout Algorithms
       Quiz
       Summary
       
       
      Unit 4:
       Introduction to Pig
       What is Pig?
       Pig Latin
       The Grunt Shell
       Demonstration: Understanding Pig
       Pig Latin Relation Names
       Pig Latin Field Names
       Pig Data Types
       Pig Complex Types
       Defining a Schema
       The GROUP Operator
       GROUP ALL
       Relations without a Schema
       The FOREACH GENERATE Operator
       Specifying Ranges in FOREACH
       Field Names in a FOREACH
       FOREACH with Groups
       The FILTER Operator
       The LIMIT Operator
       Quiz
       Summary
       
       Unit 5:
       Python Programming
       Overview of Python
       A Simple Python Program
       Defining Variables
       Data Types
       Lists
       List Functions
       The range Function
       List Comprehensions
       Tuples
       The Slice Notation
       Dictionaries
       If/else Statement
       Strings
       String Manipulation
       Defining Functions
       Function Return Values
       The lambda Keyword
       Python Modules and Packages
       Importing Modules
       Developing Python Code
       Quiz
       Summary
       
       
       Unit 6:
       Analyzing Data with Python
       The Scientific Python Ecosystem
       Overview of NumPy
       The NumPy ndarray
       NumPy Universal Functions
       Other NumPy Modules
       Demo: The NumPy Package
       Overview of pandas
       pandas Data Structures
       Series
       DataFrames
       Demo: The pandas Library
       The SciPy Library
       matplotlib
       A matplotlib Example
       Quiz
       Summary
       
       Unit 7:
       Running Python on Hadoop
       Options for Running Python on Hadoop
       Overview of Pig UDFs
       UDF Libraries
       An Example of Using a UDF
       Writing a Pig UDF in Python
       Specifying the outputSchema
       Taking Advantage of Python
       Invoking a UDF
       The Pig STREAM Command
       Defining a Streaming Alias
       Quiz
       Summary
       
       Unit 8:
       Implementing Machine Learning
       Tools for Machine Learning
       Overview of Scikit-learn
       Scikit-learn Algorithm Cheat-sheet
       Support Vector Machines (SVM)
       Support Vector Classification
       Naive Bayes for Classification
       Nearest Neighbors
       Brute Force vs Tree-based Nearest Neighbors
       Demo: Classification with Scikit-learn
       Support Vector Regression
       Decision Trees
       Demo: Regression with Scikit-learn
       Clustering Algorithms
       Comparing Clustering Algorithms
       Demo: Clustering with Scikit-learn
       Challenges of Machine Learning on Hadoop
       The K-Nearest Neighbor Lab
       Implementing K-Means Clustering on Hadoop
       Quiz
       Summary  
       
       Unit 9:
       Natural Language Processing
       What is NLP?
       NLP in Big Data
       Common Tasks in NLP
       Optical Character Recognition
       Sentence Segmentation
       Part-of-speech Tagging
       Named Entity Recognition
       Topic Modeling
       NLTK - Natural Language Toolkit
       Classifying Text
       The NaiveBayesClassifier
       Decision Trees
       Demonstration: POS Tagging using a Decision Tree
       Quiz
       Summary
       
       
       Unit 10:
       MLlib
       What is Spark?
       Understanding Spark Applications
       The SparkContext
       Spark RDDs
       Creating RDDs
       RDD Operations
       Transformations
       Examples of Transformations
       Actions
       Examples of Actions
       WordCount in Spark
       What is MLlib?
       Data Science Algorithms of MLlib
       K-Means Clustering
       Naive Bayes Classification
       Linear Least Squares Regression
       Quiz
       Summary
       
       Appendix A:
       Where to Learn More About Data Science
       How can I learn more?
       Books on Data Science
       Free online classes (MOOCs)
       Publicly available datasets
       
       Appendix B:
       Unit Quiz Answers
       

  •  

    Course Testimonials

      • Online delivery rocks. -eBay Inc.
      • Vulab's virtual classroom has allowed our employees to attend the training from any location. -Rigus Inc.
      • I got my break into the IT industry with Vulab. -Jawahar
      • After my Spring Hibernate training from Vulab, I have become very productive at work and am able to execute projects at a faster pace with a stable code base. -Manager at Cisco
  •  

    Free Online Video Access

    • All registered students can attend the live sessions and have free access to videos during training. Vulab provides every student with access to our fantastic student tool, Trainingram™. Students can attend each live session with a live instructor and ask questions. They can also review the class by viewing the same session in HD format using Trainingram™. Students can access the course resources at any time from any device. Trainingram™ supports the Apple iPhone, the Apple iPad, any other mobile device, and the PC.

      • Access student projects online using Trainingram™ from your PC or tablet.
      • Access class notes in PDF format using Trainingram™.
      • Access course videos and prerequisite videos online using Trainingram™.
      • Ask your instructor any question using the Forum.
  •  

    Online Vulab Students-Only Forum

    • Every student will have access to a Forum dedicated to Vulab students. A student can post questions to the Forum as well as browse the questions and answers posted by other students.

  •  

    Key Features

    • TBD

  •  

    Testimonials

    • TBD

  •  

    Trademark Notice

    • TBD

Take this course

Step 1: Choose a Plan

LEARNER

$1,500.00

  • Data Science Novice to Professional Live Instructor-Led Training +
  • Free E-Learning platform access during training

CERTIFIED PROFESSIONAL

$2,250.00

  • VERIFIED CERTIFICATE +
  • Data Science Novice to Professional Live Instructor-Led Training +
  • Free 1-Year E-Learning platform access

CERTIFIED PROJECT PROFESSIONAL

$2,850.00

  • PROJECT +
  • VERIFIED CERTIFICATE +
  • Data Science Novice to Professional Live Instructor-Led Training +
  • 1-Year E-Learning platform access

SUPPORTED CERTIFIED PROJECT PROFESSIONAL WITH E-LEARNING

$3,750.00

  • E-Learning platform lifetime access +
  • Audit course multiple times +
  • 4 Hours of One-to-One instructor support +
  • PROJECT +
  • VERIFIED CERTIFICATE +
  • Data Science Novice to Professional Live Instructor-Led Training

Step 2: Choose a Schedule

Step 3: Register