Big Data & Hadoop

About Course

Traditional Data storage techniques can’t store or process large volume of structured and unstructured Data sets. Those data are referred as Big Data.  Hadoop on the other hand is a tool that is used to handle big data. It is an open-source framework manufactured by the Apache Software Foundation.

Affluenz IT Academy will emphasize on how to design distributed applications to manage “Big Data” using Hadoop. This will also detail how to use Pig and Spark to write scripts in order to process data sets on Hadoop cluster

Key Features

40 hours of instructor led online training

2 Live Industry Project

Certificate of completion

Dedicated awareness session from Industry experts

Questions On Your Mind About Our Courses

According to IDC, by 2020, organizations that can analyze all relevant data and deliver actionable information will earn $430 billion more than their less analytically oriented peers. As of today, there is a surplus demand for people who are Hadoop 2.0 certified

Affluenz IT Academy crafts this course to all who aspire to build a career in Big Data like Data Scientist, AI Engineer, Business Analyst, Information architect and so on. We recommend this course to

  • Developers aspiring to be a data scientist or machine learning engineer
  • Analytics managers who are leading a team of analysts 
  • Business analysts who want to understand data science techniques
  • Information architects who want to gain expertise in machine learning algorithms 
  • Analytics professionals who want to work in machine learning or artificial intelligence
  • Graduates looking to build a career in data science and machine learning
  • Experienced professionals who would like to harness machine learning in their fields to get more insights

Participants in this online course should have:

  • Basic understanding on Data structure and Data Science
  • Basics of statistics and mathematics

Our Syllabus

4 Weeks Course

  1. What is Big Data
  2. Big Data Learning Techniques
  3. 5C Architecture – connection, conversion, cyber cognition and configuration
  1. Basics of Hadoop

  2. Hadoop Ecosystem

  1. 2.x Core Components

  2. Hadoop Storage: HDFS (Hadoop Distributed File System)

  3. Hadoop Processing: MapReduce Framework

  1. Hadoop 2.x Cluster Architecture

  2. Federation and High Availability Architecture

  3. Typical Production Hadoop Cluster

  4. Hadoop Cluster Modes

  5. Common Hadoop Shell Commands

  1. Certification of Completion

  2. Review and Feedback

8 Weeks Course

  1. What is Big Data
  2. Big Data Learning Techniques
  3. 5C Architecture – connection, conversion, cyber cognition and configuration
  1. Basics of Hadoop

  2. Hadoop Ecosystem

  1. 2.x Core Components

  2. Hadoop Storage: HDFS (Hadoop Distributed File System)

  3. Hadoop Processing: MapReduce Framework

  1. Hadoop 2.x Cluster Architecture

  2. Federation and High Availability Architecture

  3. Typical Production Hadoop Cluster

  4. Hadoop Cluster Modes

  5. Common Hadoop Shell Commands

  6. Hadoop 2.x Configuration Files

  7. Single Node Cluster & Multi-Node Cluster set up

  8. Basic Hadoop Administration

  1. Traditional way vs MapReduce way

  2.  Why MapReduce

  3. YARN Components

  4. YARN Architecture

  5. YARN MapReduce Application Execution Flow

  6. YARN Workflow

  7. Anatomy of MapReduce Program

  8. Input Splits, Relation between Input Splits and HDFS Blocks

  9. MapReduce: Combiner & Partitioner

  10. Demo of Health Care Dataset

  11. Demo of Weather Dataset

  1. Certification of Completion

  2. Internship Certificate

  3. Review and Feedback

Pricing

Online Classroom with virtual Internship

9999
  • 3 Weeks course
  • 1 Week Live Project Internship
  • Certificate of completion
  • Certficate of Internship

Online Classroom with Live Project Internship

14,999
  • 6 Weeks course
  • 2 Weeks Corporate Internship
  • Certificate of completion
  • Certficate of Internship

Corporate Training

Contact
  • Self paced + Instructor Led classes
  • Flexible Pricing option