Big Data Hadoop

Eligibility: Best suited for B.Tech/BE 1st, 2nd and 3rd year students (all streams)
Duration: 45 Days

Course Curriculum

  • Hadoop 2.x Cluster Architecture
  • Federation and High Availability
  • Hadoop Cluster Modes
  • Hadoop 2.x Configuration Files
  • Single node cluster and Multi node cluster
  • Hadoop 2.x MapReduce Architecture
  • Hadoop 2.x MapReduce Components
  • Anatomy of MapReduce Program
  • Relation between Input Splits and HDFS Blocks
  • MapReduce: Combiner & Partitioner
  • Demo on de-identifying Health Care Data set
  • Counters, Distributed Cache
  • MRUnit, Reduce Join
  • Custom Input Format
  • Sequence Input Format
  • XML File Parsing using MapReduce
  • MapReduce Vs Pig
  • Pig Use Cases
  • Programming Structure in Pig
  • Pig Running Modes
  • Pig Latin Program
  • Data Models in Pig
  • Built-in Functions (Eval, Load and Store, Math, String)
  • Pig Streaming
  • Aviation Use Case in Pig
  • Hive Background
  • Hive Use Case, About Hive
  • Hive Architecture and Components
  • Comparison with Traditional Database
  • Hive Data Types and Data Models
  • Partitions and Buckets
  • Managing Outputs
  • Hive Demo on Healthcare Data set
  • Hive QL: Joining Tables
  • Dynamic Partitioning
  • Custom Map/Reduce Scripts
  • Hive Indexes and Views, Hive Query Optimizers
  • User Defined Functions
  • HBase Vs RDBMS
  • HBase Components
  • Run Modes & Configuration
  • HBase Cluster Deployment
  • HBase Data Model
  • HBase Shell
  • HBase Client API
  • Data Loading Techniques
  • ZooKeeper Data Model
  • ZooKeeper Service
  • Demos on Bulk Loading, Getting and Inserting Data
  • What is Apache Spark
  • Spark Ecosystem
  • Spark Components
  • History of Spark and Spark Versions/Releases
  • Spark as a Polyglot
  • What is Scala?
  • Why Scala?
  • Spark Context
  • Flume and Sqoop Demo
  • Oozie, Oozie Components
  • Oozie Workflow
  • Scheduling with Oozie
  • Oozie Coordinator
  • Oozie Web Console
  • Oozie for MapReduce, Hive, and Sqoop
  • Combined Flow of MapReduce Jobs
  • Hadoop Project Demo
  • Hadoop Integration with Talend

Features

  • Internship certification from industry
  • Certification by AICRA
  • Working exposure with industry experts
  • Profile sharing with AICRA member companies

Course Information

  • Course Duration: 45 Days
  • Course Fee: 9999 INR