BIG DATA - HADOOP FOR BEGINNERS

Apache Hadoop
  •    Introduction to Big Data and Hadoop fundamentals

  •    Dimensions of Big Data

  •    Types of Data generation

  •    Hadoop and its features

  •    Hadoop Ecosystem

  •    HDFS core components

  •    HDFS architecture

  •    Basic installation

  •    HDFS read and write operation

  •    HDFS basic commands

MapReduce Framework and Apache YARN
  •    MapReduce design flow

  •    MRv1 vs. MRv2 architecture

  •    MapReduce Program execution

  •    Types of Input formats and Output formats

  •    MapReduce Data types

Apache Hive
  •    Hive architecture

  •    Hive Installation

  •    Hive data types and table types

  •    DML / DDL commands

  •    Hive Partitioning

  •    Hive Bucketing

Apache Pig
  •    Introduction to Pig

  •    Pig Installation

  •    Pig execution models and storage concepts

  •    Pig basic commands

  •    Pig script execution

Apache Sqoop
  •    Introduction to Sqoop concepts

  •    Sqoop architecture design

  •    Sqoop Installation

  •    Sqoop Import concepts

  •    Sqoop Export concepts

  •    Create MySQL database for import to HDFS

  •    Sqoop command execution

Apache Oozie
  •    Introduction to Oozie and features

  •    Oozie Installation

  •    Basic Oozie commands

  •    Working with Oozie Actions and Coordinators

Apache Flume
  •    Introduction to Flume and features

  •    Flume topology and core concepts

  •    Flume Installation

  •    Flume Sources and Sinks

Apache ZooKeeper
  •    Introduction to ZooKeeper

  •    Principles of ZooKeeper and its usage in the Hadoop framework

Apache HBase
  •    Introduction to HBase

  •    NoSQL/CAP theorem

  •    HBase design and architecture

  •    HBase Installation

  •    HBase commands (Table)

  •    HBase + Hive integration

  •    HBase + Phoenix integration

  •    HBase execution (Shell and Java API)

Distribution
  •    Distribution (Cloudera) Overview

  •    Proofs of concept (POCs)