Hadoop, (Part 1 of 4): Introduction and HDFS
Interactive

Hadoop, (Part 1 of 4): Introduction and HDFS

Biz Library
Updated Feb 04, 2020

Join Hadoop expert Kevin McCarty as he takes a high level look at Hadoop beginning with its history. Next, McCarty examines a number of key components in the Hadoop ecosystem used for storage, processing, data ingest, and transformation. He will show how Hadoop addresses problems that plague large systems such as failover and redundant storage as well as explore how an organization might incorporate Hadoop into their existing IT framework.


Lesson 1:

  • Hadoop Sandbox
  • PuTTY
  • Linux
  • Linux BASH
  • Linux Commands
  • WinSCP
  • Notepad++
  • Demo: Open Source Tools
  • Demo: BASH
  • Demo: Other Tools.

Lesson 2:

  • The Lure of Big Data
  • What Is Big Data?
  • Where Do We Get Big Data?
  • Types of Big Data
  • Managing Big Data
  • The Goal of Big Data
  • Companies Using Big Data Today
  • The Challenge of Big Data
  • How Do We Process Big Data?.

Lesson 3:

  • The Motivation for Hadoop
  • Enter Hadoop
  • History of Hadoop
  • What Hadoop Provides
  • Major Users of Hadoop
  • The Future.

Lesson 4:

  • Hadoop's Architecture
  • HDFS - Name Node
  • HDFS - DataNode
  • Hadoop Architecture
  • Job and Task Tracker
  • Hadoop Architecture.

Lesson 5:

  • Hadoop Ecosystem
  • Zookeeper - Motivation
  • What Is ZooKeeper
  • Data Ingest
  • Apache Flume
  • Flume Overview
  • Apache Sqoop
  • Sqoop Example
  • Pig
  • Example Pig Script
  • Apache Hive
  • Example Hive Script
  • HBase
  • HBase Is...
  • Oozie - Job Workflow
  • Mahout
  • Mahout Use Cases.

Lesson 6:

  • Hadoop in the Enterprise
  • Demo: Hadoop Sandbox.

Lesson 7:

  • Motivation for HDFS
  • Hadoop Distributed Architecture
  • HDFS
  • HDFS - Nodes
  • HDFS NameNode
  • HDFS - Secondary NameNode
  • HDFS - Standby NameNode
  • HDFS - DataNode
  • Demo: HDFS
  • Demo: Other Commands.