Hadoop, Part 4 of 4: HBase and MapReduce
Interactive

Hadoop, Part 4 of 4: HBase and MapReduce

LearnNow Online
Updated Aug 21, 2018

Course description

Big Data development is a growing field and understanding how technologies such as the MapReduce Combiner and HBase can make data analysis easier is important. Learn how to use the Combiner, look deeply at HBase, a NoSQL tool for managing the storage and retrieval of large and diverse data sets, and create a starter app to use HBase in Java.

Each LearnNowOnline training course is made up of Modules (typically an hour in length). Within each module there are Topics (typically 15-30 minutes each) and Subtopics (typically 2-5 minutes each). There is a Post Exam for each Module that must be passed with a score of 70% or higher to successfully and fully complete the course.


Prerequisites

You should have some programming background and some familiarity with a Unix-based operating system. No specific experience with Java programming language or Hadoop is required. As with any such course, the more experience you bring to the course, the more you’ll get out of it. This course moves quickly through a broad range of topics, but it does not require any prior experience with Hadoop. The course does assume that you are well familiarized with how to use the version of Windows that you are running. For example, the course might say simply “Open PuTTY” without explaining how to do that. You should also be able to navigate the folder hierarchy using Windows Explorer.


Meet the expert

Kevin McCarty

Kevin McCarty is a computer professional with over 30 years of experience in the industry as a programmer, project manager, database administrator, architect, and data scientist. He is a Microsoft Certified Trainer with over 25 individual certifications in programming and database technologies and serves as the chapter leader of the Boise SQL Server Users Group. A former Army officer and Eagle Scout, he holds a doctorate in Computer Science and a lifelong love of learning.

Video Runtime

139 Minutes

Time to complete

248 Minutes

Course Outline

MapReduce and HBase

MapReduce Combiner (07:22)

  • Introduction (00:41)
  • MapReduce Components (00:52)
  • Combiner (00:35)
  • Why Use a Combiner? (01:41)
  • How the Combiner Works (00:51)
  • Combiner, Reducer Differences (01:56)
  • Summary (00:44)

MapReduce Combiner Demo (19:17)

  • Introduction (00:30)
  • Demo: Mapper Stub (04:20)
  • Demo: Reducer & Combiner Stubs (01:59)
  • Demo: Main (01:14)
  • Demo: Fill in Methods (04:28)
  • Demo: Run WordCountCombiner (05:56)
  • Summary (00:47)

HBase Basics (12:37)

  • Introduction (00:42)
  • Relational vs. Non-Relational Databases (02:45)
  • What Is HBase? (01:29)
  • What Does HBase Do? (04:32)
  • Other HBase Features (01:00)
  • The HBase Shell (01:10)
  • Summary (00:55)

HBase Shell Intro (18:03)

  • Introduction (00:38)
  • Demo: The HBase Shell (03:09)
  • Demo: Ambari (03:18)
  • Demo: Build a Table (03:58)
  • Demo: Add Data (06:13)
  • Summary (00:43)

HBase Lifecycle Architecture (12:03)

  • Introduction (00:42)
  • HBase Architecture (01:12)
  • HBase Scalability (02:07)
  • HBase Durability (00:52)
  • HBase Consistency (00:44)
  • HBase Input/Output (00:55)
  • Working with Data (00:53)
  • Partitioning a Table (00:57)
  • Serving Regions (00:50)
  • Partitioning Advantage (00:44)
  • Region Server Failover/Load Balancing (01:24)
  • Summary (00:39)

HBase Lifecycle Services (18:06)

  • Introduction (00:46)
  • The HBase Write Path (00:23)
  • HBase MemStore (00:44)
  • HBase Write-Ahead Log (03:12)
  • HBase on a Single Server (00:40)
  • Scaling HBase (00:44)
  • The Flush Operation (02:15)
  • HBase Cache (01:31)
  • Read Optimization (00:36)
  • Minimizing Disk Seeks (01:29)
  • HBase Compaction (00:50)
  • HBase Load Balancing (00:48)
  • HBase Split Operation (00:46)
  • HBase Scalability (00:36)
  • HBase Limitations (01:42)
  • Summary (00:57)
Creating an HBase App

Creating an Hbase App (28:45)

  • Introduction (00:33)
  • HBase Archetypes (01:07)
  • What Is a Good HBase Archetype? (01:23)
  • Effective Design (00:31)
  • Effective Design of Queries (03:11)
  • Row-Key Design (01:52)
  • Effective Application Design (02:00)
  • Entities in HBase (00:56)
  • Working with Simple Entities (01:41)
  • Simple Entity Guidelines (01:33)
  • Working with Linked Entities (01:21)
  • Creating a Linked Entity (01:28)
  • Linked Entity Guidelines (04:31)
  • Compound Key (00:48)
  • Compound Time Series (00:52)
  • HBase Techniques for Historical Data (02:53)
  • Considerations for Timestamps (01:03)
  • Summary (00:54)

Creating an Hbase App Demo (23:20)

  • Introduction (00:44)
  • Demo: Setup (04:06)
  • Demo: HBase Methods (03:52)
  • Demo: Flesh Out Methods (05:19)
  • Demo: Main Method (01:05)
  • Demo: HBase Environment (03:33)
  • Demo: Run HBase App (04:05)
  • Summary (00:34)

Bundle Code: LNO1234