
HBase is inspired by Google's Bigtable architecture and is fundamentally a non-relational, open source, column-oriented distributed NoSQL database. Written in Java, it is designed and developed by many engineers under the Apache Software Foundation. Architecturally, it sits on Apache Hadoop and uses the Hadoop Distributed File System (HDFS), a fault-tolerant distributed file system, as its foundation. In addition, it provides advanced features such as auto-sharding, load balancing, in-memory caching, replication, compression, near real-time lookups, and strong consistency (using multi-versioning). It uses block cache and Bloom filters to respond faster to online/real-time requests. It supports multiple clients running on heterogeneous platforms by providing user-friendly APIs.

In this tutorial, we discuss how to effectively set up mid-size and large HBase clusters on top of the Hadoop/HDFS framework, and we help you set up HBase on a fully distributed cluster. For the cluster setup, we use six nodes running RHEL (Red Hat Enterprise Linux 6.2, 64-bit).

This article is an excerpt taken from the book ‘HBase High Performance Cookbook’ written by Ruchir Choudhry. This book provides a solid understanding of the HBase basics.

Configuring and deploying HBase

Before we start HBase in fully distributed mode, we first set up Hadoop-2.2.0 in distributed mode and then set up HBase on top of the Hadoop cluster, because HBase stores its data in HDFS. Let’s get started!

The first step is to create a directory at user/u/HBase B and download the tar file from the location given later. The location can be local, a mount point, or, in cloud environments, block storage. The -b option of wget downloads the tar file as a background process. You can tail the log file using tail -200f wget-log.

Untar it using the following command:
tar -xzvf hadoop-2.2.0.tar.gz
This extracts the archive into a folder named hadoop-2.2.0 in your current directory. Once the untar process is done, it is recommended, for clarity, to use two different folders: one for the NameNode and the other for the DataNode.
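The download and extraction steps this section describes could be scripted roughly as follows. This is a minimal sketch, not the book's own script: `HADOOP_URL`, `NAMENODE_DIR`, and `DATANODE_DIR` are hypothetical placeholders (the text does not give the download location or the folder paths), and the function names are illustrative only.

```shell
#!/usr/bin/env bash
# Sketch only: HADOOP_URL, NAMENODE_DIR and DATANODE_DIR are hypothetical
# placeholders, not values taken from the article.
set -eu

HADOOP_TARBALL="hadoop-2.2.0.tar.gz"
HADOOP_URL="${HADOOP_URL:-}"                 # download location; supply your own
NAMENODE_DIR="${NAMENODE_DIR:-./namenode}"   # assumed path for NameNode data
DATANODE_DIR="${DATANODE_DIR:-./datanode}"   # assumed path for DataNode data

# Start the download as a background process.
# With -b, wget writes its progress to ./wget-log.
# Assumes the URL ends in hadoop-2.2.0.tar.gz so the saved file matches HADOOP_TARBALL.
download_hadoop() {
  if [ -z "$HADOOP_URL" ]; then
    echo "Set HADOOP_URL to the tarball location first" >&2
    return 1
  fi
  wget -b "$HADOOP_URL"
}

# Unpack the tarball; this creates ./hadoop-2.2.0 in the current directory.
extract_hadoop() {
  tar -xzvf "$HADOOP_TARBALL"
}

# Create two separate folders, one for the NameNode and one for the DataNode.
make_storage_dirs() {
  mkdir -p "$NAMENODE_DIR" "$DATANODE_DIR"
}
```

A typical run would call `download_hadoop`, watch progress with `tail -n 200 -f wget-log` until the transfer completes, and then call `extract_hadoop` followed by `make_storage_dirs`. Keeping the paths in variables makes it easy to point the two folders at a local disk, a mount point, or block storage.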
