Category: Data, shell

HBase is a column-oriented data store that sits on top of the Hadoop Distributed File System and provides random data lookup and updates for big data consultants.

However, HBase provides a schema on top of the HDFS files to access and update these files any number of times.

HBase is based on top of HDFS and can be integrated with MapReduce programs to act as a source and sinks.

Region Server also consists of memstore that is a write cache to cache new data that is not yet written to disk.

So Zookeeper is an integral part of HBase architecture that maintains all the coordination and synchronization in the HBase cluster.

Related Articles