Apache HBase Primer by Deepak Vohra

By Deepak Vohra

Study the elemental foundations and ideas of the Apache HBase (NoSQL) open resource database. It covers the HBase facts version, structure, schema layout, API, and management. Apache HBase is the database for the Apache Hadoop framework. HBase is a column kin dependent NoSQL database that gives a versatile schema version.

Show description

Read Online or Download Apache HBase Primer PDF

Similar object-oriented software design books

JDBC: Practical Guide for Java Programmers (The Practical Guides)

JDBC: useful consultant for Java Programmers is the fastest technique to achieve the talents required for connecting your Java program to a SQL database. sensible, tutorial-based insurance retains you centered at the crucial projects and methods, and incisive causes cement your figuring out of the API good points you will use many times.

Aspect-Oriented, Model-Driven Software Product Lines: The AMPLE Way

Software program product strains offer a scientific technique of coping with variability in a set of goods. they've got many advantages yet there are 3 significant obstacles that may hinder them from attaining their complete capability. First, there's the problem of scale: numerous versions might exist in a product line context and the variety of interrelationships and dependencies can upward thrust exponentially.

Sams teach yourself ADO . NET in 24 hours

ADO. internet is the information entry version outfitted into the . web Framework. It replaces the previous (and principally winning) ADO utilized in just about all visible simple and ASP functions outfitted during the last few years. ADO. internet permits an program to speak with any OLE database resource (including Oracle, Sybase, Microsoft entry, or even textual content files).

Programming F#: A comprehensive guide for writing simple code to solve complex problems

Why study F#? This multi-paradigm language not just provides you with a massive productiveness improve via sensible programming, it additionally allows you to boost functions utilizing your present object-oriented and important programming talents. With Programming F#, you will fast detect the various merits of Microsoft's new language, such as entry to all of the nice instruments and libraries of the .

Extra info for Apache HBase Primer

Example text

RegionServers 3. Regions within a RegionServer 4. MemStores and HFiles within a Region CHAPTER 2 ■ APACHE HBASE AND HDFS HBase is based on HDFS as the filesystem. The ZooKeeper coordinates the different components of HBase. HBase may be accessed using Java Client APIs, external APIs, and the Hadoop FileSystem API. The Master coordinates the RegionServers. A Region is a subset of a table’s rows, such as a partition. A RegionServer serves the region’s data for reads and writes. The ZooKeeper stores global information about the cluster.

To store related rows adjacent or near each other, a common row key pattern is usually used. A column, which stores a value, consists of a column family name and a column qualifier delimited by a : (colon). For example, column family cf1 could consist of column qualifiers (or column keys) c1, c2, and c3. A column family consists of a collocated set of columns. Each column family has a set of storage properties such as whether the column values are to be cached in memory, how the row keys are encoded, or how data is compressed.

The characteristics include fast, random access to large quantities of data with high throughput. Application characteristics not suitable were also discussed. In the next chapter, I will discuss the physical storage in HBase. 47 PART II Data Model CHAPTER 4 Physical Storage The filesystem used by Apache HBase is HDFS, as discussed in Chapter 2. HDFS is an abstract filesystem that stores data on the underlying disk filesystem. HBase indexes data into HFiles and stores the data on the HDFS Datanodes.

Download PDF sample

Rated 4.26 of 5 – based on 6 votes