HBase and Hypertable: two Bigtable-like systems built on top of Hadoop

  • HBase: Bigtable-like structured storage for Hadoop HDFS:

    Google’s [WWW] Bigtable, a distributed storage system for structured data, is an effective mechanism for storing large amounts of data in a distributed environment. Just as Bigtable leverages the distributed data storage provided by the [WWW] Google File System, HBase provides Bigtable-like capabilities on top of Hadoop. Data is organized into tables, rows and columns. An Iterator-like interface is available for scanning through a row range (and of course there is the ability to retrieve a column value for a specific key). Any particular column may have multiple values for the same row key. A secondary key can be provided to select a particular value or an Iterator can be set up to scan through the key-value pairs for that column given a specific row key.

  • Hypertable:

    Modeled after Google’s well known Bigtable project, Hypertable is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures. Hypertable seeks to set the open source standard for highly available, petabyte scale, database systems.

    About Hypertable

Leave a Reply