HBase and Hypertable: two Bigtable-like systems built on top of Hadoop
Wednesday, February 27th, 2008
- HBase: Bigtable-like structured storage for Hadoop HDFS:
Google’s Bigtable, a distributed storage system for structured data, is an effective mechanism for storing large amounts of data in a distributed environment. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop. Data is organized into tables, rows and columns. An Iterator-like interface is available for scanning through a row range (and of course there is the ability to retrieve a column value for a specific key). Any particular column may have multiple values for the same row key. A secondary key can be provided to select a particular value or an Iterator can be set up to scan through the key-value pairs for that column given a specific row key.
- Hypertable:
Modeled after Google’s well-known Bigtable project, Hypertable is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures. Hypertable seeks to set the open source standard for highly available, petabyte-scale database systems.
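To make the data model in the HBase description more concrete, here is a minimal in-memory sketch in Python. It is not the HBase or Hypertable API: the class `ToyTable`, its methods, the example row keys, and the integer `version` used as the secondary key are all hypothetical names chosen for illustration. It only shows the three ideas from the description: multiple values per column for one row key, selecting one value by a secondary key, and scanning a row range with an iterator.

```python
from bisect import bisect_left


class ToyTable:
    """In-memory toy model of a Bigtable-like table (NOT the HBase API)."""

    def __init__(self):
        # row key -> column name -> {version: value}
        # "version" is a hypothetical stand-in for the secondary key.
        self._rows = {}

    def put(self, row, column, value, version):
        """Store one value for (row, column) under the given version."""
        self._rows.setdefault(row, {}).setdefault(column, {})[version] = value

    def get(self, row, column, version=None):
        """Return one value; with no version given, return the newest one."""
        cells = self._rows.get(row, {}).get(column)
        if not cells:
            return None
        if version is None:
            version = max(cells)
        return cells.get(version)

    def versions(self, row, column):
        """Iterate over (version, value) pairs for one column, newest first."""
        cells = self._rows.get(row, {}).get(column, {})
        for version in sorted(cells, reverse=True):
            yield version, cells[version]

    def scan(self, start_row, stop_row):
        """Iterate over rows with start_row <= key < stop_row, in key order."""
        keys = sorted(self._rows)
        for key in keys[bisect_left(keys, start_row):]:
            if key >= stop_row:
                break
            yield key, self._rows[key]


table = ToyTable()
table.put("com.example/a", "contents", "<html>v1</html>", version=1)
table.put("com.example/a", "contents", "<html>v2</html>", version=2)
table.put("com.example/b", "contents", "<html>b</html>", version=1)
table.put("org.example/c", "contents", "<html>c</html>", version=1)

# Newest value, a specific older value, and the full history of one column.
latest = table.get("com.example/a", "contents")
older = table.get("com.example/a", "contents", version=1)
history = list(table.versions("com.example/a", "contents"))

# Row-range scan: every key that starts with "com.example/".
scanned = [key for key, _ in table.scan("com.example/", "com.example0")]
```

The sketch re-sorts the row keys on every scan to stay short. A real system of this kind would keep rows in sorted order on disk so that a range scan reads contiguous data.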
This post is part of the series Bookmarks Trawl: February up to Feb. 27th
- Interesting Stuff: little b, PAMP, Schelog, OpenWetWare.org, and SamePlace Instant Messenger
- Editors
- SNOBOL and Icon
- regex-dna benchmark | Gentoo : Intel® Pentium® 4 Computer Language Benchmarks Game
- HBase and Hypertable: two Bigtable-like systems built on top of Hadoop
- Remainder of February Bookmarks Trawl