2006 BigtableaDistributedStorageSyst

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Google BigTable System, Bloom Filter.

Notes

Cited By

Quotes

Author Keywords

Abstract

Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance. These applications place very different demands on Bigtable, both in terms of data size (from URLs to web pages to satellite imagery) and latency requirements (from backend bulk processing to real-time data serving). Despite these varied demands, Bigtable has successfully provided a flexible, high-performance solution for all of these Google products. In this paper we describe the simple data model provided by Bigtable, which gives clients dynamic control over data layout and format, and we describe the design and implementation of Bigtable.

10. Related Work

The manner in which Bigtable uses memtables and SSTables to store updates to tablets is analogous to the way that the Log-Structured Merge Tree [26] stores updates to index data. In both systems, sorted data is buffered in memory before being written to disk, and reads must merge data from memory and disk.

References

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2006 BigtableaDistributedStorageSystJeffrey Dean
Sanjay Ghemawat
Fay Chang
Wilson C. Hsieh
Deborah A. Wallach
Mike Burrows
Tushar Chandra
Andrew Fikes
Robert E. Gruber
Bigtable: A Distributed Storage System for Structured Data