BlueDBM: An appliance for big data analytics

  • Sang Woo Jun
  • Ming Liu
  • Sungjin Lee
  • Jamey Hicks
  • John Ankcorn
  • Myron King
  • Shuotao Xu
  • Arvind

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

145 Scopus citations

Abstract

Complex data queries, because of their need for random accesses, have proven to be slow unless all the data can be accommodated in DRAM. There are many domains, such as genomics, geological data, and daily Twitter feeds, where the datasets of interest are 5 TB to 20 TB. For such a dataset, one would need a cluster with 100 servers, each with 128 GB to 256 GB of DRAM, to accommodate all the data in DRAM. On the other hand, such datasets could be stored easily in the flash memory of a rack-sized cluster. Flash storage has much better random access performance than hard disks, which makes it desirable for analytics workloads. In this paper we present BlueDBM, a new system architecture which has flash-based storage with in-store processing capability and a low-latency high-throughput inter-controller network. We show that BlueDBM outperforms a flash-based system without these features by a factor of 10 for some important applications. While the performance of a ram-cloud system falls sharply even if only 5% to 10% of the references are to the secondary storage, this sharp performance degradation is not an issue in BlueDBM. BlueDBM presents an attractive point in the cost-performance trade-off for Big Data analytics.
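
The DRAM sizing quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, decimal units (1 TB = 1000 GB) are assumed, and DRAM consumed by the operating system or by replication is ignored.

```python
import math


def cluster_dram_tb(servers: int, dram_per_server_gb: float) -> float:
    """Aggregate DRAM of a cluster in TB (assumes 1 TB = 1000 GB)."""
    return servers * dram_per_server_gb / 1000


def servers_needed(dataset_tb: float, dram_per_server_gb: float) -> int:
    """Smallest server count whose combined DRAM holds the whole dataset.

    Assumption: every byte of DRAM is available for data, which is
    optimistic for a real deployment.
    """
    dataset_gb = dataset_tb * 1000
    return math.ceil(dataset_gb / dram_per_server_gb)


if __name__ == "__main__":
    # Server sizes and dataset sizes mentioned in the abstract.
    for dram_gb in (128, 256):
        print(dram_gb, "GB/server, 100 servers:",
              cluster_dram_tb(100, dram_gb), "TB total")
    for dataset_tb in (5, 20):
        print(dataset_tb, "TB dataset at 256 GB/server:",
              servers_needed(dataset_tb, 256), "servers")
```

Under these assumptions, 100 servers provide 12.8 TB of DRAM at 128 GB each and 25.6 TB at 256 GB each, which brackets the upper end of the 5 TB to 20 TB range the abstract cites.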

Original language: English
Title of host publication: ISCA 2015 - 42nd Annual International Symposium on Computer Architecture, Conference Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1-13
Number of pages: 13
ISBN (Electronic): 9781450334020
State: Published - 13 Jun 2015
Event: 42nd Annual International Symposium on Computer Architecture, ISCA 2015 - Portland, United States
Duration: 13 Jun 2015 to 17 Jun 2015

Publication series

Name: Proceedings - International Symposium on Computer Architecture
Volume: 13-17-June-2015
ISSN (Print): 1063-6897

Conference

Conference: 42nd Annual International Symposium on Computer Architecture, ISCA 2015
Country/Territory: United States
City: Portland
Period: 13/06/15 to 17/06/15

Bibliographical note

Publisher Copyright:
© 2015 ACM.
