BioHD: An Efficient Genome Sequence Search Platform Using HyperDimensional Memorization

  • Zhuowen Zou
  • Hanning Chen
  • Prathyush Poduval
  • Yeseong Kim
  • Mahdi Imani
  • Elaheh Sadredini
  • Rosario Cammarota
  • Mohsen Imani

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

56 Scopus citations

Abstract

In this paper, we propose BioHD, a novel genomic sequence searching platform based on Hyper-Dimensional Computing (HDC) for hardware-friendly computation. BioHD transforms inherent sequential processes of genome matching to highly-parallelizable computation tasks. We exploit HDC memorization to encode and represent the genome sequences using high-dimensional vectors. Then, it combines the genome sequences to generate an HDC reference library. During the sequence searching, BioHD performs exact or approximate similarity check of an encoded query with the HDC reference library. Our framework simplifies the required sequence matching operations while introducing a statistical model to control the alignment quality. To get actual advantage from BioHD inherent robustness and parallelism, we design a processing in-memory (PIM) architecture with massive parallelism and compatible with the existing crossbar memory. Our PIM architecture supports all essential BioHD operations natively in memory with minimal modification on the array. We evaluate BioHD accuracy and efficiency on a wide range of genomics data, including COVID-19 databases. Our results indicate that PIM provides 102.8× and 116.1× (9.3× and 13.2×) speedup and energy efficiency compared to the state-of-the-art pattern matching algorithm running on GeForce RTX 3060 Ti GPU (state-of-the-art PIM accelerator).
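
The encode, memorize, and similarity-check flow described in the abstract can be illustrated with a small sketch. This is not the paper's encoder or its PIM implementation: the dimensionality `DIM`, the window length `WINDOW`, the rotate-and-multiply binding, the 0.5 threshold, and all function names are assumptions chosen for illustration, and the sketch covers only the exact-match check, not the approximate search or the statistical quality model.

```python
import random

DIM = 4096      # hypervector dimensionality (assumed value)
WINDOW = 8      # window/query length in bases (assumed value)

rng = random.Random(0)
# One random bipolar (+1/-1) hypervector per nucleotide (illustrative choice).
BASE = {b: [rng.choice((-1, 1)) for _ in range(DIM)] for b in "ACGT"}

def rotate(v, k):
    """Cyclically shift v by k positions; used to mark position in a window."""
    k %= len(v)
    return v[-k:] + v[:-k] if k else list(v)

def encode(seq):
    """Bind position-rotated base hypervectors by elementwise multiplication."""
    hv = [1] * DIM
    for pos, base in enumerate(seq):
        shifted = rotate(BASE[base], pos)
        hv = [a * b for a, b in zip(hv, shifted)]
    return hv

def build_library(reference):
    """Bundle (sum) the hypervectors of every WINDOW-long slice of the reference."""
    lib = [0] * DIM
    for i in range(len(reference) - WINDOW + 1):
        hv = encode(reference[i:i + WINDOW])
        lib = [a + b for a, b in zip(lib, hv)]
    return lib

def similarity(library, query):
    """Normalized dot product between the library and the encoded query."""
    q = encode(query)
    return sum(a * b for a, b in zip(library, q)) / DIM

def contains(library, query, threshold=0.5):
    """Exact-match check: memorized windows score near 1, others near 0."""
    return similarity(library, query) > threshold

reference = "ACGTTGCAAGCTTAGGCATCGATC"   # toy reference, not real genome data
library = build_library(reference)
present = reference[5:5 + WINDOW]        # a window that was memorized
absent = "GATTACAG"                      # not a substring of the reference
print(similarity(library, present), similarity(library, absent))
```

In this toy setup a memorized window contributes a score of about 1, while every other window adds only small random cross-talk, on the order of the square root of (number of windows / `DIM`). The single dot product against the bundled library is what replaces a position-by-position scan, and it is the kind of operation that maps naturally onto parallel in-memory hardware.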

Original language: English
Title of host publication: ISCA 2022 - Proceedings of the 49th Annual International Symposium on Computer Architecture
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 656-669
Number of pages: 14
ISBN (Electronic): 9781450386104
DOIs
State: Published - 11 Jun 2022
Event: 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022 - New York, United States
Duration: 18 Jun 2022 → 22 Jun 2022

Publication series

Name: Proceedings - International Symposium on Computer Architecture
ISSN (Print): 1063-6897
ISSN (Electronic): 2575-713X

Conference

Conference: 49th IEEE/ACM International Symposium on Computer Architecture, ISCA 2022
Country/Territory: United States
City: New York
Period: 18/06/22 → 22/06/22

Bibliographical note

Publisher Copyright:
© 2022 Copyright held by the owner/author(s). Publication rights licensed to ACM.
