DNN-SAM: Split-and-Merge DNN Execution for Real-Time Object Detection

Woosung Kang, Siwoo Chung, Jeremy Yuhyun Kim, Youngmoon Lee, Kilho Lee, Jinkyu Lee, Kang G. Shin, Hoon Sung Chwa

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

As real-time object detection systems, such as autonomous cars, need to process input images acquired from multiple cameras, they face significant challenges in delivering accurate and timely inferences often based on machine learning (ML). To meet these challenges, we want to provide different levels of object detection accuracy and timeliness to different portions within each input image with different criticality levels. Specifically, we develop DNN-SAM, a dynamic Split-And-Merge Deep Neural Network (DNN) execution and scheduling framework, that enables seamless split-and-merge DNN execution for unmodified DNN models. Instead of processing an entire input image once in a full DNN model, DNN-SAM first splits a DNN inference task into two smaller sub-tasks-a mandatory sub-task dedicated for a safety-critical (cropped) portion of each image and an optional sub-task for processing a down-scaled image-then executes them independently, and finally merges their results into a complete inference. To achieve DNN-SAM's timely and accurate detection of objects in each image, we also develop two scheduling algorithms that prioritize sub-tasks according to their criticality levels and adaptively adjust the scale of the input image to meet the timing constraints while minimizing the response time of mandatory sub-tasks or maximizing the accuracy of optional sub-tasks. We have implemented and evaluated DNN-SAM on a representative ML framework. Our evaluation shows DNN-SAM to improve detection accuracy in the safety-critical region by 2.0-3.7× and lower average inference latency by 4.8-9.7× over existing approaches without violating any timing constraints.

Original languageEnglish
Title of host publicationProceedings - 28th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages160-172
Number of pages13
ISBN (Electronic)9781665499989
DOIs
StatePublished - 2022
Event28th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2022 - Milan, Italy
Duration: 4 May 20226 May 2022

Publication series

NameProceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS
Volume2022-May
ISSN (Print)1545-3421

Conference

Conference28th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2022
Country/TerritoryItaly
CityMilan
Period4/05/226/05/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Fingerprint

Dive into the research topics of 'DNN-SAM: Split-and-Merge DNN Execution for Real-Time Object Detection'. Together they form a unique fingerprint.

Cite this