Software plagiarism detection via the static API call frequency birthmark

Dong Kyu Chae, Jiwoon Ha, Sang Chul Lee, Sang Wook Kim, Gyun Woo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

14 Scopus citations

Abstract

In this paper, we propose a system for detecting software plagiarism using a birthmark. The birthmark is representative features of a program, which can be used to identify the program. We use a set of frequency of APIs used in a program as its birthmark. The proposed system consists of three components. First, it extracts the frequency of APIs employed in a program. Next, it generates the program birthmark using a set of frequency of APIs and weights to APIs to extract unique features of the program. Finally, it decides the plagiarism based on the cosine similarity between the birthmarks. Through extensive experiments, it was found that the proposed system can provide 97.2% of precision and 95.7% of recall in plagiarism detection.

Original languageEnglish
Title of host publication28th Annual ACM Symposium on Applied Computing, SAC 2013
Pages1639-1643
Number of pages5
DOIs
StatePublished - 2013
Event28th Annual ACM Symposium on Applied Computing, SAC 2013 - Coimbra, Portugal
Duration: 18 Mar 201322 Mar 2013

Publication series

NameProceedings of the ACM Symposium on Applied Computing

Conference

Conference28th Annual ACM Symposium on Applied Computing, SAC 2013
Country/TerritoryPortugal
CityCoimbra
Period18/03/1322/03/13

Keywords

  • Birthmark
  • Static analysis

Fingerprint

Dive into the research topics of 'Software plagiarism detection via the static API call frequency birthmark'. Together they form a unique fingerprint.

Cite this