Choosing a profile length in the SCAP method of source code authorship attribution

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Source code authorship attribution is the task of determining the author of source code whose author is not explicitly known. One method of source code authorship attribution that has been shown to be extremely effective is the SCAP method. This method, however, relies on a parameter L for which there has so far been little clear guidance on choosing a value. In the SCAP method, each candidate author's known work is represented as a profile of that author, and the parameter L defines the profile's maximum length. This study investigated alternative approaches for selecting a value for L. Several of them were found to perform better than the baseline approach used in the SCAP method. The best-performing approach was empirically shown to improve performance from 91.0% to 97.2%, measured as the percentage of documents correctly attributed, on a data set of 7,231 programs written in Java and C++.
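
The role of L can be illustrated with a minimal sketch. It assumes the formulation commonly described in the SCAP literature, in which a profile is the set of the L most frequent character n-grams in an author's code and the most likely author is the one whose profile shares the most n-grams with the profile of the disputed program. The abstract itself does not state these details, and the names `build_profile`, `attribute`, and the n-gram size `n` are hypothetical choices made for illustration only.

```python
from collections import Counter


def build_profile(text, n, L):
    """Return the set of (at most) L most frequent character n-grams in text.

    Assumption: a profile is a truncated n-gram frequency ranking; L is the
    maximum profile length discussed in the abstract.
    """
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    # Rank by descending frequency; break ties lexicographically so the
    # truncation at L is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return {gram for gram, _ in ranked[:L]}


def attribute(unknown, known, n, L):
    """Return the candidate author whose profile overlaps most with unknown.

    known maps an author name to the concatenation of that author's code.
    Assumption: similarity is the size of the profile intersection.
    """
    target = build_profile(unknown, n, L)
    scores = {
        author: len(target & build_profile(code, n, L))
        for author, code in known.items()
    }
    # Iterate authors in sorted order so ties go to the first name alphabetically.
    return max(sorted(scores), key=lambda author: scores[author])


if __name__ == "__main__":
    known = {
        "alice": "for (int i = 0; i < n; i++) { sum += a[i]; }",
        "bob": "while True:\n    total = total + values.pop()\n",
    }
    unknown = "for (int j = 0; j < m; j++) { acc += b[j]; }"
    print(attribute(unknown, known, n=3, L=1000))
```

In this sketch a small L keeps only an author's most common n-grams, while a very large L keeps every distinct n-gram, which is why the choice of L can change which author is selected. The alternative selection approaches evaluated in the study are not described in the abstract and are not reproduced here.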

Original language: English
Title of host publication: Conference Proceedings - IEEE SOUTHEASTCON
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781479965854
ISBN (Print): 9781479965854
State: Published - Mar 1 2014
Event: IEEE SoutheastCon 2014 - Lexington, United States
Duration: Mar 13 2014 to Mar 16 2014

Publication series

Name: IEEE SOUTHEASTCON 2014

Conference

Conference: IEEE SoutheastCon 2014
Country/Territory: United States
City: Lexington
Period: 3/13/14 to 3/16/14

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

ASJC Scopus Subject Areas

  • Computer Networks and Communications
  • Software
  • Electrical and Electronic Engineering
  • Control and Systems Engineering
  • Signal Processing

Keywords

  • authorship attribution
  • information retrieval
  • plagiarism detection
  • software forensics
