Software System for Similarity Detection in the Picocomputer Assembly Programs

Izdanje: Sinteza 2022 - International Scientific Conference on Information Technology and Data Related Research

DOI: 10.15308/Sinteza-2022-246-253

Oblast: Information Systems and Security Session

Stranice: 246-253

Link: https://portal.sinteza.singidunum.ac.rs/paper/868

Apstrakt:
This paper tackles the problem of plagiarism in an academic environment with an emphasis on the detection of similarities between the source codes from student assignments in the programming courses. The detected similarity in these codes greatly helps a human expert to bring the final decision on which codes are plagiarised and to which extent. Since the manual comparison of the source codes is a tedious task, the system for automatic detection of similarities in the assembly programs written for the picoComputer architecture is envisioned and implemented. It relies on the application which first performs the scanning and tokenization of the source codes. The pair-wise similarity detection is carried out by the Greedy String Tiling algorithm upgraded with the hash-based Karp-Rabin modification. A convenient GUI is also provided for efficient communication for the users and the choice of necessary parameters. Two different approaches are pursued in the testing and evaluation of the system. The first test set consists of a starting program with several versions with intentional modifications to simulate plagiarism. The second test set represents a real workload which comprises 250 real source codes from the student assignments. In both cases, the system demonstrated good efficiency.
Ključne reči: plagiarism, similarity detection, picoComputer architecture, tokenization, RKR-GST algorithm
Priložene datoteke:
  • 246-253 ( veličina: 414,13 KB, broj pregleda: 1254 )

Preuzimanje citata:

BibTeX format
@article{article,
  author  = {V. Tomašević, M. Mišić and V. Tomašević}, 
  title   = {Software System for Similarity Detection in the Picocomputer Assembly Programs},
  journal = {Sinteza 2022 - International Scientific Conference on Information Technology and Data Related Research},
  year    = 2022,
  pages   = {246-253},
  doi     = {10.15308/Sinteza-2022-246-253}
}
RefWorks Tagged format
RT Conference Proceedings
A1 Vojislav Tomašević
A1 Marko Mišić
A1 Violeta Tomašević
T1 Software System for Similarity Detection in the Picocomputer Assembly Programs
AD Univerzitet Singidunum, Beograd, Beograd, Srbija
YR 2022
NO doi: 10.15308/Sinteza-2022-246-253
Unapred formatirani prikaz citata
V. Tomašević, M. Mišić and V. Tomašević, Software System for Similarity Detection in the Picocomputer Assembly Programs, Univerzitet Singidunum, Beograd, 2022, doi:10.15308/Sinteza-2022-246-253