Optimization of plagiarism detection using vector space model on CUDA architecture

dc.contributor.authorJiffriya Mohamed, Abdul-Cader
dc.contributor.authorAkmal Jahan, Mohamed Abdul Cader
dc.contributor.authorHasindu, Gamaarachchi
dc.contributor.authorRoshan, G. Ragel
dc.date.accessioned2022-10-14T04:27:31Z
dc.date.available2022-10-14T04:27:31Z
dc.date.issued2022-09-26
dc.description.abstractPlagiarism is a rapidly rising issue among students that occurs during the submission of assignments, reports, and publications in universities and educational institutions because of the easy accessibility of abundant e-resources on the Internet. To mitigate plagiarism among students, many tools are available for natural language plagiarism detection. However, they become inefficient when dealing with a prolific number of documents with large content due to the time they consume. Therefore, we have proposed a way for software-based acceleration on text-based plagiarism detection using a suitable model on CPU/GPU. For the evaluation on the CPU, initially, a software-based serial vector space model was implemented on the CPU and tested with 1000 text-based documents particularly, students’ assignments, where it consumed 1641s for plagiarism detection. As the computation time of plagiarism detection is a bottleneck of performance while treating a prolific number of text-based sources with different sizes, we focus on accelerating and optimizing the model with the number of documents. Therefore, this research intends to implement and optimize the vector space model on the Graphics Processing Units (GPU) using Compute Unified Device Architecture (CUDA). In order to speed up, a parallel version of the model was developed on GPU using CUDA and tested with the same dataset which consumed only 36s and gained a 45x speedup compared to CPU, and when optimized further it took only 4s for the same dataset which was 389x faster than serial implementation.en_US
dc.identifier.citationInternational Journal of Innovative Computing and Applications, Vol.13, No.4, 2022. pp.232 - 244 (pp. 1-20).en_US
dc.identifier.urihttps://doi.org/10.1504/IJICA.2022.125675
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/6267
dc.language.isoen_USen_US
dc.publisherInder Science Publishersen_US
dc.subjectGraphics Processing Units (GPU)en_US
dc.subjectComputer Unified Device Architecture (CUDA)en_US
dc.subjectPlagiarism Detectionen_US
dc.subjectVector Space Modelen_US
dc.titleOptimization of plagiarism detection using vector space model on CUDA architectureen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Optimisation of plagiarism.pdf
Size:
686.44 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: