Searching for Translated Plagiarism with the Help of Desktop Grids

Pataki, Máté and Marosi, Attila (2013) Searching for Translated Plagiarism with the Help of Desktop Grids. Journal of Grid Computing, 11 (1). pp. 149-166. ISSN 1570-7873 10.1007/s10723-012-9224-5

[img]
Preview
Text
201303_JOGC_SearchingForTranslatedPlagiarismWithTheHelpOfDesktopGrids.pdf - Accepted Version
Available under License Creative Commons Attribution No Derivatives.

Download (9MB) | Preview
[img]
Preview
Image
cover_120.jpg - Cover Image

Download (9kB) | Preview
[img] Text
Pataki_149_2168171_z.pdf
Restricted to Repository staff only

Download (1MB) | Request a copy

Abstract

Translated or cross-lingual plagiarism is defined as the translation of someone else’s work or words without marking it as such or without giving credit to the original author. The existence of cross-lingual plagiarism is not new, but only in recent years, due to the rapid development of the natural language processing, appeared the first algorithms which tackled the difficult task of detecting it. Most of these algorithms utilize machine translation to compare texts written in different languages. We propose a different method, which can effectively detect translations between language-pairs where machine translations still produce low quality results. Our new algorithm presented in this paper is based on information retrieval (IR) and a dictionary based similarity metric. The preprocessing of the candidate documents for the IR is computationally intensive, but easily parallelizable. We propose a desktop Grid solution for this task. As the application is time sensitive and the desktop Grid peers are unreliable, a resubmission mechanism is used which assures that all jobs of a batch finish within a reasonable time period without dramatically increasing the load on the whole system.

Item Type: ISI Article
Subjects: Q Science > QA Mathematics and Computer Science > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
Divisions: Department of Distributed Systems
Laboratory of Parallel and Distributed Systems
SWORD Depositor: MTMT Injector
Depositing User: EPrints Admin
Date Deposited: 05 Feb 2014 12:33
Last Modified: 06 Feb 2014 14:05
URI: http://eprints.sztaki.hu/id/eprint/7508

Update Item Update Item