Temporary Cost of Cheating Different Plagiarism Detection Algorithms by Students
- Solís-Martínez, Jaime 1
- Pascual Espada, Jordán 1
- Alonso Virgos, Lucia 2
- González Crespo, Rubén 2
-
1
Universidad de Oviedo
info
-
2
Universidad de La Rioja
info
- Singh Mer, K.K. (ed. lit.)
- Semwal, V.B. (ed. lit.)
- Bijalwan, V. (ed. lit.)
- Crespo, R.G. (ed. lit.)
Editorial: Springer
ISSN: 2524-7565, 2524-7573
ISBN: 9789813363069, 9789813363076
Any de publicació: 2021
Pàgines: 937-948
Tipus: Capítol de llibre
Resum
Plagiarism detection in all kinds of works is a recurrent problem in the educational environment, at all levels. There are many tools capable of detecting plagiarism, with most of them using a combination of different algorithms. The most basic plagiarism detection system is based on the comparison of literal text strings, with the use of more complex algorithms allowing the detection of other types of copies where authors change or alter the structure and words in order for the work to have less resemblance to the original source. In this research work, we have developed a tool that allows text comparison with different types of algorithms. Ten students were asked to copy a text trying to hide that it has been copied, and the resulting texts were analyzed with a set of algorithms included in the tool, with the objective of verifying which algorithms do not detect the plagiarism and also how much time the students required to do so.
Referències bibliogràfiques
- S. Hannabuss, Contested texts: issues of plagiarism. Libr. Manag. 22, 311–318 (2001)
- D. Sakamoto, K. Tsuda, A detection method for plagiarism reports of students. Procedia Comput. Sci. 159, 1329–1338 (2019)
- D.D. Arce, Plagio académico en estudiantes de bachillerato: ¿qué detecta turnitin? RUIDERAe Rev. Unidades Inf. 1–31 (2016)
- K. Baba, T. Nakatoh, T. Minami, Plagiarism detection using document similarity based on distributed representation. Procedia Comput. Sci. 111, 382–387 (2017)
- R. Lukashenko, V. Šakele, J. Grundspenkis, Computer-based plagiarism detection methods and tools: an overview, in ACM International Conference Proceeding Series, vol. 285(40) (2007)
- R. Larson, Introduction to information retrieval. JASIST 61, 852–853 (2010)
- M. Abdel-Nasser, K. Mahmoud, H. Kashef, A novel smart grid state estimation method based on neural networks. Int. J. Interact. Multimed. Artif. Intell. 5, 92–100 (2018)
- J. Feng, B. Cui, K. Xia, A code comparison algorithm based on AST for plagiarism detection, in 20 Fourth International Conference on Emerging Intelligent Data and Web Technologies (2013), pp. 393–397. https://doi.org/10.1109/eidwt.2013.73
- J. Zhao, K. Xia, Y. Fu, B. Cui, An AST-based code plagiarism detection algorithm, in 2015 10th International Conference on Broadband and Wireless Computing, Communication and Applications (BWCCA) (2015), pp. 178–182. https://doi.org/10.1109/bwcca.2015.52
- M.A.C. Jiffriya, M.A.C. Akmal Jahan, R. Ragel, Plagiarism detection on electronic text based assignments using vector space model, in ICIAfS14 (2014) https://doi.org/10.1109/iciafs.2014.7069593
- A. Parker, J.O. Hamblen, Computer algorithms for plagiarism detection. IEEE Trans. Educ. 32, 94–99 (1989)
- S.M. Alzahrani, N. Salim, A. Abraham, Understanding plagiarism linguistic patterns, textual features, and detection methods. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 42, 133–149 (2012)
- R. Lackes, J. Bartels, E. Berndt, E. Frank, A word-frequency based method for detecting plagiarism in documents, in 2009 IEEE International Conference on Information Reuse Integration (2009), pp. 163–166. https://doi.org/10.1109/iri.2009.5211544
- A. Moldagulova, R.B. Sulaiman, Document classification based on KNN algorithm by term vector space reduction, in 2018 International Conference on Control, Automation and Systems (ICCAS) (2018), pp. 387–391
- M. Kuta, J. Kitowski, Optimisation of character n-gram profiles method for intrinsic plagiarism detection (2014). https://doi.org/10.1007/978-3-319-07176-3_44
- M.A. Carmona, Semantically-informed distance and similarity measures for paraphrase plagiarism identification (2018)
- S. Zouaoui, K. Rezeg, Multi-agents indexing system (MAIS) for plagiarism detection. J. King Saud Univ. Comput. Inf. Sci. (2020) doi:https://doi.org/10.1016/j.jksuci.2020.06.009
- G.A. León-Paredes, Presumptive detection of cyberbullying on Twitter through natural language processing and machine learning in the Spanish language, in 2019 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON) (2019), pp. 1–7. https://doi.org/10.1109/chilecon47746.2019.8987684
- K. Vani, D. Gupta, Text plagiarism classification using syntax based linguistic features. Expert Syst. Appl. 88, 448–464 (2017)
- G. Oberreuter, J.D. Velásquez, Text mining applied to plagiarism detection: the use of words for detecting deviations in the writing style. Expert Syst. Appl. 40, 3756–3763 (2013)
- M. Roostaee, M.H. Sadreddini, S.M. Fakhrahmad, An effective approach to candidate retrieval for cross-language plagiarism detection: a fusion of conceptual and keyword-based schemes. Inf. Process. Manag. 57, 102150 (2020)
- Z. Su, Plagiarism detection using the Levenshtein Distance and Smith-Waterman algorithm, in 2008 3rd International Conference on Innovative Computing Information and Control, vol. 569 (2008). https://doi.org/10.1109/icicic.2008.422
- K. Manaf, Comparison of carp rabin algorithm and Jaro-Winkler distance to determine the equality of Sunda languages, in 2019 IEEE 13th International Conference on Telecommunication Systems, Services, and Applications (TSSA) (2019), pp. 77–81. https://doi.org/10.1109/tssa48701.2019.8985470