ISSN: 3107-4553

Comparative Analysis of Algorithms for Detecting ChatGPT-Paraphrased Texts

Abstract

The increasing use of AI-generated text has introduced new challenges in detecting paraphrased content, particularly in lesser-resourced languages. This study investigates the effectiveness of different algorithms in identifying ChatGPT-paraphrased texts, focusing on the impact of word unigram and character multigram features, classification algorithm performance across English and Serbian corpora, and the comparative efficiency of commercial detectors like ZeroGPT against custom models. Additionally, it examines the role of syntax analysis and model temperature in influencing AI-generated text structures. A quantitative methodology involving classification algorithms, feature set evaluations, and cross-linguistic comparisons is employed. Results indicate that tailored algorithms outperform commercial detectors, especially when incorporating syntactic features. The study underscores the necessity for language-specific approaches to enhance detection accuracy and proposes directions for future research in AI text detection.

References

  1. Jawahar, G., Sagot, B., & Seddah, D. (2019). "What Does BERT Learn about the Structure of Language?" ACL Anthology
  2. Fabbri, A. R., Kryscinski, W., McKeown, K., & Radev, D. (2021). "SummEval: Re-evaluating Summarization Evaluation." Transactions of the Association for Computational Linguistics, 9, 391-409
  3. Solaiman, I., & Dennison, C. (2021). "Process for Adapting Language Models to Society." arXiv preprint arXiv:2103.10393
  4. Kumar, A., & Li, Y. (2022). "AI-Based Text Generation and the Challenges of Detection." Journal of Computational Linguistics, 48(2), 289–305
  5. Ippolito, D., Karpinska, M., Eck, D., Callison-Burch, C., & Chan, Z. (2020). "Automatic Detection of Machine-Generated Text: A Comparative Study." EMNLP Findings, 2020, 1463-1474
  6. Pavlick, E., & Callison-Burch, C. (2016). "Simple Sentences for Complex Paraphrases." ACL Proceedings, 200-211
Download PDF

How to Cite

Pradeep Upadhyay, (2025/4/14). Comparative Analysis of Algorithms for Detecting ChatGPT-Paraphrased Texts. JANOLI International Journal of Big Data , Volume ALIHWmliJRGzmKonHyii, Issue 1.