Cross Language Plagiarism of Arabic-English Documents Using Linear Log
Noor Publishing ( 23.01.2017 )
€ 49,90
Cross-language Plagiarism Detection (CLPD) is used to automatically identify and extract plagiarism among documents in different languages. The main challenge of cross-language plagiarism detection is the difference of text languages, where the original source can be analyzed and translated. This book proposes an Arabic-English cross-language plagiarism detection method by automatically detect the semantic relatedness between the words of two suspect targeted files. The proposed method consists of six phases: The first phase is a pre-processing phase, The second involves keyphrase extraction and translation, The third phase retrieves the candidate document that match with the key phrase of the proposed plagiarism text. The fourth phase is a similarity measurement between the key phrases by measuring the similarity between the original text and plagiarism text, The fifth phase is the classification process using Linear Logistic Regression (LLR) approach and the last phase is an evaluation phase using Precision, Recall and F-measure on dataset consisting of Wikipedia articles. The experimental implementation was down with C# language and achieved excellent results.
تفاصيل الكتاب: |
|
ISBN-13: |
978-3-330-84467-4 |
ISBN-10: |
3330844671 |
EAN: |
9783330844674 |
لغة الكتاب: |
عربي |
By (author) : |
Mohammed Hasan Abdulameer Almayali |
عدد الصفحات: |
92 |
النشر في: |
23.01.2017 |
الصنف: |
Internet |