Click here to Skip to main content
11,706,455 members (47,995 online)
Rate this: bad
Please Sign up or sign in to vote.
i know this website write-check

his functionality is uploading a file(PDF,Doc) and check percentage of redundancy between the file uploaded and a lot of websites ,books,researches and after user upload file and result shows that result show redundancy percentage and highlight on copied paragraphs . that paragraphs were repeated in website references when user hover on these highlights the source or references appear to the user to make sure the source he copied from

this is explain simply for website functionality can any one help me in analysis for website has the same functionality and how check between uploaded file and archived files
Posted 11-Nov-12 0:54am
Edited 11-Nov-12 6:19am
Richard MacCutchan at 11-Nov-12 7:09am
Text analysis is a whole art/science. Try searching Google for information on this.
Sergey Alexandrovich Kryukov at 11-Nov-12 22:45pm
Exactly. Actually, I don't believe such tool can be developed and made universal, on the present level of computer science. If you simply check of exact matched in some big fragment of text, it will to make too many false negatives and false positives.
nagiub2007 at 12-Nov-12 3:41am
i can't understand clearly can u explain more with examples plz

1 solution

Rate this: bad
Please Sign up or sign in to vote.

Solution 1

Take a look at Copyleaks plagiarism checker. It has a web API for checking plagiarism. You will learn a lot of their sample code.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 192
1 Sergey Alexandrovich Kryukov 141
2 chainerlt 110
3 ppolymorphe 71
4 Mika Wendelius 70
0 OriginalGriff 9,070
1 Sergey Alexandrovich Kryukov 8,413
2 CPallini 5,189
3 Maciej Los 4,746
4 Mika Wendelius 3,676

Advertise | Privacy | Mobile
Web02 | 2.8.150819.1 | Last Updated 27 May 2015
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100