See more: C++ C
I am trying to write a program (in C and C++) that takes an English article as input, analyzes its content, and finds other articles with similar content.

I have already built an R-tree for indexing, but I don't know how to extract the content from an article.

How can I extract feature values from the text of an article?

The approach I tried is to build a dictionary from the input text (nouns only), then search the other articles for ones that contain the same words, and output those articles' file names.
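
For reference, the word-overlap approach described above could be sketched roughly like this in C++. This is only a minimal sketch under assumptions: the names `word_set` and `jaccard` are hypothetical, it skips the noun-only filtering (that would need a part-of-speech tagger), and it scores the overlap with Jaccard similarity, which is a swapped-in choice and not necessarily what my program does.

```cpp
#include <cctype>
#include <cstddef>
#include <set>
#include <string>

// Hypothetical helper: split text into a set of lowercase alphabetic words.
// Assumption: every word is kept; no noun filtering is attempted here.
std::set<std::string> word_set(const std::string& text) {
    std::set<std::string> words;
    std::string current;
    for (unsigned char ch : text) {
        if (std::isalpha(ch)) {
            current += static_cast<char>(std::tolower(ch));
        } else if (!current.empty()) {
            words.insert(current);
            current.clear();
        }
    }
    if (!current.empty()) words.insert(current);
    return words;
}

// Hypothetical helper: Jaccard similarity of two word sets, i.e. the number
// of shared words divided by the number of distinct words in either set.
// Returns a value in [0, 1]; two empty sets are scored as 0.
double jaccard(const std::set<std::string>& a, const std::set<std::string>& b) {
    if (a.empty() && b.empty()) return 0.0;
    std::size_t shared = 0;
    for (const auto& w : a) shared += b.count(w);
    return static_cast<double>(shared) / (a.size() + b.size() - shared);
}
```

Comparing the input article's word set against the word set of each stored article, and printing the file names with the highest scores, would give roughly the behaviour described above.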

But I don't think my method works very well, so I want to try a more advanced one.

Thank you.
Posted 8-Jun-14 22:36pm

1 solution


Solution 1

Have you checked the algorithm here[^]?

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Last Updated 18 Jun 2014
Copyright © CodeProject, 1999-2015