Click here to Skip to main content
15,896,456 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
i am trying to read a ms word document in c#. i want the repeated words in the document.my aim is to categorize the document based on the findings of the repeated words. also i need to do label matching with the documents uploaded so that i can group the documents that are matching with the label and place under same group. please help me with a solution. thanks.
Posted
Updated 12-Oct-12 2:15am
v2
Comments
Nelek 12-Oct-12 8:19am    
[no name] 12-Oct-12 10:38am    
Okay. Open the word document, read the text, count the words, categorize the document according to your criteria, group the documents.

1 solution

1) Open the word document using the Microsoft.Office.Interop.Word namespace.
2) Read all the content and create a dictionary list of words and occurance count.
3) Do whatever else you need to in order to satisfy your business requirements.
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900