Click here to Skip to main content
15,937,003 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
I am developing an application which opens a word document and convert it to basic html without any font information, but I need to get the information like indended paragraph, bold and italic etc...

I tried to parse the RTF, but that is pretty complex, Is there any alternate solution to achieve this task?

Thanks in advance

1 solution


Have you considered a Google search for your answer? A quick search[^] revealed some interesting results.

You can save a word file as a html[^] file. Then if you're still unhappy with the HTML that Word creates, you could consider Cleaning word's Nasty HTML[^].

I think the two links above make take care of your problem.

Share this answer

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900