Click here to Skip to main content
13,046,281 members (86,687 online)
Rate this:
Please Sign up or sign in to vote.
See more:
im working on a project to extract all images, pdf, word documents from a web page and store it in a database.
so i need to convert web page to xhtml using Jtidy, and then convert that XHTML to XML using XSLT, then extract all those files using the XML.
i dont know how to convert XHTML to XML using XSLT and extract data from it.
somebody help me on it.
Posted 13-Mar-12 2:19am

1 solution

Rate this: bad
Please Sign up or sign in to vote.

Solution 1

NO. Your idea sounds interesting but does things you do not want and probably even don't need.

Read this:[^]

and use another approach.
kalai91 15-Mar-12 6:11am
thanks for ur solution. i read that article. but still im forced to do it in that way! wat shall i do??
TorstenH. 15-Mar-12 10:14am
start here: XSL @w3schools

and also take a look here: Converting XHTML files to XSL and XML files @

an old fashion book will also be worth it. It's kind of like an additional display. Also is the search mechanism much better than anything online.
kalai91 19-Mar-12 6:48am
Thank you...

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
Top Experts
Last 24hrsThis month

Advertise | Privacy | Mobile
Web02 | 2.8.170713.1 | Last Updated 13 Mar 2012
Copyright © CodeProject, 1999-2017
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100