Click here to Skip to main content
11,801,910 members (56,107 online)
Rate this: bad
Please Sign up or sign in to vote.
See more: XML Java XHTML XSLT
im working on a project to extract all images, pdf, word documents from a web page and store it in a database.
so i need to convert web page to xhtml using Jtidy, and then convert that XHTML to XML using XSLT, then extract all those files using the XML.
i dont know how to convert XHTML to XML using XSLT and extract data from it.
somebody help me on it.
Posted 13-Mar-12 2:19am

1 solution

Rate this: bad
Please Sign up or sign in to vote.

Solution 1

NO. Your idea sounds interesting but does things you do not want and probably even don't need.

Read this:[^]

and use another approach.
kalai91 at 15-Mar-12 6:11am
thanks for ur solution. i read that article. but still im forced to do it in that way! wat shall i do??
TorstenH. at 15-Mar-12 10:14am
start here: XSL @w3schools

and also take a look here: Converting XHTML files to XSL and XML files @

an old fashion book will also be worth it. It's kind of like an additional display. Also is the search mechanism much better than anything online.
kalai91 at 19-Mar-12 6:48am
Thank you...

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 445
1 KrunalRohit 301
2 F-ES Sitecore 215
3 CHill60 185
4 Afzaal Ahmad Zeeshan 180
0 OriginalGriff 2,715
1 Maciej Los 1,900
2 KrunalRohit 1,797
3 CPallini 1,550
4 Richard MacCutchan 1,157

Advertise | Privacy | Mobile
Web03 | 2.8.151002.1 | Last Updated 13 Mar 2012
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100