Click here to Skip to main content
11,702,168 members (62,778 online)
Rate this: bad
good
Please Sign up or sign in to vote.
See more: XML Java XHTML XSLT
im working on a project to extract all images, pdf, word documents from a web page and store it in a database.
so i need to convert web page to xhtml using Jtidy, and then convert that XHTML to XML using XSLT, then extract all those files using the XML.
i dont know how to convert XHTML to XML using XSLT and extract data from it.
somebody help me on it.
Posted 13-Mar-12 2:19am
kalai91216

1 solution

Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

NO. Your idea sounds interesting but does things you do not want and probably even don't need.

Read this: http://java.sun.com/developer/technicalArticles/ThirdParty/WebCrawler/[^]

and use another approach.
  Permalink  
Comments
kalai91 at 15-Mar-12 6:11am
   
thanks for ur solution. i read that article. but still im forced to do it in that way! wat shall i do??
TorstenH. at 15-Mar-12 10:14am
   
start here: XSL @w3schools

and also take a look here: Converting XHTML files to XSL and XML files @ ibm.com

an old fashion book will also be worth it. It's kind of like an additional display. Also is the search mechanism much better than anything online.
kalai91 at 19-Mar-12 6:48am
   
Thank you...

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 Maciej Los 395
1 Sergey Alexandrovich Kryukov 355
2 OriginalGriff 314
3 CHill60 145
4 Mika Wendelius 119
0 OriginalGriff 8,344
1 Sergey Alexandrovich Kryukov 7,507
2 CPallini 4,922
3 Maciej Los 4,371
4 Mika Wendelius 3,556


Advertise | Privacy | Mobile
Web03 | 2.8.150819.1 | Last Updated 13 Mar 2012
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100