Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: XML Java XHTML XSLT
im working on a project to extract all images, pdf, word documents from a web page and store it in a database.
so i need to convert web page to xhtml using Jtidy, and then convert that XHTML to XML using XSLT, then extract all those files using the XML.
i dont know how to convert XHTML to XML using XSLT and extract data from it.
somebody help me on it.
Posted 13-Mar-12 3:19am
kalai91209

1 solution

Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

NO. Your idea sounds interesting but does things you do not want and probably even don't need.
 
Read this: http://java.sun.com/developer/technicalArticles/ThirdParty/WebCrawler/[^]
 
and use another approach.
  Permalink  
Comments
kalai91 at 15-Mar-12 6:11am
   
thanks for ur solution. i read that article. but still im forced to do it in that way! wat shall i do??
TorstenH. at 15-Mar-12 10:14am
   
start here: XSL @w3schools
 
and also take a look here: Converting XHTML files to XSL and XML files @ ibm.com
 
an old fashion book will also be worth it. It's kind of like an additional display. Also is the search mechanism much better than anything online.
kalai91 at 19-Mar-12 6:48am
   
Thank you...

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 195
1 ProgramFOX 130
2 Maciej Los 105
3 Afzaal Ahmad Zeeshan 82
4 Sergey Alexandrovich Kryukov 75
0 OriginalGriff 6,564
1 Sergey Alexandrovich Kryukov 6,048
2 DamithSL 5,228
3 Manas Bhardwaj 4,717
4 Maciej Los 4,150


Advertise | Privacy | Mobile
Web03 | 2.8.1411022.1 | Last Updated 13 Mar 2012
Copyright © CodeProject, 1999-2014
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100