Click here to Skip to main content
15,886,689 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
Hi
I have a download option in my application which download a webpage. I want to include all Images, Style sheet, Script along with the plain HTML page. Same thing happen when we select Complete webpage download while saving web page. Can any one suggest how to do that , so that always it will save Complete web page not just plain HTML page.


Many many Thanks in advance
Posted
Updated 20-Feb-13 0:31am
v2
Comments
Karthik Harve 20-Feb-13 6:32am    
[Edit] removed pre tags.

http://msdn.microsoft.com/en-us/library/office/microsoft.office.interop.word%28v=office.11%29.aspx[^]

SQL
You need to download and parse the page to find out all the external resources, then download each in turn (and possibly parse it and download the resources within it).

For parsing the HTML, I suggest using the HTML Agility Pack. You need to keep in mind what resources you want to download (images, css, javascript etc) and query the page for those specifically.

You will need to keep in mind that some pages will define a base and that you will need to consider that as well as the page URL with relative and absolute links.
 
Share this answer
 
Please visit the following link. It will guide you about your problem.
downloading-a-complete-webpage-problematically
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900