Click here to Skip to main content
15,886,362 members
Articles / Desktop Programming / MFC
Article

Selective Offline Browser

Rate me:
Please Sign up or sign in to vote.
3.00/5 (3 votes)
6 Jan 2000 84.9K   1.5K   17   11
An useful application to download only selected files for offline browsing.

Sample Image - OfflineBrowser.jpg

While browsing a message board a few days back, I noticed that someone wanted a utility to download only selected files for offline browsing. Seemed to me like an useful tool to have, so I wrote one.

While IE does let you download files for offline browsing, you have no control over which files are downloaded, you may just want a few, not all of the pages. This utility will give you that control.

To run this application, you will require IE 4.01 or above, as it is based on the WebBroswer2 control provided by IE. What the app does is pretty simple. You can specify a list of file specifications (*.html, r*.htm, etc...) and then choose a start page to start downloading from. Also you can specify the number of levels you want to download. The default is 0, which is rather useless. So for e.g., if you ask for 2 levels - all links which match the specifications you have entered will be followed down to 2 levels.

The application by itself is based on a CHTMLView which creates an IE WebBrowserApp control and runs it. All I really have done is create a class CHTTPLineHolder which takes a LPDISPATCH (interface to a WebBrowserApp) and uses it to download pages into your local system cache.

So, the basic logic followed is as follows:

NavigateToPage - ( URL to navigate to )
{
    Store Page Into Cache
    GetAllLinksWithinPage 

    For Each Link which satisfies  File Spec given 
        NavigateToPage ( Current Link)

}

This by itself will download the file - but to store it, a couple of calls to the WinInet library have to be made. These calls get a cache file name and store the page into the file. As this file name is got through WinInet, any application (including IE) which uses WinInet can use this cache.

Also, the whole operation is done on a separate thread, so as to allow painting by the main view. So as each page is got, you get to see it in the main view window.

The methods which do the work are:

CHTTPLinkHolder::ReadPage(const CString& szUrl)

and

CHTTPLinkHolder::ReadPageIntoCache(const CString& szURL)

To use the application, first choose View | File Extensions.. and add or delete items from the list until you have only the specs you need. You can use wildcards here so, "*.htm", "a*b?c.htm" etc. are valid specs. Next, choose View | Start... You will be asked for a starting page and the number of levels to download, enter this info and click OK. And once this process is over, go to IE, make sure your Offline Browsing option is set and choose the same start page you entered in the application. And you should be able to view the files you downloaded. Exceptions of course are dynamic content files which may not be downloaded as expected.

So - Happy Offline Browsing.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here


Written By
Web Developer
United States United States
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions

 
GeneralUsing a proxy Pin
Martin Gillmann28-Jun-06 22:35
Martin Gillmann28-Jun-06 22:35 
Hi Rajiv,
I hape you can answer this question. It seems you have a lot of knowlage in this area:

I have a simple application based on a CHtmlView. I need to force this application to use a proxy server (127.0.0.1:90) while I want internet explorer to access the net directly. Is there a way to achieve this?

Many thanks Martin




QuestionEnable ActiveX execution? Pin
dragomir11-Sep-04 7:42
dragomir11-Sep-04 7:42 
GeneralCHtmlView and Swing Pin
Sky17-Jul-02 22:43
Sky17-Jul-02 22:43 
Generalwe could not save anything... Pin
26-Jun-01 3:34
suss26-Jun-01 3:34 
GeneralBrowser Helper Objects Pin
1-Apr-01 22:45
suss1-Apr-01 22:45 
GeneralRe: Browser Helper Objects Pin
10-Jul-01 22:55
suss10-Jul-01 22:55 
GeneralGreat selection of webpage to show Pin
Ron20-Mar-00 3:41
Ron20-Mar-00 3:41 
GeneralRe: Great selection of webpage to show Pin
24-Apr-01 8:59
suss24-Apr-01 8:59 
GeneralRe: Great selection of webpage to show Pin
29-Nov-01 10:08
suss29-Nov-01 10:08 
GeneralFix For Not being able to build project Pin
Rajiv Ramachandran28-Jan-00 12:16
Rajiv Ramachandran28-Jan-00 12:16 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.