Click here to Skip to main content
11,576,064 members (58,142 online)
Rate this: bad
Please Sign up or sign in to vote.
See more: C#3.0
I want to read the content in html file online and save all the content text in .txt document. Can anyone tell me how to read the text from html document?
Posted 21-Dec-12 4:02am
richcb at 21-Dec-12 10:14am
A simple google search found this:
.NET_USER at 24-Dec-12 2:16am
Thank you for the link, I have searched a lot before but didn't get the required answer, Thanx a lot!
BillWoodruff at 22-Dec-12 9:54am
What have you tried so far ?

1 solution

Rate this: bad
Please Sign up or sign in to vote.

Solution 1

If you mean download the source of a webpage...

using System;
using System.IO;
using System.Net;

And the function...

public static void SaveSource(string url, string path)
    WebClient client = new WebClient();
    string source = client.DownloadString(url);
    File.WriteAllText(path, source);

For faster results of getting the source of a webpage, you may want to look into HttpWebRequests.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 Sergey Alexandrovich Kryukov 335
1 F-ES Sitecore 145
2 OriginalGriff 125
3 Abhinav S 98
4 Dave Kreskowiak 80
0 OriginalGriff 705
1 Sergey Alexandrovich Kryukov 630
2 Abhinav S 438
3 F-ES Sitecore 410
4 Suvendu Shekhar Giri 365

Advertise | Privacy | Mobile
Web03 | 2.8.150603.1 | Last Updated 23 Dec 2012
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100