Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: C#3.0
I want to read the content in html file online and save all the content text in .txt document. Can anyone tell me how to read the text from html document?
Posted 21-Dec-12 5:02am
Comments
richcb at 21-Dec-12 10:14am
   
A simple google search found this:
 
http://social.msdn.microsoft.com/Forums/en-US/csharpgeneral/thread/060ea8e0-cc63-44a3-b0dc-b531c29b8a0f/
.NET_USER at 24-Dec-12 2:16am
   
Thank you for the link, I have searched a lot before but didn't get the required answer, Thanx a lot!
BillWoodruff at 22-Dec-12 9:54am
   
What have you tried so far ?

1 solution

Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

If you mean download the source of a webpage...
 
using System;
using System.IO;
using System.Net;
 
And the function...
 
public static void SaveSource(string url, string path)
{
    WebClient client = new WebClient();
    string source = client.DownloadString(url);
    File.WriteAllText(path, source);
}
 
For faster results of getting the source of a webpage, you may want to look into HttpWebRequests.
  Permalink  
v2

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 450
1 Suvendu Shekhar Giri 130
2 Abhinav S 126
3 Jochen Arndt 100
4 Maciej Los 100
0 Sergey Alexandrovich Kryukov 9,623
1 OriginalGriff 8,870
2 Peter Leow 4,984
3 Kornfeld Eliyahu Peter 3,300
4 Maciej Los 2,456


Advertise | Privacy | Mobile
Web04 | 2.8.150327.1 | Last Updated 23 Dec 2012
Copyright © CodeProject, 1999-2015
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100