Hello,
I want to extract Nowrgerian lanaguage text available from below websites:
Edit: Rohan Leuva
[Links to website removed]
I have prepared a small windows application, with below code:
WebClient web = new WebClient();
System.IO.Stream stream = web.OpenRead("http://www.xyz.com/");
using (System.IO.StreamReader reader = new System.IO.StreamReader(stream, Encoding.Default, true))
{
text = reader.ReadToEnd();
}
richTextBox1.Text = text;
===============================================================
Now in above case I am not able to get all special characters correctly that appear in Norwgerian, even by using Encoding.Default.
Norwgerian special characters:
«
»
…
å
æ
é
ø
à
æ
Kindly suggest how should I move ahead. Also, my ultitmate object is to get plain text (without any tags) from these website, any additional support in this matter will also help.
Thanks in advance for sparing time and reading my issue. Hope to get some suitable solution.
Regards,
Ankit