Hi there,

I am a beginner at RSS and HTML parsing, so I want to practice my skills.

There are heaps of options in the catalogue on that page.

I want to retrieve all of the catalogue entries and turn them into a well-formed XML file to use as an RSS feed.

Does anyone have an idea how to do this?
It would be great if someone could kindly provide code fragments.

Thanks in advance.

I don't think anyone has attempted to parse the site map yet. Why don't you give it a shot? If you run into specific issues, post here and someone will be sure to help you out. But it would be quite unlikely that you'd find a guy to write the whole parser for you from scratch.
 
 
Comments
franva 24-Jan-11 7:10am    
Hi Nishant,

Thanks for your reply :)
What do you mean by "why don't you give it a shot"?
The only reason I want to parse the site map is to transform it into an RSS file.
I saw the HTML2RSS example code in HTML Agility Pack, but I have no idea how the XPath field works, so I just want to know a simple and effective way to implement it.

Once again, thanks a lot ^_^

Winston
I have an article that uses HtmlAgilityPack to parse some of the Codeproject article/user pages. You could use that as an example.

CodeProject Article Scraper, Revisited

I don't rely completely on it, but there is some code in there that uses it.
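
For anyone who wants a starting point without the article, here is a minimal sketch of the overall idea (collect every link on a page, then write the links out as RSS 2.0 items). It is not the HTML2RSS sample and it does not use HtmlAgilityPack; it uses only the Python standard library. The names `LinkCollector`, `html_to_rss`, `SAMPLE_HTML`, and the `https://example.com/` base URL are made up for illustration, and the sample markup is an assumption about what a catalogue list might look like, not the real page. Collecting every `<a>` that has an `href` is roughly what the XPath expression `//a[@href]` selects.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
import xml.etree.ElementTree as ET


class LinkCollector(HTMLParser):
    """Collects (title, absolute URL) pairs for every <a href="..."> element."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None   # URL of the <a> currently being read, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page address.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse whitespace; fall back to the URL if the link has no text.
            title = " ".join("".join(self._text).split())
            self.links.append((title or self._href, self._href))
            self._href = None


def html_to_rss(html, base_url, feed_title):
    """Returns an RSS 2.0 document (as a string) with one <item> per link."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    collector.close()

    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = base_url
    ET.SubElement(channel, "description").text = "Links extracted from " + base_url
    for title, url in collector.links:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "link").text = url
    # ElementTree escapes &, < and > itself, so the output stays well-formed.
    return ET.tostring(rss, encoding="unicode")


# Hypothetical stand-in for the catalogue markup.
SAMPLE_HTML = """
<ul>
  <li><a href="/KB/cs/">C# &amp; .NET</a></li>
  <li><a href="/KB/web/">Web Development</a></li>
</ul>
"""

rss_xml = html_to_rss(SAMPLE_HTML, "https://example.com/", "Sample catalogue")
print(rss_xml)
```

On a real page you would probably want to narrow the selection to the part of the document that holds the catalogue, since this sketch picks up navigation and footer links too.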
 
 
Comments
franva 26-Jan-11 22:54pm    
thanks mate, :)

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


