Click here to Skip to main content
16,021,765 members
Please Sign up or sign in to vote.
3.00/5 (1 vote)
See more:
Hi,
I saw this project here Create Your Own Googlebot[^] and really does a nice job for meta tags scraping. I would like to ask for help about a solution to extract information only specific divs or tags from a website/webpages or even html files on a disk. I have many html files on my disk, and seek for a way how to extract only particular data from those files.

Thanks
Posted
Comments
Sergey Alexandrovich Kryukov 13-Nov-12 18:38pm    
"I would like to ask a question" is not a question. Why not asking a question?

Please understand: we have overwhelming number of non-questions here, so if your question does not resemble a question, someone like myself would remove it just accidentally, and you don't want it. Frankly, I would love that some questions were denied to post if there is no a question mark, or something like that...

And what's the problem?
--SA
Andreas Gieriet 13-Nov-12 19:02pm    
What exactly do you want to extract? E.g. <div class="ABC">...</div>? Please some input example, and what the expected output was?
And finally: did you try out anything? Where is the problem in your solution?
Cheers
Andi

1 solution

Why not asking Chris, the author of this article? Locate "Comments and Discussions" section at the end and click "Add a Comment or Question", to ask him a question. The author will get a notification on your post and a chance to reply.

Good luck,
—SA
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900