I saw the project Create Your Own Googlebot[^] here, and it does a really nice job of scraping meta tags. How can I extract information from only specific divs or tags in a website, in web pages, or even in HTML files on disk? I have many HTML files on my disk and am looking for a way to extract only particular data from those files.

Posted 13-Nov-12 12:34pm
Sergey Alexandrovich Kryukov 13-Nov-12 18:38pm
"I would like to ask a question" is not a question. Why not ask a question?

Please understand: we have an overwhelming number of non-questions here, so if your post does not resemble a question, someone like me might remove it just by accident, and you don't want that. Frankly, I would love it if posts were rejected when they contain no question mark, or something like that...

And what's the problem?
Andreas Gieriet 13-Nov-12 19:02pm
What exactly do you want to extract? E.g. <div class="ABC">...</div>? Please provide some example input and the output you expect.
And finally: did you try anything? Where is the problem in your solution?

1 solution


Solution 1

Why not ask Chris, the author of this article? Locate the "Comments and Discussions" section at the end of the article and click "Add a Comment or Question" to ask him. The author will get a notification of your post and a chance to reply.
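
For the HTML files already on disk, here is a minimal sketch of one possible approach, using only Python's standard library. It is not the method used in the article, and every name in it (`TagTextExtractor`, `extract_from_html`, `extract_from_folder`) is hypothetical. The class name "ABC" is simply borrowed from the example in the comments, and the files are assumed to be UTF-8 with an `.html` extension. It collects the text inside every element with a given tag name and CSS class.

```python
from html.parser import HTMLParser
from pathlib import Path


class TagTextExtractor(HTMLParser):
    """Collects the text inside every <tag> whose class list contains css_class."""

    def __init__(self, tag, css_class):
        super().__init__(convert_charrefs=True)
        self.tag = tag
        self.css_class = css_class
        self.depth = 0      # nesting depth of same-name tags inside a match
        self.current = []   # text pieces of the element being collected
        self.results = []   # one string per matching element

    def handle_starttag(self, tag, attrs):
        if self.depth:
            # Already inside a match: only track nested tags of the same name.
            if tag == self.tag:
                self.depth += 1
        elif tag == self.tag:
            classes = (dict(attrs).get("class") or "").split()
            if self.css_class in classes:
                self.depth = 1
                self.current = []

    def handle_endtag(self, tag):
        if self.depth and tag == self.tag:
            self.depth -= 1
            if self.depth == 0:
                # Collapse runs of whitespace into single spaces.
                self.results.append(" ".join("".join(self.current).split()))

    def handle_data(self, data):
        if self.depth:
            self.current.append(data)


def extract_from_html(html, tag="div", css_class="ABC"):
    """Return the text of every matching element in an HTML string."""
    parser = TagTextExtractor(tag, css_class)
    parser.feed(html)
    parser.close()
    return parser.results


def extract_from_folder(folder, tag="div", css_class="ABC"):
    """Map each *.html file name in folder to the texts extracted from it."""
    return {
        path.name: extract_from_html(
            path.read_text(encoding="utf-8", errors="replace"), tag, css_class
        )
        for path in sorted(Path(folder).glob("*.html"))
    }
```

Calling `extract_from_folder("pages")` on a hypothetical folder named `pages` would return a dictionary keyed by file name. A dedicated HTML library with selector support may be more convenient for complex pages; the standard-library parser is used here only to keep the sketch self-contained.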

Good luck,

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Last Updated 13 Nov 2012
Copyright © CodeProject, 1999-2017
All Rights Reserved. Terms of Service