Click here to Skip to main content
15,881,812 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
Hi,

I have tried o look in Google but, it seems like it is going to take me 2 days to find good notes or a good tutorial.
I am also looking for easy to follow notes or tutorials on HTML parsers

The tutorials have to be for C++ programmers.

Thanks in advance
Posted

here[^]

or

here[^]
 
Share this answer
 
Comments
Espen Harlinn 13-Mar-11 8:07am    
Nice options - as it's a bit unclear what OP means by "HTML Trees"
Sergey Alexandrovich Kryukov 13-Mar-11 15:43pm    
How did you understand this Question? My 5.
--SA
You question is some what not complete. For starters you need to explain what you mean by 'HTML Trees'. The fact that you put it in quotes suggest that you meant something, but clearly you know that is not the term. Also, the fact that you are unable to find any good hits with google may be the use of incorrect terms.

By 'HTML Trees', I am assuming you are talking about the html structure, right? something like
<html>
<head>
</head>
<body>
  <form>
  </form>
</body></html>


Is that what you have in mind? If not then there is no point in continuing. But if we are in agreement, well, depending when shallow or somewhat deep you work/project/assignment is the best source is to learn something about HTML specification. You don't need to go deep, just enough to get you going and solve your problem. I understand you are not trying to create the next killer app Browser. Here is good starting point http://www.w3.org/html/[^]

Next there are few C++ HTML parsers around. You will be able to better understand what is going on if you have some understanding of the HTML spec. Then when you ready check

HTML Reader C++ Class Library[^]

http://stackoverflow.com/questions/489522/library-recommendation-c-html-parser[^]

and of course our old friend recommends this [^]
 
Share this answer
 
Comments
Espen Harlinn 13-Mar-11 8:09am    
Nice links :)
The_Real_Chubaka 13-Mar-11 9:04am    
Thanks for the links.

Id don't mean HTML structure. I mean HTML parse tree.
Yusuf 13-Mar-11 10:31am    
I guess you are somewhat confused. A tree is nothing but a data structure. You can save/put/set your parsed html into any format or shape you like. You may want to concentrate on the HTML parsing issue. Once you are able to parse it, how you want to save it or render it becomes your choice.
The_Real_Chubaka 13-Mar-11 10:35am    
I think you have given me the best answer.
You are right, i was confused. I will concentrate on HTML parsing.
Yusuf 13-Mar-11 11:15am    
Good Luck and Thanks for accepting my answer.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900