Hi all,
I want to write a program in C# that downloads all new papers from www.ScienceDirect.com.
How can I detect when new papers are added, and find their links?
Please help me.

1 solution

In the general case, you cannot detect this from the page itself. A new version of an HTML page is not marked as different from the old one; it does not distinguish "new" items from "old" ones unless the site was specifically designed that way, which you cannot rely on. The response headers (such as Last-Modified) or the HTML head can carry date information, but that is not guaranteed either.
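As a rough illustration of the date information mentioned above: one place a server may expose a modification date is the HTTP Last-Modified response header. The sketch below reads it with HttpClient when it is present and returns null when the server omits it. The names PageDateProbe, GetLastModified and FetchLastModifiedAsync are made up for this example, and nothing here is specific to ScienceDirect, which may not send the header at all.

```csharp
using System;
using System.Net.Http;
using System.Threading.Tasks;

// Hypothetical helper for illustration; not part of any library.
public static class PageDateProbe
{
    // Returns the Last-Modified value of a response,
    // or null if the server did not send one.
    public static DateTimeOffset? GetLastModified(HttpResponseMessage response)
    {
        // HttpClient exposes Last-Modified among the content headers.
        return response.Content?.Headers.LastModified;
    }

    // Sends a HEAD request so that only the headers are transferred.
    public static async Task<DateTimeOffset?> FetchLastModifiedAsync(HttpClient client, string url)
    {
        using var request = new HttpRequestMessage(HttpMethod.Head, url);
        using var response = await client.SendAsync(request);
        return GetLastModified(response);
    }
}
```

A null result, or a date that never changes, tells you nothing about whether the list of papers has changed, so this can only be an optimization on top of comparing the links themselves.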

Therefore, you have to scan the same HTML documents on a regular basis and compare the set of links you find with the set you have stored in your database; only then can you spot a change. Even then, you cannot guarantee that you will spot, for example, updates to an existing article. Detecting such details depends on conventions used by the site, and those conventions can change too.
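The scan-and-compare approach above can be sketched roughly as follows. This is a minimal illustration, not a complete crawler: the names LinkTracker, ExtractLinks and FindNewLinks are made up for this example, the regular expression is a crude stand-in for a real HTML parser, and an in-memory set stands in for the database.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration; not part of any library.
public static class LinkTracker
{
    // Crude href extraction; a real HTML parser would be more robust.
    private static readonly Regex HrefPattern =
        new Regex("href\\s*=\\s*[\"']([^\"']+)[\"']", RegexOptions.IgnoreCase);

    // Collects the absolute http/https URLs of all links found in the HTML.
    public static HashSet<string> ExtractLinks(string html, Uri pageUrl)
    {
        var links = new HashSet<string>(StringComparer.Ordinal);
        foreach (Match match in HrefPattern.Matches(html))
        {
            // Resolve relative links against the address of the page.
            if (Uri.TryCreate(pageUrl, match.Groups[1].Value, out var absolute)
                && (absolute.Scheme == Uri.UriSchemeHttp || absolute.Scheme == Uri.UriSchemeHttps))
            {
                links.Add(absolute.AbsoluteUri);
            }
        }
        return links;
    }

    // Returns the links found in the current scan that are not in the stored set.
    public static HashSet<string> FindNewLinks(IEnumerable<string> current, ISet<string> known)
    {
        var fresh = new HashSet<string>(current, StringComparer.Ordinal);
        fresh.ExceptWith(known);
        return fresh;
    }
}
```

On each scheduled run you would download the listing page, pass its HTML to ExtractLinks, call FindNewLinks against the stored set, process whatever comes back, and then add those links to the stored set. This only catches links that were added; an article that changes while keeping the same URL goes unnoticed, as noted above.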

—SA
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


