Click here to Skip to main content
15,887,027 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
I am writing a crawler...I encounter a problem...how should I find the content of the page when websites are using _dopostback for their pagination?

Is there anyway to have access the source page of the next/prev page when using _dopostback

I really stuck and dont have any idea

Thank you very much for your help in advance
Posted

1 solution

As you are developing a crawler, you should never send POST requests, that means that you will not be able to access pages that have navigation only using _doPostBack.
Typically webmasters add xml sitemaps that contain links to pages of the site:
http://www.xml-sitemaps.com/about-sitemaps.html[^]
you can make use of them.
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900