Click here to Skip to main content
15,884,628 members
Articles
(untagged)

Mine Your Website's Data With a Private Custom Crawler

19 Sep 2012CPOL 621   1
Web pages provide a plethora of information and mineable data. Unfortunately most of them are not using the XML based XHTML but the classic HTML. Therefore we decided to extend the ANKHOR XML parser to accept most HTML content.With this extension it is now quite simple to e.g. extract all referenc
Only logged in members can view this content