I am a programmer for state government, and I just inherited the GSA (Google Search Appliance) service for the website. It is poorly set up, with bad regular expressions, outdated contact info for groups, and lots of errors.

The oddest thing I have found is that one of the groups shows that it has crawled -234,001 sites. Does anyone with knowledge of the GSA or of indexing services know why this would happen? I haven't a clue.


Thank you all in advance.

Edit:

Alright, not a lot of activity on here, but to clarify: I am wondering why I am getting a negative crawled page count. After going through the errors in a few hundred groups, I found that most of them were caused by DotNetNuke (DNN), which some of the departments use. This doesn't really answer the question, but it makes me think that DNN is the cause of the error. I'm just not sure why yet.
Updated 30-Oct-13 11:40am
Comments
Sergey Alexandrovich Kryukov 30-Oct-13 14:25pm    
"Does anyone have any knowledge..." is not a valid question for the forum. You are supposed to ask questions on the topics of the forum and get answers on this page, just that.
—SA
BuckMaverick 30-Oct-13 14:35pm    
The question is: why do I have a negative pages-crawled count?
Matt T Heffron 30-Oct-13 14:26pm    
My first guess would be signed integer overflow (i.e., wrap-around).
BuckMaverick 30-Oct-13 14:33pm    
That is what I was thinking as well, but that would be a lot of crawled pages. We have a licence for 500,000 indexed pages, which should be more than enough. I have excluded a lot of problem pages and set it to ignore case. This has brought us down from nearly 1,000,000 pages served, which was the issue to begin with, to 259,078.
BuckMaverick 30-Oct-13 14:42pm    
Right, I fixed it. I am thinking it might have to do with the fact that there were pages excluded, and for some reason the number of pages that are now not indexed is greater than the total number that was indexed before. But I am not sure what to do about it. HAHA. I am a programmer 2 and the state webmaster. I hadn't played with the GSA service, except to add it to pages, until yesterday, when Google called and said we were indexing more than what we are paying for.
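
To make the two guesses in the comments above concrete, here is a minimal Python sketch. It is not based on how the GSA actually stores or computes its counts, which the thread does not establish. The 32-bit signed counter, the helper names (wrap_int32, smallest_true_count, net_count), and the sample figures 100 and 150 are all assumptions made up for illustration; only -234,001 comes from the question.

```python
# Assumption: the counter is a signed 32-bit two's-complement integer.
# Nothing in the thread confirms the width the GSA really uses.
INT32_MIN = -2**31
INT32_SPAN = 2**32


def wrap_int32(n: int) -> int:
    """Reduce n to the value a signed 32-bit counter would hold."""
    return (n - INT32_MIN) % INT32_SPAN + INT32_MIN


def smallest_true_count(observed: int) -> int:
    """Smallest non-negative count a 32-bit counter would display as `observed`."""
    return observed % INT32_SPAN


def net_count(previously_indexed: int, excluded: int) -> int:
    """Hypothetical report field computed as a plain difference, not floored at zero."""
    return previously_indexed - excluded


observed = -234_001  # the value reported in the question

# Guess 1 (wrap-around): how many pages would the counter need to have seen?
needed = smallest_true_count(observed)
print("true count needed for wrap-around:", needed)
print("wraps back to the observed value:", wrap_int32(needed) == observed)

# Guess 2 (subtraction): made-up figures where exclusions exceed the old total.
print("net count with more excluded than indexed:", net_count(100, 150))
```

Under the 32-bit assumption, the counter would have to pass roughly 4.29 billion pages before it displayed -234,001, which is far beyond the page counts mentioned in this thread. A count derived by subtracting excluded pages from an older total, on the other hand, goes negative as soon as the exclusions exceed that total.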

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


