
I have thousands of PDF files on one server, and I want to develop an internal web application that searches for words inside each file.

A week ago I wrote a DLL for this (using iTextSharp), but it takes a long time to return a response (approx. 15 minutes).

What can I do to improve the response time?
Any ideas?


PS: I currently have 12,000 PDF files, and the URL of each file is stored in a SQL Server database.
Posted 21-Feb-12 6:41am

1 solution


Solution 1

I would think that you need to create an index of all the words in your files so you can do speedy searches. You could build it up front by scanning all the files, or build it up gradually as each search is processed. Unfortunately, the second option means that early searches will still be quite slow.
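
To make the idea concrete, here is a minimal sketch of such a word index (an inverted index), written in Python for brevity rather than in the poster's .NET setup. It assumes the text of each PDF has already been extracted (for example with iTextSharp) and is passed in as a plain string; the names `tokenize`, `build_index`, and `search` are hypothetical helpers for illustration, not part of any library.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of document URLs that contain it.

    `documents` is a dict of {url: extracted_text}. Extracting the text
    from the PDF files is assumed to have happened before this step.
    """
    index = defaultdict(set)
    for url, text in documents.items():
        for word in tokenize(text):
            index[word].add(url)
    return index


def search(index, query):
    """Return the URLs of documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    # Start with the matches for the first word, then narrow the set down.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result
```

With an index like this, the slow pass over all the files happens once when the index is built, and each search becomes a handful of lookups. In practice the index would probably need to be stored somewhere persistent and updated when files are added or changed; how best to do that depends on the setup.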

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Last Updated 21 Feb 2012
Copyright © CodeProject, 1999-2015