Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
hi
 
i have thousands of pdf files in one server and i want to develop and internal web application to search some words inside each file....
 
one week ago i did a dll for this (using itextsharp), but it take a long time to receive a response (15 minutes aprox.)
 
what can i do to improve the response time?
any idea?
 
thanks
 
PS - actually i have 12000 pdf files and i have the url of each file in a sqlserver database
Posted 21-Feb-12 7:41am

1 solution

Rate this: bad
good
Please Sign up or sign in to vote.

Solution 1

I would think that you need to create an index of all the words in your files so you can do speedy searches. You could do this immediately by scanning all files or build it up gradually by creating it as each search is processed. Unfortunately the second option will still mean that early searches will still be quite slow.
  Permalink  

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

  Print Answers RSS
0 OriginalGriff 5,170
1 DamithSL 4,357
2 Maciej Los 3,750
3 Kornfeld Eliyahu Peter 3,470
4 Sergey Alexandrovich Kryukov 2,851


Advertise | Privacy | Mobile
Web01 | 2.8.141216.1 | Last Updated 21 Feb 2012
Copyright © CodeProject, 1999-2014
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100