I used indexing throw Lucene.net, but the max time that lucene.net can give you the result is 6 minute.
My website based on the search performance, i need another strategy or methodology to using them.
If someone know another way else (lucene.net) to search about text in million of records.
If the text records are in a database using the database manager's build in search functionality will probably be a lot faster. After all, that is what is build for.
If you read the text from files on the hard disk it might be your implementation. I am not familiar with Lucene and how it works, but if it looks through those files each query there is a lot of overhead in opening and closing the files each time.
I personally once wrote a search system that indexed txt files, but since the number of them was quite small, as was the amount of content, I just read the file as one long string and dumped it into SQLServer. Performance was very well, but I don't think this will be the most optimal strategy if you have 1M PDFs to index.
if you could you go into some more detail of the system: where does the text come from, where is it stored, how much text is in each record, that sort of thing, we might be able to provide some better, more directly usable, advice.
This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)