Page 1 of 1

Is there a limit when searching a pdf's contents?

PostPosted:Wed Apr 23, 2008 2:13 am
by thuper
Hi

I uploaded a test.pdf in /user2 in demosite. When I search it\'s contents. It seems could only search the contents of first paragragh.

Maybe when the file is indexed to database. There is a limit of total contents size, right?

Can this limit be changed larger when I install it to my computer.

thanks

Re:Is there a limit when searching a pdf's contents?

PostPosted:Wed Apr 23, 2008 8:20 am
by pavila
No there is no limit in content size. The search is done using Lucene and the text is extracted from PDF using PDFBox. I\'ve tested to look for \"开会这‡天正好\" which is in the last paragraph in the last page and seems to work. Anyway the system is preconfigured to work with english language and works with several others latin based languages, but to work better with chinese a tunning should be done.