Hi Geoff,
thank you for the reply.
My need is to offer a search facility on my website permitting searching on file content.
My website contain a document repository (pdf, ppt, doc, xls..) and my visitors when they lanch a search request with keywords will have a result if there keywords are on the file title or on the file description (stored in mysql tables).In the current situation if the keywords exists on the file content and don't exist on the title or on the description they will not have a positive result.
You may agree that is not the best way.
in my recent research i have found 2 solutions :
- converting all my files (pdf, ppt, doc, xls..) to xml format and populate them on mysql database to be searchable.
- or installing a search index (like microsoft index called catalog or swish-e or glimpse) in order to index all the files content stored in a specified directory not all ones in my server and be able to return a positive result when the keyword is indexed.
As i readed recently, glimpse can be added as an add on cpanel and swish is using a perl api called by php so i may added in the perl module so both are normally feasible !
I am asking to have a feedback from surmunity member if someones have implemented some of these solutions, how to implement them or is there another better way to do what i want.
Thanks a lot for any help.