Hello,
I'm wondering if other types of files are supported by the full-text search. Like PDF's, Word Docs, Excel files, etc.
If it's not yet supported, then when would this feature be implemented?
Thanks,
Takumi
Reading content out of PDF files it not easy (or it is a bit harder to setup and costs some serious money - see the pricing for PDFLib TET). Or you know some other, free PHP tool that can extract text from PDF files?
Searching through office documents is not hard, but it is unreliable because they are closed formats and I don't know how they work. Or you know some text extraction tool written in PHP? :D
I dug a little on Google, found this:
I'm not that familiar with PHP, but I did a quick search and found this script. http://www.conradish.net/pdfhi.php.txt (from: http://www.thescripts.com/forum/thread3742.html )
And then for Word docs, here's something that converts word files to other formats: http://www.granneman.com/techinfo/windows/extractcontentfromword.htm (requires a windows os i think)
if we are willing to pay a fee for licensing would this option also work? http://www.phpwordlib.motion-bg.com/
Actually, it means Yes - #192 is scheduled for 0.8 so it will be added in next 4 - 5 months and it will include PDF support and support for office documents.
I'm wondering if other types of files are supported by the full-text search. Like PDF's, Word Docs, Excel files, etc.
If it's not yet supported, then when would this feature be implemented?
Thanks,
Takumi