This topic is locked, no new messages can be posted
Takumi Pro Nov 8. 2006. 6:25 am
Hello,
I'm wondering whether the full-text search supports other file types, such as PDFs, Word documents, Excel files, etc.
If it's not yet supported, when will this feature be implemented?
Thanks,
Takumi
Ilija Studen Staff Nov 8. 2006. 7:15 am
Reading content out of PDF files is not easy (or rather, it is a bit harder to set up and costs some serious money; see the pricing for PDFlib TET). Or do you know of some other, free PHP tool that can extract text from PDF files?

Searching through office documents is not hard, but it is unreliable because they are closed formats and I don't know how they work. Or do you know of a text extraction tool written in PHP? :D
Takumi Pro Nov 8. 2006. 7:22 am
Hm... I don't know of a way to do either... Would it be possible to integrate Google search with an aC installation?
Ilija Studen Staff Nov 8. 2006. 7:24 am
tshimada wrote: "Hm... I don't know of a way to do either... Would it be possible to integrate Google search with an aC installation?"

I honestly don't know. What I do know is that Googlebot can't get past the login screen, so it can't index content posted by users.
Takumi Pro Nov 8. 2006. 7:32 am
I'm not that familiar with PHP, but I dug around a little on Google and found this script for PDFs: http://www.conradish.net/pdfhi.php.txt (from: http://www.thescripts.com/forum/thread3742.html )
For Word docs, here's something that converts Word files to other formats: http://www.granneman.com/techinfo/windows/extractcontentfromword.htm (requires a Windows OS, I think)
If we are willing to pay a licensing fee, would this option also work? http://www.phpwordlib.motion-bg.com/
Ilija Studen Staff Nov 8. 2006. 7:47 am
activeCollab needs to be self-contained, so no external services or commercial tools are (or will be) used.

Ticket #192
Takumi Pro Nov 8. 2006. 7:56 pm
So even PDF integration is a no? (Sorry, it might seem like a 'duh' question)
Ilija Studen Staff Nov 9. 2006. 12:06 am
Actually, it means yes: #192 is scheduled for 0.8, so it will be added in the next 4-5 months, and it will include support for both PDF and office documents.
Takumi Pro Nov 9. 2006. 5:34 am
Awesome. Thanks. If there's anything I can do to help, email me. :)
Nov 9. 2006. 9:56 am
Correct me if I'm wrong, but I'm sure I read that PHP 5.2.0 has a built-in PDF extension.
