Login or Register

RSS IconRecent posts in this topic

avatar Pro
Takumi on Nov 8. 2006. 12:25 pm
Hello,
I'm wondering if other types of files are supported by the full-text search. Like PDF's, Word Docs, Excel files, etc.
If it's not yet supported, then when would this feature be implemented?
Thanks,
Takumi
avatar Staff
Ilija Studen on Nov 8. 2006. 1:15 pm
Reading content out of PDF files it not easy (or it is a bit harder to setup and costs some serious money - see the pricing for PDFLib TET). Or you know some other, free PHP tool that can extract text from PDF files?

Searching through office documents is not hard, but it is unreliable because they are closed formats and I don't know how they work. Or you know some text extraction tool written in PHP? :D
activeCollab team member | LinkedIn
avatar Pro
Takumi on Nov 8. 2006. 1:22 pm
Hm... I don't know a way for either... Would it be possible to integrate Google search with an aC installation?
avatar Staff
Ilija Studen on Nov 8. 2006. 1:24 pm
tshimada:
Hm... I don't know a way for either... Would it be possible to integrate Google search with an aC installation?


I honestly don't know. The thing that I do know is that GoogleBot can't go past login screen so it can't index content posted by users.
activeCollab team member | LinkedIn
avatar Pro
Takumi on Nov 8. 2006. 1:32 pm
I dug a little on Google, found this:
I'm not that familiar with PHP, but I did a quick search and found this script. http://www.conradish.net/pdfhi.php.txt (from: http://www.thescripts.com/forum/thread3742.html )
And then for Word docs, here's something that converts word files to other formats: http://www.granneman.com/techinfo/windows/extractcontentfromword.htm (requires a windows os i think)
if we are willing to pay a fee for licensing would this option also work? http://www.phpwordlib.motion-bg.com/
avatar Staff
Ilija Studen on Nov 8. 2006. 1:47 pm
activeCollab needs to be self-contained so no external services or commercial tools are (or will be) used.

Ticket #192
activeCollab team member | LinkedIn
avatar Pro
Takumi on Nov 9. 2006. 1:56 am
So even PDF integration is a no? (Sorry, it might seem like a 'duh' question)
avatar Staff
Ilija Studen on Nov 9. 2006. 6:06 am
Actually, it means Yes - #192 is scheduled for 0.8 so it will be added in next 4 - 5 months and it will include PDF support and support for office documents.
activeCollab team member | LinkedIn
avatar Pro
Takumi on Nov 9. 2006. 11:34 am
Awesome. Thanks. If there's anything I can do to help, email me. :)
avatar
Nick on Nov 9. 2006. 3:56 pm
Correct me if I'm wrong, but I'm sure I read that PHP 5.2.0 has a built-in PDF extension.
Topic is locked. If you have something important to say about issues discussed on this page please write at hi@a51dev.com.

RSS IconRecent posts in this topic