FUDforum
Fast Uncompromising Discussions. FUDforum will get your users talking.

Home » Imported messages » comp.lang.php » spider PDF
Show: Today's Messages :: Polls :: Message Navigator
Switch to threaded view of this topic Create a new topic Submit Reply
spider PDF [message #174961] Tue, 26 July 2011 20:23 Go to next message
j is currently offline  j
Messages: 9
Registered: July 2011
Karma: 0
Junior Member
What options are there for reading the content of a PDF file? I'd like
to be able to spider pdfs so they can be stuffed into a MySQL full text
search.

I found this:

http://www.foolabs.com/xpdf/about.html

But I think that will be tough to get installed on a shared hosting web
server.

Jeff
Re: spider PDF [message #174964 is a reply to message #174961] Wed, 27 July 2011 10:44 Go to previous message
PP is currently offline  PP
Messages: 4
Registered: July 2011
Karma: 0
Junior Member
> But I think that will be tough to get installed on a shared hosting web
> server.
>    Jeff


I think you can give a look here:
http://stackoverflow.com/questions/1251956/is-there-a-pdf-parser-for-php

Do you need to extract text from pdf ?
If yes, I think you are looking for a parser and not a spider.

M.
  Switch to threaded view of this topic Create a new topic Submit Reply
Previous Topic: email servers
Next Topic: Getting error while requesting data from the UPS webservice
Goto Forum:
  

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ]

Current Time: Sun May 26 11:50:50 GMT 2024

Total time taken to generate the page: 0.04165 seconds