Blog
Developers
Careers
Support
Contact
Gossamer Threads
Solutions
Results
About
Mailing Lists
Resource Centre
Forum
Tools
Home
Who's Online
Tags
Favourites
Login
Forum Search
(
Advanced Search
)
This forum
This category
All forums
for
Home
:
General
:
Perl Programming
:
any module for PDF->html, DOC->html?
Previous Thread
Next Thread
Print Thread
View Threaded
Apr 3, 2003, 2:12 PM
long327
User
(248 posts)
Apr 3, 2003, 2:12 PM
Post #1 of 9
Views: 6529
Shortcut
any module for PDF->html, DOC->html?
Hi all,
Is there any perl module for converting PDF and DOC files to HTML? I could not find any at CPAN.
Thanks.
Long
Apr 3, 2003, 3:14 PM
Paul
Veteran
(19537 posts)
Apr 3, 2003, 3:14 PM
Post #2 of 9
Views: 6422
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
Easiest thing is probably post to the pre made adobe script...
http://access.adobe.com:8088/ads-cgi/convert.pl
Apr 3, 2003, 3:34 PM
long327
User
(248 posts)
Apr 3, 2003, 3:34 PM
Post #3 of 9
Views: 6394
Shortcut
Re: [Paul] any module for PDF->html, DOC->html?
In reply to
Hi Paul,
Thank you for reply, but I need to use the module in my code. They don't provide the source code.
Long
Apr 3, 2003, 3:38 PM
Paul
Veteran
(19537 posts)
Apr 3, 2003, 3:38 PM
Post #4 of 9
Views: 6450
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
>>
I need to use the module in my code.
<<
Why?
Apr 3, 2003, 4:07 PM
long327
User
(248 posts)
Apr 3, 2003, 4:07 PM
Post #5 of 9
Views: 6459
Shortcut
Re: [Paul] any module for PDF->html, DOC->html?
In reply to
I am working on a plugin which caches web pages for links in LinkSql database. It's essentially what Google does.
Long
Apr 4, 2003, 12:07 AM
yogi
Veteran
(2199 posts)
Apr 4, 2003, 12:07 AM
Post #6 of 9
Views: 6402
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
There's a command line utility called 'pdftotext' (on linux) that might do the job. I am not aware of any perl solutions.
Ivan
-----
Iyengar Yoga Resources
/
GT Plugins
Apr 4, 2003, 3:24 AM
Wil
Veteran
/ Moderator
(4108 posts)
Apr 4, 2003, 3:24 AM
Post #7 of 9
Views: 6452
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
pdf2html
http://www.google.com/...r=&ie=ISO-8859-1
pdf2doc
http://www.google.com/...p;btnG=Google+Search
- wil
Apr 4, 2003, 6:40 AM
Paul
Veteran
(19537 posts)
Apr 4, 2003, 6:40 AM
Post #8 of 9
Views: 6382
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
I don't know of a pure perl solution. The only thing I can suggest to do is install something like this:
ftp://atrey.karlin.mff.cuni.cz/...ocal/clock/pdf2html/
...and then you can execute it with a system command in your script.
Apr 4, 2003, 6:41 AM
Paul
Veteran
(19537 posts)
Apr 4, 2003, 6:41 AM
Post #9 of 9
Views: 6401
Shortcut
Re: [long327] any module for PDF->html, DOC->html?
In reply to
Quote:
I am working on a plugin which caches web pages for links in LinkSql database. It's essentially what Google does.
That's fair enough but I'm not sure why you need the source code to be included in your script.
Previous Thread
Next Thread
Print Thread
View Threaded