PDF, PS, DOC, ...

P

Pif

Hi,

I'm looking for one or several library(ies) in Java that should allow to
edit PDF files, MS-DOC, OpenOffice, PS ...

Each library can just manage one file format. I'm trying to extract text
(without presentation information) from thoses files, like Google for
example to extract relevant keywords.

Can somebody suggest me tools ?

Thanks a lot.
 
C

Chris

Pif said:
I'm looking for one or several library(ies) in Java that should allow to
edit PDF files, MS-DOC, OpenOffice, PS ...

Each library can just manage one file format. I'm trying to extract text
(without presentation information) from thoses files, like Google for
example to extract relevant keywords.

Dieselpoint Search extracts text from PDF, Word, etc. and indexes it for
searching. http://dieselpoint.com
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,769
Messages
2,569,580
Members
45,054
Latest member
TrimKetoBoost

Latest Threads

Top