Accessing PDF Metadata and Page Thumbnails

Ben Gribaudo · Jul 26, 2007

Hello,

I am putting together a PDF archive of our corporate newsletters. I'd
like to iterate though a directory of PDFs, read their metadata (title,
description, etc.) and use this info to dynamically generate a RHTML
index page. There are several Ruby PDF libraries out there but they seem
inclined towards creating PDFs instead of reading them. Any
recommendations on a library to read PDF metadata?

It would be neat to not only read metadata but also to pull the PDF's
first page's thumbnail out as an image. This would allow dynamic
creation of an index page that looks like this:
http://www.reviveourhearts.com/difference/newsletter/newsletter_archive.php

Any thoughts?

Thanks,
Ben

Eugen Minciu · Jul 27, 2007

Excerpts from Ben Gribaudo's message of Thu Jul 26 19:33:32 +0300 2007:

Hello,

I am putting together a PDF archive of our corporate newsletters. I'd
like to iterate though a directory of PDFs, read their metadata (title,
description, etc.) and use this info to dynamically generate a RHTML
index page. There are several Ruby PDF libraries out there but they seem
inclined towards creating PDFs instead of reading them. Any
recommendations on a library to read PDF metadata?

It would be neat to not only read metadata but also to pull the PDF's
first page's thumbnail out as an image. This would allow dynamic
creation of an index page that looks like this:
http://www.reviveourhearts.com/difference/newsletter/newsletter_archive.php

Any thoughts?

Have a look at http://extractor.rubyforge.org . You need libextractor
and its headers to compile it though. Would that work for you?

Ruby Weekly News 28th February - 6th March 2005	1	Mar 6, 2005
comp.lang.vhdl FAQ part 1 of 4: general	0	Jul 8, 2003
comp.lang.c Answers to Frequently Asked Questions (FAQ List)	15	Apr 1, 2006
comp.lang.c Answers to Frequently Asked Questions (FAQ List)	1	Feb 1, 2004
comp.lang.vhdl FAQ part 3 of 4: products & services	0	Jul 8, 2003

Accessing PDF Metadata and Page Thumbnails

Ben Gribaudo

Eugen Minciu

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads