Announcement: Cool MHTML library available for review (pre-alpha).

Peter Rilling · Feb 9, 2006

Hi,

I have uploaded a library that, I hope, is better than using CDOSYS/CDONTS
for handling MHTML downloads. Right now the infrastructure for downloading
is in place, but I have not gotten the persistent system in place, but I
will be working on that shortly. If nothing else, this library provides a
simple way to download a page and all its referenced resources.

Here are some of the features that it currently supports:
** Downloads a page and all its referenced resources.
** Downloads them recursively. For instance, a CSS will be downloaded if
referenced by a page, but if that CSS references any images, those will also
be downloaded.
** Only one instance of a resource is downloaded regardless of how many
times they are referenced. The instance is still associated with all the
parent pages that contain it.
** Currently downloadable types include: audio, images, css, html, xml,
scripts.
** Current types of references that are processed include:
background/foreground images (both html and css), css, background sound,
JavaScript, iframe and framesets, xml islands.
** Cool demo app that shows the downloaded content allowing you to sort by
type or referenced relationship.

The following are issues that are on my agenda:
** Update the URLs since they will be eventually viewed locally.
** Support saving to various forms, including the a single mhtml file.
** Support loading of single mhtml files.
** Fix bugs.

Now, what I need from this community is to put this code through its paces.
Find any bugs (including URLs that are not processed correctly). I also
like constructive criticism so any comments about my architecture would be
great, keeping in mind this is pre-alpha so it is far from perfect.

You can download it at http://www.codeproject.com/useritems/mhtmllib.asp.

[ANN] BackgrounDRb release 1.0 available now	4	Dec 17, 2007
Best way to cache a bunch of images on the client? Possible to "stream" image info for caching after	1	Jun 16, 2007
[ANN] Packet - 0.1.7, Ruby Library for EventDriven Networkprogramming	0	Jul 13, 2008
[ANN] Packet 0.1.3 - Library for Event Driven Network Programming	0	Feb 11, 2008
FLV download script works, but I want to enhance it	3	May 6, 2009
Is an ISP's geographical location relevant?	3	Oct 7, 2008
Google Summer of Code proposal: GRuVeR: A General Ruby library forsolving Routing Vehicle Problems	6	Mar 29, 2008
A "How to" about installing Ruby/TK for the Windows RubyInstaller	0	Mar 10, 2010

Announcement: Cool MHTML library available for review (pre-alpha).

Peter Rilling

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads