Is there a library to parse Mozilla "mork" documents?

John Reese · Jan 21, 2005

Mozilla, Firefox, Thunderbird, and so forth use this awful format
called MORK to store all kinds of things: which messages you've read
in a newsgroup, headers and indexes into the mbox file of messages in
a mail folder, and address books. It's documented to some extent
here:
http://www.mozilla.org/mailnews/arch/mork/primer.txt

Does anyone know of a Python library for parsing these files? A
single file basically just stores the equivalent of a nested
dictionary with text that can be declared separately and interpolated.
jwz has an over-specific perl version at
http://www.jwz.org/hacks/marginal.html, which I might have to try to
translate if there's nothing already available in Python.

Peter Rowell · Jan 21, 2005

John said:
Mozilla, Firefox, Thunderbird, and so forth use this awful format
called MORK to store all kinds of things: which messages you've read

[ snip ]

I was searching on a similar question (about accessing the history)
when I came across a nifty little bookmarklet. It dumps FF history in
RDF format to the file of your choice. This temporarily solved
my problem, although in the long run I want to have direct read
access to the info.

Perhaps you can get a few ideas and go from there. The bookmarlet
was attached to Bugzilla item 241438.
https://bugzilla.mozilla.org/show_bug.cgi?id=241438

HTH,
Peter

Tim Roberts · Jan 21, 2005

John Reese said:
Mozilla, Firefox, Thunderbird, and so forth use this awful format
called MORK to store all kinds of things: which messages you've read
in a newsgroup, headers and indexes into the mbox file of messages in
a mail folder, and address books.

Yes. What a crock that is. The MORK format is a great way to compress
tabular information, IF the information consists of the same pieces of data
over and over. E-mail boxes do not fit into that class, so I have no doubt
that the typical Thunderbird MORK file is singificantly LARGER than the
same file would be in, say, INI format.

I wrote a Python script to parse it, but it isn't terribly robust. I was
able to produce a dictionary, but I didn't do anything with the results.
You're welcome to take a look:
http://www.probo.com/timr/parsemsf.py

John Reese · Jan 21, 2005

Yes. What a crock that is. The MORK format is a great way to compress
tabular information, IF the information consists of the same pieces of data
over and over. E-mail boxes do not fit into that class, so I have no doubt
that the typical Thunderbird MORK file is singificantly LARGER than the
same file would be in, say, INI format.

I wrote a Python script to parse it, but it isn't terribly robust. I was
able to produce a dictionary, but I didn't do anything with the results.
You're welcome to take a look:
http://www.probo.com/timr/parsemsf.py

Thanks, I'll work with this. I have to say that this has all been
worth it just to read about Jamie Zawinski railing against this file
format. I think your comment at the top sums it up well:

# Why am I doing this?

Is there a library to parse Mozilla "mork" documents?	0	Jan 21, 2005
Is there a python library to parse C++ code file?	2	Mar 17, 2006
comp.lang.c Answers to Frequently Asked Questions (FAQ List)	15	Apr 1, 2006
comp.lang.c Answers to Frequently Asked Questions (FAQ List)	1	Feb 1, 2004

Is there a library to parse Mozilla "mork" documents?

John Reese

Peter Rowell

Tim Roberts

John Reese

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads