Bookmark URL Parsing

Thread starter Timothy Wu
Start date Feb 25, 2004

Timothy Wu

Feb 25, 2004

Hi,

I'm trying to parse FireFox bookmark files manually using regular
expressions. I tried to match key-value pairs in tags like the following:

matches = re.findall(r'(\S+)="(.+)"', text)

However, I find that if the URL I'm matching contains something
non-standard I may encounter a problem. For example, one of the link
content I have is javascript code and contains character %22(as shown
when opening with VI). I've figure out that %22 equals to the quotation
mark '"'. That interferes with my match.

How exactly does %22 maps to the quotation mark? I know I often see the
kind of representation in a URL, but what exactly is it and where would
I find info on that? And most importantly, how do I make my match work?

Timothy

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

Python pyPDF4 code to bookmark pdf based upon date text	1	Jan 18, 2023
HOWTO: Parsing email using Python part2	1	Jul 15, 2011
a little parsing challenge â˜º	70	Jul 17, 2011
HOWTO: Parsing email using Python part1	2	Jul 3, 2011
How to Make CSV Contact Files Work Seamlessly Across All Smartphones?	0	Sep 17, 2025
Question about Munged URL and bookmark / favourite	2	Oct 27, 2004
implementation for Parsing Expression Grammar?	3	May 10, 2008
Must be a bug in the re module [was: Why this result with the remodule]	0	Nov 2, 2010

Facebook Twitter Reddit Pinterest Tumblr WhatsApp Email Link

Members online

LisetteFre

Total: 228 (members: 1, guests: 227)
Robots: 446

Forum statistics

Threads: 474,432

Messages: 2,571,681

Members: 48,796

Latest member: Greg L.

Latest Threads

Will programmers be doomed since AI can write code in seconds?
- Started by John Joe
- Yesterday at 12:39 PM
Files Uploaded to Google Drive but Not Visible Anywhere
- Started by henrywalker
- Thursday at 5:54 AM
Why cant I print my secured PDF file and how can I fix it?
- Started by vorix28193
- Wednesday at 8:14 AM
Can PST files be converted to EML without Outlook?
- Started by samikshasen34
- Wednesday at 6:35 AM
How Can I Convert Outlook PST Files to MBOX Without Losing Attachments?
- Started by annawelson
- Monday at 2:24 PM
Lost in Multiple Mail Folders? Merge PST Files Easily
- Started by juliewhite
- May 27, 2026
Colspan probs
- Started by jakey
- May 21, 2026
Dicy dice
- Started by WhiteCube
- May 13, 2026
Need a reliable PST Converter Software for Outlook mailbox conversion
- Started by Damian01
- May 9, 2026
Need a PST Converter Free Download to Check Emails Before Export
- Started by vorix28193
- May 5, 2026

Top