negate a match in regex

ikeon · Nov 27, 2008

Hi All,
I have a script that converts XML tags to HTML: "<" is converted to
"&lt;" and so on.
After the conversion I need to capture the information inside the tag.
Take for example the string "&lt;abcd&gt;", which is equivalent to
"<abcd>".
I tried to capture the "abcd", which can differ from tag to tag,
in the following way:

/\&lt\;([^\&\gt\;]*)/

That is, match "&lt;" and then match anything that is not "&gt;".
The thing is that it doesn't work on all tags for some reason, and I
was wondering whether, in principle, [^somestring] is supposed to
work?

Thanks.

Peter Makholm · Nov 27, 2008

ikeon said:
That is, match "&lt;" and then match anything that is not "&gt;".
The thing is that it doesn't work on all tags for some reason, and I
was wondering whether, in principle, [^somestring] is supposed to
work?

No, using [^string] won't work as you're expecting. Just as
[string] doesn't match 'string' but only one of the letters s, t, r,
i, n, or g, [^string] just matches one character which isn't in 'string'.

What you need is a negative look-ahead (?!string). Read 'perldoc
perlre' for the explanation of it.

//Makholm
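
To make the difference concrete, here is a minimal sketch comparing the two approaches. The sub names and sample strings are made up for illustration and are not from the thread. The negated class treats "&gt;" as a set of single characters, so it stops at the first '&', 'g', 't' or ';' in the tag name, while the look-ahead only stops at the whole sequence "&gt;".

```perl
use strict;
use warnings;

# Capture using a negated character class, as in the original attempt.
# [^&gt;] excludes the single characters '&', 'g', 't' and ';'.
sub capture_with_class {
    my ($text) = @_;
    my ($name) = $text =~ /&lt;([^&gt;]*)/;
    return $name;
}

# Capture using a negative look-ahead: take one character at a time,
# but only while the upcoming text is not the whole sequence "&gt;".
sub capture_with_lookahead {
    my ($text) = @_;
    my ($name) = $text =~ /&lt;((?:(?!&gt;).)*)&gt;/;
    return $name;
}

# Hypothetical sample inputs; "target" contains both 't' and 'g'.
for my $text ('&lt;abcd&gt;', '&lt;target&gt;') {
    printf "%-16s class='%s' look-ahead='%s'\n",
        $text, capture_with_class($text), capture_with_lookahead($text);
}
```

With "abcd" both subs return the full name, which may be why the original pattern appeared to work on some tags. With "target" the character-class version captures an empty string because the name starts with 't'.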

John W. Krahn · Nov 27, 2008

ikeon said:
I have a script that converts XML tags to HTML: "<" is converted to
"&lt;" and so on.
After the conversion I need to capture the information inside the tag.
Take for example the string "&lt;abcd&gt;", which is equivalent to
"<abcd>".
I tried to capture the "abcd", which can differ from tag to tag,
in the following way:

/\&lt\;([^\&\gt\;]*)/

You probably want something like:

/&lt;(.*?)&gt;/

John

ikeon · Nov 27, 2008

John W. Krahn said:
You probably want something like:

/&lt;(.*?)&gt;/

The (?!string) didn't work for some reason, but I have learned a lot
from "perldoc perlre".

The solution was /&lt;(.*?)&gt;/, which is the simple one. I tried it
with only (.*) but it was "greedy".

Thanks John and Peter for your quick response.
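
For readers who hit the same "greedy" behaviour, here is a short sketch of the difference. The input string and variable names are invented for illustration; only the two patterns come from the thread.

```perl
use strict;
use warnings;

# Hypothetical input containing two escaped tags on one line.
my $text = '&lt;abcd&gt; some text &lt;efgh&gt;';

# Greedy: .* runs on to the last "&gt;" in the string.
my ($greedy) = $text =~ /&lt;(.*)&gt;/;

# Non-greedy: .*? stops at the first "&gt;" that lets the match succeed.
my ($lazy) = $text =~ /&lt;(.*?)&gt;/;

# With /g in list context, the non-greedy form returns every tag name.
my @names = $text =~ /&lt;(.*?)&gt;/g;

print "greedy: $greedy\n";
print "lazy:   $lazy\n";
print "all:    @names\n";
```

Here the greedy capture swallows everything between the first "&lt;" and the last "&gt;", while the non-greedy one yields just "abcd". Note that "." does not match a newline unless the /s modifier is used, so a tag split across lines would need that flag.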

sln · Nov 28, 2008

ikeon said:
I have a script that converts XML tags to HTML: "<" is converted to
"&lt;" and so on.
After the conversion I need to capture the information inside the tag.
Take for example the string "&lt;abcd&gt;", which is equivalent to
"<abcd>".

I'm still confused with your terminology 'xml tags to html'.
So be it.

How do you go from "<abcd>" to "&lt;abcd&gt;" without capturing
'abcd'?

sln
