RegEx replace "bracketed text"

hrrglburf · Apr 16, 2006

I have information that needs to strip out all tags that start with a
'{' and end with a '}' including whatever may be in between them, but
not outside of them... I tried making my own reg. exp. but i suck at
it. can anyone give me an example?

news reader · Apr 16, 2006

Something like

s/\{.*?\}//g;

An example may be useful to avoid misunderstandings.

The siutation complicates a little if '\}' may be part of a tag.

Current exanple would read in:
dasdfsafdsadsa{dsadas}d dasdasd{dasdas} fsddfsf{dsadas}

and spit out
dasdfsafdsadsad dasdasd fsddfsf

bye

N.

news reader · Apr 16, 2006

Something like

s/\{.*?\}//g;

An example may be useful to avoid misunderstandings.

The siutation complicates a little if '\}' may be part of a tag.

Current exanple would read in:
dasdfsafdsadsa{dsadas}d dasdasd{dasdas} fsddfsf{dsadas}

and spit out
dasdfsafdsadsad dasdasd fsddfsf

bye

N.

Gunnar Hjalmarsson · Apr 16, 2006

Dale said:
MD> Assuming that tags don't nest and that matching '{' and '}'
MD> are not separated by line-ends, the following works:

MD> s/\{.*?\}//g

A more efficient solution is:
s/\{[^}]*\}//g

with the \s modifier this will work across line-ends.

The /s modifier isn't needed for that.

Peter J. Holzer · Apr 16, 2006

Marc said:
Dale Henderson said:

"MD" == Marc Dashevsky <[email protected]> writes:

Click to expand...

MD> Assuming that tags don't nest and that matching '{' and '}'
MD> are not separated by line-ends, the following works:

MD> s/\{.*?\}//g

A more efficient solution is:
s/\{[^}]*\}//g

Click to expand...

Thanks. Would you explain the reasons for the increased efficiency?
I don't know how to even start the analysis.

Theoretically both should be about the same speed since both require
only a linear scan for a single character without backtracking. A simple
benchmark shows that the first expression is slightly faster on my
system:

#!/usr/bin/perl
use strict;
use warnings;

use Benchmark ':all';

my $s = "aaaaaa{bbbbbbbbbbbb}cccccccccc{ddddddddd}eeeeeee";

cmpthese(100000,
{
nongreedy => sub {
local $_ = $s;
s/\{.*?\}//g;
},
class => sub {
local $_ = $s;
s/\{[^}]*\}//g
},
}
);
__END__
Rate class nongreedy
class 90909/s -- -13%
nongreedy 104167/s 15% --

Dr.Ruud · Apr 17, 2006

Dale Henderson schreef:

Marc Dashevsky:

<unattributed>
s/\{.*?\}//g

A more efficient solution is: s/\{[^}]*\}//g

Click to expand...

Click to expand...

Thanks. Would you explain the reasons for the increased
efficiency? I don't know how to even start the analysis.

Click to expand...

It's spelled out in the Owl book

That could be old news.

As I understand it, the non-greedy operator gives up too easily.

That could have been optimized already. The patterns /[^x]*x/ and /.*?x/
have a lot in common.

Regex replace problem	2	Jan 6, 2022
"input-group-text" help	7	Aug 10, 2023
search replace with regex	6	Nov 25, 2011
help with perl search/replace regex	3	Jan 22, 2010
Search and replace text in XML file?	5	Jul 28, 2012
Clickable link conversion regex?	0	Nov 30, 2012
I need help fixing my website	2	Oct 15, 2023
Regex: match double OR single quote	4	Jul 12, 2012

RegEx replace "bracketed text"

hrrglburf

news reader

news reader

Gunnar Hjalmarsson

Peter J. Holzer

Dr.Ruud

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads