Match CASE/END SQL Construct

Perry Aynum · Jan 15, 2009

I am working on a SQL parser. I have a routine that recursively removes
enclosing parentheses and it works fine. Below is the regex that I use.

However, I want to use the same routine, but instead of looking for
enclosing parens, I want to look for a string enclosed by CASE and END. Can
someone help me translate the regex below so that it will match a CASE/END
construct?

Thanks very much.

Parens
----------
(?:\s+)?$[^\($]*\)

This is what I've managed so far with the CASE/END

(?:\s+)?case(?!case|end)\s+end

sln · Jan 15, 2009

I am working on a SQL parser. I have a routine that recursively removes
enclosing parentheses and it works fine. Below is the regex that I use.

However, I want to use the same routine, but instead of looking for
enclosing parens, I want to look for a string enclosed by CASE and END. Can
someone help me translate the regex below so that it will match a CASE/END
construct?

Thanks very much.

Parens
----------
(?:\s+)?$[^\($]*\)

This is what I've managed so far with the CASE/END

(?:\s+)?case(?!case|end)\s+end

Its probably not this simple.

sln

-------------------------

use strict;
use warnings;

my $txt = "(this (is a) test)";

while ($txt =~ s/$([^()]*?)$/$1/) {};

print $txt,"\n";

$txt = "case this case is a end test end";

while ($txt =~ s/case\s+(.*?)\s+end/$1/) {};

print $txt,"\n";

__END__

this is a test
this is a test

Jim Gibson · Jan 16, 2009

Perry Aynum said:
I am working on a SQL parser. I have a routine that recursively removes
enclosing parentheses and it works fine. Below is the regex that I use.

However, I want to use the same routine, but instead of looking for
enclosing parens, I want to look for a string enclosed by CASE and END. Can
someone help me translate the regex below so that it will match a CASE/END
construct?

Thanks very much.

Parens
----------
(?:\s+)?$[^\($]*\)

This is what I've managed so far with the CASE/END

(?:\s+)?case(?!case|end)\s+end

Have you tried m{ case \s* (.*?) \s* end }ix

Tad J McClellan · Jan 17, 2009

You have just not yet encountered a test case where it does not work fine...

(?:\s+)?$[^\($]*\)

Click to expand...

Click to expand...

Parenthesis in character classes are not "special" and therefore
do not need to be backslashed...

s/Test/Text/

Why would one need a module for something so apparently simple?

Because appearances can be deceiving.

$_ = "(an opening parenthesis ('(') starts a 'memory' in a Perl regex.)\n";
print "$&\n" if /(?:\s+)?$([^()]*)$/;

Tad J McClellan · Jan 17, 2009

Jim Gibson said:
Have you tried m{ case \s* (.*?) \s* end }ix

Click to expand...

Have you tried that with:

$_ = "fracases can erupt even among friends\n";

Tad J McClellan · Jan 17, 2009

Are you "Perry Aynum" or are you "Buck Turgidson"?

sln · Jan 20, 2009

I am working on a SQL parser. I have a routine that recursively removes
enclosing parentheses and it works fine. Below is the regex that I use.

However, I want to use the same routine, but instead of looking for
enclosing parens, I want to look for a string enclosed by CASE and END. Can
someone help me translate the regex below so that it will match a CASE/END
construct?

Thanks very much.

Parens
----------
(?:\s+)?$[^\($]*\)

This is what I've managed so far with the CASE/END

(?:\s+)?case(?!case|end)\s+end

I've revisited this, became intrigued with zero-assertion width
extented regexp constructs. These constructs don't get enough air-time
here. Since you appear to be leaning in that direction, I thought I would
flesh out a look ahead regexp for your example, perhaps to try to glean insight on
the regexp engine, not really sure. Its very facinating for me. I'm not a big book
reader since I am dislexic, so I try to discover things on my own.

The below would seem to tackle your problem from the perspective of a file slurped
into a variable which is processed. All relavent delimeters are taken into acccount,
my other penchant is for parsing. It is possible to buffer line by line file info
until we just have enough to parse. I didn't do it of course but it is fairly easy.
This would aviod sucking up huge amounts of memory, and is fairly trivial once the
master regexp is known.

I've learned some stuff about the regexp engine's extended operations. I won't go into it.
I decided to include the progression of guesses that went into settling on its final form.
Obviously this form does take into account several delimiting factors as well as look-ahead.
Its not fully tested of course, but it passes my initial alpha form that could be
presented to testers.

As it is now, CASE/END are the targets, however, any can be substituted.
Should you like to employ me for extended projects, set up a contact arangement.

Note the code is at the bottom, the output is at the top, in true dyslexic fashion.
Particularly note in the output, how inner to outter matching goes. This is key.

sln

__OUTPUT__

c:\temp>perl misc9.pl

<<<<<<<<<<< Phase1 >>>>>>>>>>>
$1= --------
' case'
$txt= --------
'
case
1 case end
2 case case end end
fricases can erupt even among friends
end'

<<<<<<<<<<< Phase2 >>>>>>>>>>>
$1= --------
''
$txt= --------
'
case
1
2 case case end end
fricases can erupt even among friends
end'

<<<<<<<<<<< Phase3 >>>>>>>>>>>
$1= --------
''
$txt= --------
'
case
1
2 case end
fricases can erupt even among friends
end'

<<<<<<<<<<< Phase4 >>>>>>>>>>>
$1= --------
''
$txt= --------
'
case
1
2
fricases can erupt even among friends
end'

<<<<<<<<<<< Phase5 >>>>>>>>>>>
$1= --------
'
1
2
fricases can erupt even among friends'
$txt= --------
'
1
2
fricases can erupt even among friends'

************************
FINAL:
'
1
2
fricases can erupt even among friends'

c:\temp>

__CODE__

use strict;
use warnings;

my $txt = join '', <DATA>;

{
# while ($txt =~ s/(?:\s+|^)case(?=\s)(.*)(?!case)(?<=\s)end(?:\s+|$)/$1/is) {} <- sick

# while ($txt =~ s/(?:\s+)case(?=\s)(.*)(?!case)(?<=\s)end(?:\s+)/$1/is) { print "--------\n'$1'\n"} <- disgusting

# while ($txt =~ s/(?:\s+)case(?=\s)(.(?!case)*?)(?<=\s)end(?:\s+)/$1/is) { print "--------\n'$1'\n"} <- putrid

# while ($txt =~ s/(?:\s+)case(?=\s)((?<!case).*?)(?<=\s)end(?:\s+)/$1/is) { print "--------\n'$1'\n"} <- DOA

# while ($txt =~ s/\s+case\s+(.*(?!case))\s+end\s+/ $1 /is) <- what's this?

# while ($txt =~ s/\s+case\s+((.(?!case))*?)end\s+/ $1 /is) <- almost

# while ($txt =~ s/\s+case\s+((.(?!\scase\s))*?)\s+end\s+/ $1 /is) <- better

# while ($txt =~ s/\s+case((.(?!\scase\s))*?)\s+end\s+/ $1 /is) <- more better

# while ($txt =~ s/\s+case((.(?!\scase\s))*?)\s+end\s+/ $1/is) <- hmmm

# while ($txt =~ s/\s+case((.(?!\scase\s))*?)\s+end(\s+)/ $1 /is) <- confused

# while ($txt =~ s/\s+case((?:.(?!\scase\s))*?)\s+end(\s+)/$1$2/is) <- approaching excellence

# while ($txt =~ s/\s+case((?:.(?!\scase\s))*?)\s+end(\s+)/$1$2/is) <- excellence

# while ($txt =~ s/(?:\s+|^)case((?:.(?!\scase\s))*?)\s+end(\s+|$)/$1$2/is) <- PRIMO !!!!

my $cntr = 1;

while ($txt =~ s/(?:\s+|^)case((?:.(?!\scase\s))*?)\s+end(\s+|$)/$1$2/is) # <- Production Regex, Ship to QA
{
print "\n<<<<<<<<<<< Phase".$cntr++." >>>>>>>>>>>\n";
print "\$1= --------\n'$1'\n";
print "\$txt= --------\n'$txt'\n";
}
print "\n\n************************\n FINAL:\n'$txt'\n";
}

__DATA__

case
1 case case end end
2 case case end end
fricases can erupt even among friends
end

sln · Jan 21, 2009

I am working on a SQL parser. I have a routine that recursively removes
enclosing parentheses and it works fine. Below is the regex that I use.

However, I want to use the same routine, but instead of looking for
enclosing parens, I want to look for a string enclosed by CASE and END. Can
someone help me translate the regex below so that it will match a CASE/END
construct?

Thanks very much.

Parens
----------
(?:\s+)?$[^\($]*\)

This is what I've managed so far with the CASE/END

(?:\s+)?case(?!case|end)\s+end

Click to expand...

[snip explanation]

use strict;
use warnings;

my $txt = join '', <DATA>;

{
my $cntr = 1;

while ($txt =~ s/(?:\s+|^)case((?:.(?!\scase\s))*?)\s+end(\s+|$)/$1$2/is) # <- Production Regex, Ship to QA
{
print "\n<<<<<<<<<<< Phase".$cntr++." >>>>>>>>>>>\n";
print "\$1= --------\n'$1'\n";
print "\$txt= --------\n'$txt'\n";
}
print "\n\n************************\n FINAL:\n'$txt'\n";
}

__DATA__

case
1 case case end end
2 case case end end
fricases can erupt even among friends
end

The regex needed a look-ahead for '\s', without it is's a bug.
///g was added to reduce passes, equals the depth of nesting now.
No more posts for a while. See ya later.

sln

-------------------------------------------------

use strict;
use warnings;

my $txt = join '', <DATA>;
my $cntr = 1;

while ($txt =~ s/(\s|^)case(?=\s)((?

?!\scase\s).)*?\s)end(\s|$)/$1$2$3/isg)
{
print "\n<<<<<<<<<<< Phase".$cntr++." >>>>>>>>>>>\n";
print "\$txt= --------\n'$txt'\n";
}
print "\n\n************************\n FINAL:\n'$txt'\n";

__DATA__
case First Line
1 case line case end spacing end
2 case case end end
3 case case END end
fricases can erupt even among friends
end

case can erupt even among end

Did you know that there is a match-case function in python?	4	Dec 17, 2023
FAQ 6.24 How do I match a regular expression that's in a variable?	0	Apr 19, 2011
simple_html_dom: simple use-case - getting a scipt to work	0	Mar 2, 2020
Simple database front-end for simple small business	7	Apr 17, 2021
Regex: match double OR single quote	4	Jul 12, 2012
Issue with textbox script?	0	Sep 5, 2022
Regex to match a numerical IP range	7	Dec 11, 2010
Recursively Removing Embedded Quotes	2	Jul 17, 2007

Match CASE/END SQL Construct

Perry Aynum

sln

Jim Gibson

Tad J McClellan

Tad J McClellan

Tad J McClellan

sln

sln

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads