Matching neighbouring words of a pattern using Regex

CV · Aug 30, 2004

How can I match 'n' number of neighbouring words of a pattern using regular
expressions?

For example, suppose I am looking for the pattern "length xyz cm" in some
text. where xyz is a number - integer or fraction or decimal point. How can
I also grab about 3-5 words on either side of the pattern "length xyz cm"?
The surrounding words are not always constant & may be variable. Also, the
original text to be matched is not just a single sentence, but lines from a
file concatenated together - so the text has many newline characters too. I
only want the words on the same line as the pattern.

I have tried using regex of the form
/\b(\w*)\b(\w*)\b(\w*)\b($pattern)\b(\w*)\b(\w*)\b(\w*), but this doesn't
work for some reason. Could someone please offer some suggestions?

thanks!

Gunnar Hjalmarsson · Aug 30, 2004

CV said:
How can I match 'n' number of neighbouring words of a pattern using
regular expressions?

This group is defunct. See reply in comp.lang.perl.misc.

Charles DeRykus · Aug 31, 2004

How can I match 'n' number of neighbouring words of a pattern using regular
expressions?

For example, suppose I am looking for the pattern "length xyz cm" in some
text. where xyz is a number - integer or fraction or decimal point. How can
I also grab about 3-5 words on either side of the pattern "length xyz cm"?
The surrounding words are not always constant & may be variable. Also, the
original text to be matched is not just a single sentence, but lines from a
file concatenated together - so the text has many newline characters too. I
only want the words on the same line as the pattern.

I have tried using regex of the form
/\b(\w*)\b(\w*)\b(\w*)\b($pattern)\b(\w*)\b(\w*)\b(\w*), but this doesn't
work for some reason. Could someone please offer some suggestions?

You may be confused about the \b assertion. Did you intend for
something with \w and \W..? Also, what if the pattern falls
at the beginning or end of the line... do you want to capture
the patterns that may not have 3-5 surrounding words?

One possibility presuming you intend to capture 3-5 surrounding
words:

my $text = "...";
my $pattern = 'length ... cm ';

my $words = '(?:\w+[^\w\n]+){3,5}';
#my $words = '(?:\w+[^\w\n]+){0,5}'; # to catch every pattern

print $1 while /($words$pattern$words)/g;

[ Note the 3-5 surrounding words may consume another
adjacent $pattern instance but you don't specify what
to do in that case. }

hth,

Regex: deleting non-matching words	3	Aug 22, 2010
Regex not matching a string	2	Jan 9, 2013
My regex kung-fu is not strong =(	0	Apr 4, 2020
pattern matching and abstract functions	12	Mar 29, 2011
regex select multiple words in the middle of a sentence	10	Apr 7, 2009
Tips Re Pattern Matching / REGEX	1	Mar 27, 2008
Simple pattern-matching in a functional way	12	May 14, 2010
Pattern matching Question	2	Dec 4, 2008

Matching neighbouring words of a pattern using Regex

CV

Gunnar Hjalmarsson

Charles DeRykus

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads