Finding consecutive lines

Peter · Nov 5, 2004

I need to parse a file looking for patterns in 5 consecutive lines.
For example, I want to find 5 consecutive lines with "abc" in the first line,
"def" in the second, "efg" in the third, etc.

How can I do something like this?

Thanks in advance,
Peter

Tad McClellan · Nov 5, 2004

Peter said:
I want to find 5 consecutive lines with "abc" in first line, "def" in second,
"efg" in third etc.

How can I do something like this..

Buffer the 4 previous lines.

Brad Baxter · Nov 5, 2004

Peter said:
I need to parse a file looking for patterns in 5 consecutive lines.
e.g.

I want to find 5 consecutive lines with "abc" in first line, "def" in second,
"efg" in third etc.

How can I do something like this..

Thanks in advance,
Peter

What have you tried so far?

Regards,

Brad

#!/usr/bin/perl
use warnings;
use strict;

my @f = qw( abc def ghi jkl mno );
my @s;
while( <DATA> ) {
    push @s, $_;
    shift @s if @s > 5;
    next unless @s == 5;
    print " @s\n" if
        $s[0] =~ /$f[0]/ and
        $s[1] =~ /$f[1]/ and
        $s[2] =~ /$f[2]/ and
        $s[3] =~ /$f[3]/ and
        $s[4] =~ /$f[4]/;
}
__DATA__

hey abc
there def abc
june ghi def abc
bug jkl ghi def abc
lets mno jkl ghi def abc
get mno jkl ghi def
in mno jkl ghi
the mno jkl
mud mno

__END__
hey abc
there def abc
june ghi def abc
bug jkl ghi def abc
lets mno jkl ghi def abc

there def abc
june ghi def abc
bug jkl ghi def abc
lets mno jkl ghi def abc
get mno jkl ghi def

june ghi def abc
bug jkl ghi def abc
lets mno jkl ghi def abc
get mno jkl ghi def
in mno jkl ghi

bug jkl ghi def abc
lets mno jkl ghi def abc
get mno jkl ghi def
in mno jkl ghi
the mno jkl

lets mno jkl ghi def abc
get mno jkl ghi def
in mno jkl ghi
the mno jkl
mud mno

David K. Wall · Nov 5, 2004

Brad Baxter said:
I need to parse a file looking for patterns in 5 consecutive
lines. e.g.

I want to find 5 consecutive lines with "abc" in first line,
"def" in second, "efg" in third etc.

[snip]

#!/usr/bin/perl
use warnings;
use strict;

my @f = qw( abc def ghi jkl mno );
my @s;
while( <DATA> ) {
    push @s, $_;
    shift @s if @s > 5;
    next unless @s == 5;
    print " @s\n" if
        $s[0] =~ /$f[0]/ and
        $s[1] =~ /$f[1]/ and
        $s[2] =~ /$f[2]/ and
        $s[3] =~ /$f[3]/ and
        $s[4] =~ /$f[4]/;
}

I like this a little better because it doesn't require maintenance if
the number of patterns changes.

#!/usr/bin/perl
use strict;
use warnings;

my @buffer;
my @pattern = qw(abc def ghi jkl mno);
while (<DATA>) {
    push @buffer, $_;
    next if @buffer != @pattern;
    my $matches = grep /1/,
                  map $buffer[$_] =~ /$pattern[$_]/,
                  0 .. $#pattern;
    print @buffer if $matches == @pattern;
    shift @buffer;
}

__DATA__
jdfk jkdfh bl
dfjkv dfjk
dfjdj
abc these
def are
ghi the
jkl lines
mno we want
hj dfvhj d
jkfh vblsdjk
jdkf hld
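
For what it's worth, the counting step above relies on the match returning 1 in list context, which stops holding as soon as a pattern contains capturing parentheses. A grep block on its own would probably do the same job. Below is a minimal sketch of that idea, not code from the thread: the helper name find_runs() and its calling convention are made up for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): return every run of
# consecutive lines in which line i matches pattern i.
sub find_runs {
    my ($patterns, @lines) = @_;
    my (@buffer, @runs);
    for my $line (@lines) {
        push @buffer, $line;
        next if @buffer < @$patterns;    # window not full yet
        # In scalar context, grep with a block returns the number of
        # indices for which the buffered line matches its pattern.
        my $matches = grep { $buffer[$_] =~ /$patterns->[$_]/ } 0 .. $#$patterns;
        push @runs, [@buffer] if $matches == @$patterns;
        shift @buffer;                   # slide the window by one line
    }
    return @runs;
}

my @pattern = qw(abc def ghi jkl mno);
my @lines   = ("dfjdj\n", "abc these\n", "def are\n", "ghi the\n",
               "jkl lines\n", "mno we want\n", "hj dfvhj d\n");
print @$_ for find_runs(\@pattern, @lines);
```

Each run comes back as an array reference, so the caller can print it, count it, or process it further. Like the original, this still tries every pattern for every full window.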

ctcgag · Nov 5, 2004

Peter said:
I need to parse a file looking for patterns in 5 consecutive lines.
e.g.

I want to find 5 consecutive lines with "abc" in first line, "def" in
second, "efg" in third etc.

How can I do something like this..

If your file fits in memory:

warn "Untested";
$file =~ /^(.*abc.*\n.*def.*\n.*efg.*)$/m;

Of course, you presumably want to do something with these lines, but you
didn't say what that was.

Xho
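
A sketch of how the slurp-and-match idea above might be extended to any number of patterns, assuming the file fits in memory. The helper find_blocks() is hypothetical, and the regex is built from the pattern list rather than written out by hand.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: build one multi-line regex from the pattern list
# and return every block of consecutive matching lines found in $text.
sub find_blocks {
    my ($text, @patterns) = @_;
    # Without /s, "." never matches a newline, so each ".*PATTERN.*"
    # stays within a single line; "\n" separates consecutive lines.
    my $body = join "\\n", map { ".*$_.*" } @patterns;
    my $re   = qr/^($body)$/m;
    # In list context, /g returns the captured text of every match.
    return $text =~ /$re/g;
}

# For a real file the usual slurp idiom would be:
#   my $file = do { local $/; <$fh> };
my $file = join '', map { "$_\n" }
    'noise', 'abc these', 'def are', 'ghi the',
    'jkl lines', 'mno we want', 'more noise';

print "$_\n" for find_blocks($file, qw(abc def ghi jkl mno));
```

One caveat: /g resumes after the end of the previous match, so overlapping blocks (as in Brad's test data) would not all be reported. The buffer-based versions elsewhere in the thread do not have that limitation.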

Brad Baxter · Nov 5, 2004

David K. Wall said:

I like this a little better because it doesn't require maintenance if
the number of patterns changes.

#!/usr/bin/perl
use strict;
use warnings;

my @buffer;
my @pattern = qw(abc def ghi jkl mno);
while (<DATA>) {
    push @buffer, $_;
    next if @buffer != @pattern;
    my $matches = grep /1/,
                  map $buffer[$_] =~ /$pattern[$_]/,
                  0 .. $#pattern;
    print @buffer if $matches == @pattern;
    shift @buffer;
}

I agree that's better. I see you also optimized away an unnecessary
comparison.

Regards,

Brad

Michele Dondi · Nov 5, 2004

Peter said:
I want to find 5 consecutive lines with "abc" in first line, "def" in second,
"efg" in third etc.

How can I do something like this..

Slurp the whole file in and use a regex! If the whole file is huge,
then maintain a buffer and join lines five at a time as you read new
ones.

Michele
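
The second suggestion (keep a buffer and join the lines as new ones arrive) could look roughly like the sketch below. It is only one way to read that advice: scan_joined() is a hypothetical name, and the demo reads from an in-memory filehandle rather than a real file.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: slide a window of scalar(@patterns) lines over the
# filehandle, join the window, and test it with a single anchored regex.
sub scan_joined {
    my ($fh, @patterns) = @_;
    my $body = join "\\n", map { ".*$_.*" } @patterns;
    my $re   = qr/\A$body\n?\z/;    # the whole window must match
    my (@window, @hits);
    while (my $line = <$fh>) {
        push @window, $line;
        shift @window if @window > @patterns;   # keep only the newest lines
        next if @window < @patterns;            # window not full yet
        my $joined = join '', @window;
        push @hits, $joined if $joined =~ $re;
    }
    return @hits;
}

# Demo on an in-memory filehandle; a real file is opened the same way
# with a filename in place of the scalar reference.
my $data = "noise\nabc 1\ndef 2\nghi 3\nmore noise\n";
open my $fh, '<', \$data or die "Cannot open in-memory file: $!";
print for scan_joined($fh, qw(abc def ghi));
close $fh;
```

Only as many lines as there are patterns are held in memory at once, and because the window advances one line at a time, overlapping matches are found as well.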

Michele Dondi · Nov 6, 2004

David K. Wall said:
#!/usr/bin/perl
use strict;
use warnings;

my @buffer;
my @pattern = qw(abc def ghi jkl mno);
while (<DATA>) {
    push @buffer, $_;
    next if @buffer != @pattern;

Your scheme is terse and elegant in that it fills the buffer as needed.
Still, even though I'm not one of those who are paranoid about
efficiency, it bothers me a little that the check is done for every
line, when only the first few lines actually need it. I'd pre-load the
buffer instead.

    my $matches = grep /1/,
                  map $buffer[$_] =~ /$pattern[$_]/,
                  0 .. $#pattern;

Well, I'm a big fan of map() and grep(), but using them here seems like
unnecessary overhead to me. What about a simple counter instead? Another
source of inefficiency is that all patterns are tried even after one of
them has failed, thus...

    print @buffer if $matches == @pattern;
    shift @buffer;
}

...all in all I'd rewrite it as

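#!/usr/bin/perl

use strict;
use warnings;

my @pattern = qw(abc def ghi jkl mno);
# Pre-load the buffer: a placeholder 0 (shifted off on the first pass)
# followed by the first $#pattern lines of input.
my @buffer = (0, map scalar <>, 1..$#pattern);

LINE: while (<>) {
    shift @buffer; push @buffer, $_;    # slide the window by one line
    # Skip to the next input line at the first pattern that fails.
    $buffer[$_] =~ /$pattern[$_]/ or
        next LINE for 0..$#pattern;
    print @buffer;
}

__END__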

Michele
