Concatenating selective lines from a bunch of files into a singlefile

Rider · Jul 25, 2009

Hi experts,

I have a unix dir, test and it has 100 files. I need to extract all
the lines that contain a word "keyword" key word" (case insensitive)
from each of those files and put all those lines into another file,
concatenate.txt. Similarly, I want all the lines that don't contain
that key word (keyword or key word) into another file,
non_concatenate.txt.

I know a little bit of perl and I am wondering how can I do it here.

Thanks,
J

Rider · Jul 25, 2009

Hi experts,

I have a unix dir, test and it has 100 files. I need to extract all
the lines that contain a word "keyword" key word" (case insensitive)
from each of those files and put all those lines into another file,
concatenate.txt. Similarly, I want all the lines that don't contain
that key word (keyword or key word) into another file,
non_concatenate.txt.

I know a little bit of perl and I am wondering how can I do it here.

Thanks,
J

What is the easiest way? To use File::Find module and then use grep,
join commands?

Thanks in advance,
J

Jim Gibson · Jul 25, 2009

Rider said:
What is the easiest way? To use File::Find module and then use grep,
join commands?

The easiest way would be call your Perl program with all of the file
names on the command-line:

program.pl /test/*

Then use the construct

while(<>) {
...
}

to read all of the lines in all of the files. Open two files before you
enter the while loop:

open(my $yes, '>', 'concatenate.txt') or die(...);
open(my $no, '>', 'non_concatenate.txt') or die(...);

Then in the body of the while loop, use the match operator with a
regular expression (case insensitive):

if( /keyword/i ) {
print $yes;
}else{
print $no;
}

If your shell complains about the length of the command-line, you can
use opendir and readdir to fetch the file names in your directory, then
open each file for reading individually.

sln · Jul 25, 2009

Hi experts,

I have a unix dir, test and it has 100 files. I need to extract all
the lines that contain a word "keyword" key word" (case insensitive)
from each of those files and put all those lines into another file,
concatenate.txt. Similarly, I want all the lines that don't contain
that key word (keyword or key word) into another file,
non_concatenate.txt.

I know a little bit of perl and I am wondering how can I do it here.

Thanks,
J

I am curious as to what end this de-interlace achieves?
There seems to be no pragmatic function to it.
Its fine if its just an exercise. Usually though, a real world
effort will reap more rewards to you personally.

Good luck!
-sln

Jürgen Exner · Jul 25, 2009

Rider said:
I have a unix dir, test and it has 100 files.

opendir() and readdir() or glob().

I need to extract all
the lines that contain a word "keyword" key word" (case insensitive)
from each of those files

loop through all those files who's names you gathered in step 1,
open() each, read it line by line and depening upon if the line matches
(m//), then

and put all those lines into another file,
concatenate.txt.

print() that line to the target file, which you open()ed earlier

Similarly, I want all the lines that don't contain
that key word (keyword or key word) into another file,
non_concatenate.txt.

or the other target file, which you also open()ed earlier.

jue

Martijn Lievaart · Jul 26, 2009

Hi experts,

I have a unix dir, test and it has 100 files. I need to extract all the
lines that contain a word "keyword" key word" (case insensitive) from
each of those files and put all those lines into another file,
concatenate.txt. Similarly, I want all the lines that don't contain that
key word (keyword or key word) into another file, non_concatenate.txt.

I know a little bit of perl and I am wondering how can I do it here.

I would not do this in perl, use the right tool for the job!

#!/bin/sh
grep -i "keyword" /path/to/test/* >/path/concatenate.txt
grep -iv "keyword" /path/to/test/* >/path/non_concatenate.txt

HTH,
M4

Charlton Wilbur · Jul 26, 2009

R> Hi experts, I have a unix dir, test and it has 100 files. I need
R> to extract all the lines that contain a word "keyword" key word"
R> (case insensitive) from each of those files and put all those
R> lines into another file, concatenate.txt. Similarly, I want all
R> the lines that don't contain that key word (keyword or key word)
R> into another file, non_concatenate.txt.

R> I know a little bit of perl and I am wondering how can I do it
R> here.

The simplest thing would be to forgo Perl altogether and use the Unix
grep utility.

Charlton

Looking for a tool to turn hundreds of MSG contacts into VCF files for CRM import.	0	Aug 5, 2025
Help with importing from multiple files and printing lines in designated spot to spit out one file.	1	Jan 16, 2023
Can't get .vcxproj files out of git	1	Nov 19, 2024
Find and count strings of text from multiple files	17	Dec 16, 2021
How do I save information from an GUI into a XML-file?	0	Aug 17, 2022
Word matching with specific parameters	1	Jan 26, 2025
Turning lines of a file into array?	10	May 4, 2013
I need help in understanding these files on my phone, Could someone help me understand these files? Urgent help needed. Please help.	3	Jun 4, 2023

Concatenating selective lines from a bunch of files into a singlefile

Rider

Rider

Jim Gibson

sln

Jürgen Exner

Martijn Lievaart

Charlton Wilbur

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads