Reading text file

Kevin B · Oct 16, 2003

I have the following short script that I'm using to clean up the source of a
web page in order to index and search the page:

#!/usr/bin/perl
#striphtml.pl

undef $/;
open FD, "< testfile1.txt" or die $!;

while (<FD>) {
#s/\r\n//gs;

#s/^\s+$//;
s/<.*?>//gs;
trim();
print "$_";
}

sub trim {

my @out = @_ ? @_ : $_;
$_ = join(' ', split(' ')) for @out;
return wantarray ? @out : "@out";
}

the problem is that it leaves blank lines in the output and the use of chomp
does not clean up. What am I missing to clean up the lines?

Kevin

Roy Johnson · Oct 16, 2003

This newsgroup is defunct. You will reach more people if you post in
comp.lang.perl.misc instead.

Kevin B said:
undef $/;

Ok, you're slurping the whole file in at once...

open FD, "< testfile1.txt" or die $!;

while (<FD>) {

No real point in a while, if you're getting the whole file in one
read. Just do

$_ = said:
s/<.*?>//gs;

strip out all the tags...

print "$_";

No need for the quotes. In this case, no need for an argument at all.
Just
print;

the problem is that it leaves blank lines in the output and the use of chomp
does not clean up. What am I missing to clean up the lines?

Maybe something like
tr/\n//s;
or
s/\n\s*\n/\n/g;
?

Php combine identical lines in text file	4	Oct 11, 2023
Remove blank lines from text file	7	Sep 10, 2005
Appropriate technique for altering a text file?	19	Aug 13, 2010
Fields won't add when reading a text file	4	Apr 30, 2006
cgi simple script in c to search text file	15	Mar 4, 2013
How to avoid searching this folder?	11	Mar 25, 2011
Reading and writing to a file creates null characters	3	Jan 12, 2012
Search/Replace text in XML file	4	Jan 9, 2008

Reading text file

Kevin B

Roy Johnson

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads